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Essential  Astrophysics  is  a  book  to  learn  or  teach  from,  as  well  as  a  fundamental 
reference  for  anyone  interested  in  astronomy  and  astrophysics.  This  unique  volume 
can  be  used  as  a  textbook,  teaching  guide,  or  reference  source  for  just  about 
anyone  interested  in  astronomy  and  astrophysics. 

It  serves  as  a  comprehensive,  introductory  text,  which  takes  the  student  through 
the  field  of  astrophysics  in  lecture-sized  chapters  of  basic  physical  principles 
applied  to  the  cosmos.  Undergraduate  students  with  an  interest  in  the  physical 
sciences,  such  as  astronomy,  chemistry,  engineering,  or  physics,  will  enjoy  this 
one- semester  overview. 

The  text  is  of  sufficient  breadth  and  depth  to  prepare  the  interested  student  for 
more  advanced,  specialized  courses  in  the  future.  The  clarity  and  comprehensive 
nature  of  Essential  Astrophysics  make  it  a  significant  resource  for  the  curious 
reader  that  is  unfamiliar  with  astrophysics  or  for  professional  astronomers  who 
may  have  forgotten  the  basics. 

Astronomical  examples  are  provided  throughout  the  text,  to  reinforce  the  basic 
concepts  and  physics,  and  to  demonstrate  the  use  of  the  relevant  formulae.  In  this 
way,  the  student  learns  to  apply  the  fundamental  equations  and  principles  to 
cosmic  objects  and  situations.  All  of  the  example  problems  are  solved  with  the 
rough  accuracy  needed  to  portray  the  basic  result.  Such  order-of-magnitude  esti¬ 
mates  are  commonly  used  in  astronomy  and  astrophysics,  where  large  numbers  are 
involved,  and  an  understanding  of  the  underlying  physics  does  not  require  engi¬ 
neering  accuracy. 

Essential  Astrophysics  is  a  serious  introduction  to  astrophysics  complete  with 
the  necessary  formulae.  These  equations  sometimes  include  the  calculus  of  inte¬ 
gration,  or  adding  up,  and  differentiation,  that  are  found  in  the  author’s  classic 
Astrophysical  Formulae  and  more  advanced  textbooks.  Nevertheless,  the  end 
result  in  Essential  Astrophysics  is  always  a  simple  algebraic  relationship  that  can 
be  applied  to  cosmic  objects.  These  fundamental  equations  are  given  in  the  text 
and  collected  at  the  end  of  the  book  in  Appendix  III,  for  future  reference  and  use. 
Therefore,  only  elementary  algebra  is  required  to  solve  any  of  the  example 
problems  or  other  numerical  conclusions  in  this  book. 
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There  are  two  types  of  intended  readers.  One  type  will  be  interested  in  broad, 
general  conclusions,  without  use  of  calculus.  This  reader  will  be  content  with  the 
existing  text  with  no  further  elaboration.  The  more  mathematically  competent 
reader  will  want  to  use  Essential  Astrophysics  as  a  foundation  for  more  advanced 
considerations,  with  the  guidance  of  the  references,  an  instructor,  or  an  advanced 
textbook,  using  the  formulae  found  in  the  text  or  within  set  aside  Focus  Elements 
of  Essential  Astrophysics  as  a  starting  point. 

The  modern  SI  (International  System)  units  are  used  in  the  equations  and 
example  problems,  which  is  another  unique  aspect  of  this  book  when  compared  to 
most  previous  texts  of  astrophysics.  A  conversion  table  between  the  SI  and  c.g.s. 
units  is  provided  in  the  first  chapter,  to  help  the  reader  follow  the  details  of  many 
papers  and  textbooks  that  use  the  older  c.g.s.  system.  Astronomical  and  physical 
constants,  units,  and  fundamental  equations  are  provided  in  appendices,  for  quick 
reference. 

Essential  Astrophysics  goes  beyond  the  typical  textbook  by  providing  com¬ 
prehensive  access  to  astrophysical  discoveries,  concepts,  and  facts  that  are  not 
available  in  any  other  way.  It  gives  us  access  to  that  long-forgotten  formula,  idea, 
or  reference,  while  also  providing  the  material  needed  to  introduce  anyone  to  a 
new  area  of  astrophysics.  Here,  the  reader  can  obtain  the  background  required  for  a 
general  understanding  and  find  guidance  to  the  relevant  literature  including  sem¬ 
inal  discoveries,  original  research,  and  comprehensive  up-to-date  reviews  that  will 
enable  the  curious  reader  to  delve  deeper  into  a  particular  topic.  A  more  extensive 
reference  compilation  of  developments  in  astrophysics,  from  then  to  now,  can  be 
found  in  Astrophysical  Formulae. 

We  are  the  benefactors  of  300  years  of  cumulative  discovery  in  astronomy  and 
astrophysics,  and  Essential  Astrophysics  helps  pass  on  these  fundamental  insights 
to  the  next  generation.  It  also  reveals  both  the  exciting  moments  of  the  past  and 
relatively  recent  discoveries.  Historical  aspects  are  illuminated  through  a  pro¬ 
gressive  flow  of  chapter  topics  and  by  guidance  to  the  earliest  ideas,  with  reference 
to  the  original  sources  as  well  as  contemporary  reviews.  Perhaps  because  of  the 
rapid  pace  of  modern  research,  contemporary  texts  often  focus  on  specialized 
topics  and  overlook  these  broader  perspectives  that  Essential  Astrophysics 
provides. 

There  are  50  set-aside  focus  elements  that  enhance  and  amplify  the  discussion 
with  fascinating  details.  They  include  the  intriguing  development  of  particular 
themes,  which  is  missing  in  most  astrophysics  textbooks,  or  provide  further 
astrophysics  or  equations  for  use  in  examples,  problems  or  further  investigations. 

In  Essential  Astrophysics  we  can  rediscover  basic  physical  concepts  such  as 
space,  time,  radiation,  mass,  gravity,  motion,  heat,  atoms,  radioactivity,  and  cos¬ 
mic  rays,  which  are  required  to  understand  the  observable  universe.  These  fun¬ 
damental  topics  are  discussed  in  the  first  seven  chapters,  beginning  with  the 
introductory  chapter  that  describes  how  astronomers  observe  the  contents  of  the 
universe  and  how  astrophysicists  interpret  them.  The  SI  units  of  distance,  mass, 
time,  energy,  and  luminosity  are  introduced,  together  with  their  astronomical  units 
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such  as  the  Angstrom,  light-year,  parsec,  and  the  Sun’s  mass,  luminosity,  and 
radius.  The  magnitude  unit  is  also  defined,  but  used  sparingly  in  examples. 

Chapter  2  describes  radiation,  of  both  the  visible  and  invisible  sort,  which 
carries  messages  from  the  cosmos  and  tells  us  much  of  what  we  know  about  it. 
Chapter  3  discusses  gravity,  together  with  mass  that  helps  determine  its  strength, 
and  related  tidal  phenomena  and  space  curvature.  Chapter  4  discusses  cosmic 
motion,  and  its  balanced  equilibrium  with  gravitation.  Chapter  5  discusses  the 
motion  of  particles  in  a  gas,  together  with  the  related  concepts  of  speed  distri¬ 
bution,  heat,  temperature,  and  pressure.  The  inside  of  the  atom  is  explored  in 
Chap.  6,  where  the  reader  learns  about  atomic  spectral  lines  and  their  use  in 
determining  the  composition  of  stars  and  the  measurement  of  motions  and  mag¬ 
netic  fields.  The  transformation  of  elements  in  both  radioactivity  and  by  sub¬ 
atomic  bombardment  is  presented  in  Chap.  7. 

The  fundamental  concepts  described  in  these  first  seven  chapters  provide  a 
necessary  prelude  to  the  rest  of  the  book.  It  includes  the  discoveries  that  the 
universe  is  predominantly  hydrogen;  that  the  stars  shine  by  nuclear  fusion;  that  the 
stars  live  and  die  while  new  ones  continue  to  be  formed;  that  the  interstellar  spaces 
are  not  empty  but  filled  with  dust,  atoms,  and  molecules;  and  that  the  observable 
universe  is  expanding  and  has  a  history.  The  last  half  of  Essential  Astrophysics 
also  includes  relatively  recent  discoveries,  such  as  pulsars,  black  holes,  the  three- 
degree  cosmic  microwave  background,  the  formation  of  stars  and  galaxies, 
invisible  dark  matter,  and  the  dark  energy  that  is  now  accelerating  the  expansion  of 
the  universe. 

Chapter  8  provides  an  account  of  the  nuclear  fusion  reactions  that  make  the  Sun 
shine.  This  is  followed  in  Chap.  9  by  modern  discoveries  of  the  Sun’s  expanding 
atmosphere,  the  solar  winds,  explosions  on  the  Sun,  the  solar  flares  and  coronal 
mass  ejections,  and  their  space-weather  threats  to  spacecraft  and  humans  in  space. 

Chapter  10  presents  an  overview  of  the  stars,  telling  us  how  far  away,  bright, 
luminous,  hot,  big,  and  massive  they  are.  It  also  includes  discussions  of  stellar 
spectra,  as  well  as  the  evolution  of  stars  and  their  role  in  the  origin  of  the  chemical 
elements. 

The  space  between  the  stars  is  discussed  in  Chap.  11,  beginning  with  bright 
stars  that  illuminate  nearby  space  and  continuing  with  the  dust,  gas,  radio  emis¬ 
sion,  and  molecules  within  interstellar  space.  This  is  naturally  followed  in 
Chap.  12  by  the  ongoing  formation  of  stars  and  their  planets;  recent  discoveries  of 
protoplanetary  disks  and  planets  around  nearby  stars  can  also  be  found  in  this 
chapter. 

The  final  destiny  of  stars,  when  they  have  depleted  their  nuclear  resources,  is 
presented  in  Chap.  13.  It  includes  planetary  nebulae,  white  dwarf  stars,  degenerate 
pressure,  novae,  supernovae,  neutron  stars,  pulsars,  and  stellar  black  holes. 

Our  last  two  chapters  discuss  the  observable  universe  in  its  entirety,  including 
the  Milky  Way,  the  receding  galaxies,  the  big  bang  with  its  background  radiation, 
the  first  atoms,  stars,  and  galaxies,  the  evolution  of  galaxies,  dark  matter  and  dark 
energy,  and  the  ultimate  destiny  of  the  universe. 
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A  total  of  69  tables  provide  vital  facts  and  physical  information  for  the  main 
types  of  cosmic  objects;  students,  teachers,  and  researchers  may  also  consult  this 
information  throughout  their  careers.  In  alphabetical  order,  they  include  the 
physical  properties  of  atmospheres,  clusters  of  galaxies,  the  cosmic  microwave 
background  radiation,  the  Earth,  emission  nebulae,  galaxies,  our  Galaxy,  giant 
molecular  clouds,  H  I  regions,  H  II  regions,  interstellar  molecules,  the  Milky  Way, 
our  Moon,  neutron  stars,  novae,  planetary  nebulae,  planets,  pulsars,  radioactive 
isotopes,  the  Sun,  stars,  star  clusters,  supernova  explosions,  and  supernova 
remnants. 

Our  tables  also  include  information  about  cosmic  magnetic  fields,  cosmic  rays, 
cosmological  parameters,  and  nuclear  fusion  processes,  as  well  as  the  range  of 
cosmic  pressures,  cosmic  temperatures  and  stellar  luminosity,  and  the  spectral 
lines  of  active  galaxies,  emission  nebulae,  stars,  the  Sun’s  corona,  and  the  Sun’s 
photosphere. 

There  are  also  excellent  line  drawings,  prepared  by  Kacha  Bradonjich,  and 
several  images  of  astronomical  objects  from  the  ground  and  space  that  help  cement 
our  newfound  knowledge  together.  They  help  crystallize  a  new  concept  with  a 
visual  excitement  that  adds  another  dimension  to  our  understanding. 

The  author  also  writes  another  sort  of  popular  book,  filled  with  personal 
anecdotes,  the  lives  of  contributors  to  the  field,  and  human  metaphors,  without  an 
equation  or  reference  in  sight.  For  this  complementary  approach,  the  reader  is 
referred  to  the  author’ s  two  books  The  Life  and  Death  of  Stars  and  Parting  the 
Cosmic  Veil ,  which  deal  with  many  of  the  same  general  topics  as  Essential 
Astrophysics  in  a  different,  lighter  perspective. 

I  am  indebted  to  Gayle  Grant  for  help  in  assembling  this  book,  and  to  the  Tufts 
Faculty  Research  Committee  for  modest  support  for  typing  some  equations  in  it. 
And  last,  but  not  least,  the  author  thanks  Ramon  Khanna  for  his  skillful  editorial 
suggestions  that  have  made  Essential  Astrophysics  a  better  book. 


Medford,  November  2012 


Kenneth  R.  Fang 
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Chapter  1 

Observing  the  Universe 


1.1  What  Do  Astronomers  and  Astrophysicists  Do? 

Astronomy  is  an  ongoing,  cumulative  science  in  which  astronomers  either  discover 
previously  unseen  constituents  of  the  observable  universe  or  determine  physical 
properties  of  known  ones.  They  measure  the  mass,  luminosity,  distance,  size, 
chemical  composition,  motion,  and  magnetic  fields  of  planets,  stars,  galaxies,  and 
their  surroundings. 

Astrophysicists  apply  the  laws  of  physics  to  celestial  objects  and  events, 
thereby  interpreting  and  explaining  the  astronomical  observations.  They  assume 
that  the  physical  laws  that  apply  on  Earth  are  valid  throughout  the  Cosmos,  but 
often  under  extreme  conditions  that  cannot  be  achieved  on  our  planet.  The  diverse 
aspects  of  physics  used  in  astrophysics  include  radiation  processes  and  universal 
gravitation,  cosmic  and  particle  motion,  atomic  and  nuclear  physics,  and  special 
and  general  relativity. 

Astronomers  and  astrophysicists  together  investigate  how  everything  in  the 
universe  originates,  changes,  interacts,  moves,  and  radiates.  Theoretical  studies, 
analytical  models,  and  numerical  simulations  with  computers  are  also  employed  to 
help  understand  these  processes. 

Astronomy,  and  therefore  astrophysics,  is  an  instrument-driven  science.  Many 
of  the  seminal  discoveries  in  astronomy  have  been  accidental  and  unanticipated, 
often  made  when  using  unique  telescopes,  new  technology,  and  novel  detection 
equipment  (Lang  2009).  These  instruments  extend  our  vision  to  places  that  are  not 
accessible  to  direct  observation,  enabling  us  to  “see”  the  invisible  and  permitting 
us  to  look  at  the  universe  in  new  ways.  Without  a  telescope,  for  example,  the  vast 
majority  of  stars  cannot  be  seen,  and  all  but  a  very  few  of  the  billions  of  galaxies 
and  most  of  the  expanding  universe  are  invisible  to  the  unaided  eye. 

Observations  provide  the  crucial  data  for  our  celestial  science.  Without  them, 
astrophysicists  would  have  nothing  to  describe.  Fortunately,  an  astronomical 
object  can  be  observed  over  and  over  again,  in  different  ways,  once  it  has  been 
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1  Observing  the  Universe 


discovered.  These  observations  require  knowledge  of  our  location  on  the  Earth,  the 
location  of  the  object  in  the  sky,  and  an  understanding  of  both  angular  measure  and 
passing  time. 


1.2  Our  Place  on  Earth 

In  order  to  observe  cosmic  objects  with  any  accuracy,  we  must  first  establish  our 
bearings  here  on  Earth.  In  arguments  used  by  Pythagoras  (572—479  BC),  and 
subsequently  recorded  by  Aristotle  (384—322  BC),  it  was  shown  that  the  Earth  is  a 
sphere.  During  a  lunar  eclipse,  when  the  Moon’s  motion  carries  it  through  the 
Earth’s  shadow,  observers  at  different  locations  invariably  saw  a  curved  shadow  on 
the  Moon.  Only  a  spherical  body  can  cast  a  round  shape  in  all  orientations.  The 
curved  surface  of  the  ocean  was  also  inferred  by  watching  a  ship  disappear  over 
the  horizon;  first  the  hull  and  then  the  mast  disappear  from  view. 

So  we  can,  to  first  approximation,  assume  the  Earth  is  a  sphere,  and  locate 
ourselves  within  a  grid  of  great  circles  on  it.  A  great  circle  divides  the  sphere  in 
half;  the  name  derives  from  the  fact  that  no  greater  circles  can  be  drawn  on  a 
sphere.  A  great  circle  halfway  between  the  North  and  South  Poles  is  called  the 
Equator  because  it  is  equally  distant  between  both  poles. 

Circles  of  longitude  are  great  circles  that  pass  around  the  Earth  from  pole  to 
pole,  perpendicular  to  the  Equator.  Each  circle  of  longitude  intersects  the  equator 
in  two  points  that  are  180°  apart.  We  halve  the  great  circles  of  longitude  into 
semicircles,  called  meridians.  Long  ago,  in  1884,  it  was  decided  that  the  half-circle 


North  Pole 


Fig.  1.1  Latitude  and  longitude  Great  circles  through  the  North  and  South  Poles  of  the  Earth 
create  circles  of  longitude.  They  are  perpendicular  to  the  Equator  where  they  intersect  it.  The 
circle  of  longitude  that  passes  through  Greenwich  England  is  called  the  Prime  Meridian.  The 
longitude  of  any  point,  P,  is  the  angle  lambda,  2,  measured  westward  along  the  Equator  from 
the  intersection  of  the  Prime  Meridian  with  the  Equator  to  the  equatorial  intersection  of  the  circle 
of  longitude  that  passes  through  the  point.  The  latitude  is  the  angle  phi,  (p,  measured  northward 
(positive)  or  southward  (negative)  along  the  circle  of  longitude  from  the  Equator  to  the  point. 
In  this  figure,  the  point  P  corresponds  to  San  Francisco 
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of  longitude  passing  through  the  old  Royal  Observatory  in  Greenwich,  England, 
would  mark  0°  longitude.  It  was  designated  as  the  “Prime  Meridian”,  the  starting 
point  of  counting  longitudes. 

The  longitude,  denoted  by  the  Greek  letter  2,  of  any  point  on  the  Earth’s  surface 
is  the  angle  measured  westward  from  the  intersection  of  the  Prime  Meridian  with 
the  Equator  to  the  equatorial  intersection  of  the  circle  of  longitude  that  passes 
through  the  point  (Fig.  1.1).  The  latitude,  designated  by  the  symbol  0,  is  the  angle 
measured  northward  (positive)  or  southward  (negative)  along  a  circle  of  longitude 
from  the  equator  to  the  point.  Sobel  (1995)  has  discussed  early  determinations  of 
terrestrial  longitude,  whereas  Carter  and  Carter  (2002)  have  provided  a  historical 
account  of  latitude  variations.  Alder  (2002)  has  discussed  early  measurements  of 
the  size  of  the  Earth,  and  the  associated  beginning  of  the  metric  system. 

Example:  Location  and  rotation  speed  on  the  Earth 

The  length  of  the  day  and  the  rotation  period  is  the  same  for  every  place  on 
Earth,  but  the  speed  of  rotation  around  its  axis  depends  on  the  surface 
location.  The  surface  speed  of  rotation  is  greatest  at  the  equator  and  reduces 
to  lower  values  at  higher  latitudes.  Using  an  equatorial  radius  of  about 
6,378  km,  the  Earth  would  have  to  be  rotating  at  a  speed  of  about  464  m  s-1 
to  spin  about  its  equatorial  circumference  once  every  24  h.  To  calculate  this 
speed,  just  multiply  the  equatorial  radius  by  2tz  to  get  the  equatorial  cir¬ 
cumference,  and  divide  by  24  h,  where  there  are  86,400  s  per  hour.  The 
constant  n  =  3.1416.  At  higher  latitudes,  closer  to  the  poles,  the  circum¬ 
ferential  distance  around  the  Earth,  and  perpendicular  to  a  great  circle  of 
longitude,  is  less,  so  the  speed  is  less.  The  speed  diminishes  to  almost 
nothing  at  the  geographic  poles,  which  are  pierced  by  the  rotation  axis. 


Every  location  on  the  Earth  rotates  about  an  axis  that  pierces  the  Earth  and 
extends  between  its  North  and  South  Poles.  The  period  of  rotation,  and  the  length 
of  the  day,  is  everywhere  the  same,  but  the  rotation  speed  is  fastest  at  the  equator 
and  systematically  lower  at  higher  latitudes. 

The  geographic  description  of  the  location  of  an  observatory  includes  its  height,  h , 
in  meters  above  mean  sea  level.  The  geodetic  coordinates  of  longitude,  latitude  and 
height,  designated  2,  0,  and  h ,  are  specified  online  in  The  Astronomical  Almanac  for 
all  observatories  engaged  in  professional  programs  of  astronomical  observations. 

The  Global  Positioning  System  (GPS)  is  now  used  to  determine  reliable  loca¬ 
tion  and  time  information.  It  is  a  system  of  about  30  navigation  satellites  devel¬ 
oped,  maintained  and  operated  by  the  U.S.  Air  Force  for  military  and  civilian 
purposes.  Each  satellite  is  constantly  beaming  radio  signals  that  contain  the  exact 
time.  These  signals  take  a  few  milliseconds  to  travel  from  a  satellite  to  the  GPS 
receiver,  and  it  has  a  built-in  computer  that  calculates  its  precise  position  on  Earth 
using  signal  time  delays  from  four  or  more  satellites.  The  time  differences  are 
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converted  into  distance  by  multiplication  with  the  speed  of  light,  and  these  dis¬ 
tances  are  translated  by  triangulation  into  an  exact  position  accurate  to  about 
100  m. 

The  numbers  after  the  N  (north)  notation  on  the  GPS  receiver  indicate  its 
latitude,  while  the  numbers  after  the  W  (west)  notation  indicates  its  longitude.  The 
GPS  devices  used  in  automobiles  specify  the  longitude  and  latitude  of  your  start 
and  end  points,  and  map  the  route  between  them. 

The  GPS  was  initially  developed  by  the  military  and  is  still  used  by  them. 
Soldiers  can  use  a  GPS  device  to  find  an  enemy  objective,  even  in  the  dark  or 
unfamiliar  territory;  weapons  systems  can  use  them  to  track  potential  ground  and 
air  targets.  All  GPS  receivers  capable  of  functioning  above  18  km  in  altitude  and 
moving  faster  than  515  m  s-1  are  classified  as  weapons. 


1.3  Location  in  the  Sky 

You  may  have  watched  the  stars  as  they  rise  at  the  horizon  on  one  side  of  the 
Earth,  slowly  move  overhead,  and  eventually  set  on  the  other  side  of  the  planet, 
only  to  reappear  the  next  night.  This  slow  coursing  of  stars  was  initially  attributed 
to  a  revolving  celestial  sphere,  which  carried  its  embedded  stars  about  a  stationary 
Earth,  but  appearances  can  be  deceiving.  The  Earth  is  instead  spinning  under  an 
imaginary  celestial  sphere  concentric  with  the  Earth,  on  which  the  stars  and  other 
astronomical  objects  are  placed.  Such  a  celestial  sphere  explains  why  people 
located  at  different  places  on  Earth  invariably  see  just  half  of  all  the  stellar  sky. 
As  the  Earth  rotates,  day  turns  into  night  and  these  stars  glide  by. 

Astronomers  define  points  and  circles  on  the  celestial  sphere  (Fig.  E2).  If  you 
extend  the  Earth’s  rotation  axis  in  both  directions,  it  intersects  the  celestial  sphere 
at  the  north  and  south  celestial  poles.  They  are  the  pivotal  points  of  the  night  sky’s 
apparent  daily  rotation.  When  the  plane  of  the  Earth’s  Equator  is  extended  outward 
in  all  directions,  it  cuts  the  celestial  sphere  in  half,  at  the  celestial  equator.  The 
point  where  the  Sun  crosses  the  celestial  equator  going  northward  in  spring  is 
called  the  Vernal  Equinox.  The  Vernal  Equinox  is  sometimes  called  the  first  point 
of  Aries,  and  is  given  the  symbol  y. 

The  projection  of  the  plane  of  the  Earth’s  orbit  onto  the  celestial  sphere  is 
known  as  the  ecliptic,  and  the  angular  separation  between  the  ecliptic  and  the 
celestial  equator  is  called  the  obliquity  of  the  ecliptic,  designated  s ,  which  is  about 
23.5°.  The  obliquity  is  also  the  angle  between  the  Earth’s  rotational  axis  and  a  line 
perpendicular  to  its  orbital  plane.  On  the  standard  reference  date  of  January  1.5,  or 
at  noon  in  January  1,  in  the  year  2000.0,  the  slowly  changing  obliquity  had  the 
exact  value  of: 


e  =  23°  26' 21.406" 
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Fig.  1.2  Celestial  coordinates  Stars,  galaxies,  and  other  cosmic  objects  are  placed  on  an 
imaginary  celestial  sphere.  The  celestial  equator  divides  the  sphere  into  northern  and  southern 
halves,  and  the  ecliptic  is  the  annual  path  of  the  Sun  on  the  celestial  sphere.  The  celestial  equator 
intersects  the  ecliptic  at  the  Vernal  Equinox  and  the  Autumnal  Equinox.  Every  cosmic  object  has 
two  celestial  coordinates.  They  are  the  right  ascension,  designated  by  the  angle  alpha,  a,  or  by 
R.A.,  and  the  declination,  denoted  by  the  angle  delta,  d,  or  Dec.  Right  ascension  is  measured 
eastward  along  the  celestial  equator  from  the  Vernal  Equinox  to  the  foot  of  the  great  circle  that 
passes  through  the  object.  Declination  is  the  angular  distance  from  the  celestial  equator  to  the 
object  along  the  great  circle  that  passes  through  the  object,  positive  to  the  north  and  negative  to 
the  south.  Precession  results  in  a  slow  motion  of  the  Vernal  Equinox,  producing  a  steady  change 
in  the  celestial  coordinates 


where  the  symbol  °  denotes  angular  degrees,  the  '  symbol  designates  minutes  of 
arc  or  angle,  and  the  symbol  "  denotes  seconds  of  arc. 

Positions  on  the  celestial  sphere  are  defined  by  angles  along  great  circles. 
By  analogy  with  terrestrial  longitude,  right  ascension,  denoted  a,  is  a  celestial 
object’s  longitude,  but  it  is  measured  eastward  along  the  celestial  equator  from  the 
Vernal  Equinox.  The  right  ascension  is  expressed  in  hours  and  minutes  of  time, 
with  24  h  in  the  complete  circle  of  360  degrees,  denoted  as  360°.  For  conversion, 
1  h  of  time  is  equivalent  to  15°  of  angle,  orlh=15°;lsof  time  is  equal  to  15  s 
of  arc,  or  1  s  =  15";  and  1  min  of  arc  equals  4  s  of  time,  or  V  =  4  s. 

Just  as  latitude  is  a  measure  of  a  one’s  distance  from  the  Equator  of  the  Earth, 
declination,  denoted  3 ,  is  a  celestial  object’s  angular  distance  from  the  celestial 
equator.  The  declination  is  positive  for  objects  located  north  of  the  celestial 
equator,  and  they  are  observed  by  inhabitants  of  the  northern  hemisphere  of  the 
Earth.  Celestial  objects  in  the  southern  sky  have  negative  declinations,  and  people 
living  in  the  southern  half  of  our  planet  observe  them. 
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Example:  What  can  you  see  in  the  night  sky  from  where  you  are? 

An  observer  can  see  only  half  of  the  celestial  sphere  -  the  half  above  the 
local  horizon.  The  celestial  objects  that  can  be  seen  depend  on  only  two 
things,  the  observer’s  latitude,  denoted  by  0,  and  the  object’s  declination, 
designated  3.  At  the  Earth’s  geographic  North  Pole,  and  latitude  0  =  90°, 
the  north  celestial  pole  is  directly  overhead  and  located  near  the  star  Polaris, 
and  the  horizon  runs  along  the  celestial  equator.  This  observer  can  therefore 
only  see  the  northern  half  of  the  celestial  sphere,  and  objects  with  positive, 
northern  declination;  all  the  objects  with  negative,  southern  declinations  are 
forever  invisible  from  this  location.  For  an  observer  located  at  different 
northern  latitudes  0,  stars  can  be  observed  with  declinations  greater  than 
0  —  90°,  or  3  >  0  —  90°,  for  0  <  0  <  90°;  only  these  stars  rise  above  the 
horizon.  The  southern  half  of  the  celestial  sphere  is  fully  visible  from  the 
geographic  South  Pole,  while  the  northern  sky  is  unseen.  At  the  Equator 
where  0  =  0,  the  complete  range  of  positive  and  negative  declinations  are 
visible.  Of  course,  the  observer  has  to  wait  for  the  rotating  Earth  to  bring  any 
potentially  observable  object  above  the  horizon  and  into  the  observable  half 
of  the  celestial  sphere. 

For  centuries,  astronomers  have  used  catalogues  of  right  ascension,  a,  and 
declination,  3 ,  of  celestial  objects  to  locate  them  in  the  sky  (Focus  1.1).  Modern 
celestial  positions  of  the  highest  accuracy  are  referred  to  the  center  of  mass  of  all 
bodies  in  the  solar  system,  known  as  the  solar  system  barycenter,  and  specified 
within  an  International  Celestial  Reference  System  (Kaplan  2005). 

Focus  1.1  Astronomical  catalogues 

The  positions,  brightness,  spectra,  angular  size,  and  other  data  of  different 
kinds  for  celestial  objects  are  given  in  catalogues  provided  over  centuries  of 
meticulous  observations  by  dedicated  astronomers.  Stars  were  the  first 
celestial  objects  to  be  catalogued.  Their  accurate  positions  were  compiled 
and  used  to  discover  such  things  as  stellar  motions,  the  planet  Uranus,  and 
the  first  asteroid.  The  positions  and  spectral  classification  of  about  235,000 
stars  were  compiled  in  the  famous  Henry  Draper  Catalogue ,  published 
between  1918  and  1924.  The  letters  “HD”  followed  by  the  listed  number  in 
this  catalogue  often  designates  a  star.  Gliese  (1969)  and  Gliese  and  Jahreiss 
(1979)  have  catalogued  the  closest  stars  other  than  the  Sun.  Planets  are  now 
being  discovered  around  such  nearby  stars  designated  by  “GJ”  followed  by 
their  number  in  this  catalogue  -  GJ  581,  for  example. 

The  French  astronomer  Charles  Messier  (1730—1817)  compiled  one  of 
the  most  famous  catalogues  (Messier  1781).  His  list  of  just  over  100  bright 
non-stellar  objects  includes  some  of  the  most  widely  studied  objects  in  the 
universe,  now  known  as  emission  nebulae,  galaxies,  star  clusters,  and 
supernova  remnants.  The  letter  “M”  followed  by  the  number  in  the  Messier 
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Catalogue  indicates  them.  The  Crab  Nebula  supernova  remnant,  M  1,  is  the 
first  on  the  list,  and  M  31  is  the  closest  spiral  galaxy,  also  known  as 
Andromeda. 

In  astronomical  parlance,  a  nebula  is  a  diffuse,  non-stellar  object.  Some 
extragalactic  nebulae,  which  reside  outside  our  Milky  Way,  were  eventually 
designated  as  galaxies.  They  contain  as  many  as  100  billion  stars,  as  well  as 
diffuse  gaseous  nebulae.  An  emission  nebula  consists  of  interstellar  gas 
glowing  from  the  ultraviolet  light  of  a  nearby  luminous  star. 

William  Herschel  (1738—1822)  dramatically  increased  the  number  of 
known  non-stellar  objects  to  2,500,  during  20  years  of  systematically 
observing  the  heavens,  from  1783  to  1802.  This  sweep  of  the  sky’s  northern 
hemisphere  was  extended  to  the  southern  hemisphere  by  William’s  son,  Sir 
John  Herschel  (1792—1871),  who  published  data  for  5,079  objects  in  his 
General  Catalogue  in  1864,  the  combined  result  of  more  than  half  a  century 
of  painstaking  observations. 

Using  the  Herschel  catalogue  as  a  basis,  J.  L.  E.  Dreyer  (1852—1926) 
published  his  New  General  Catalogue  (NGC)  of  nebulae  and  star  clusters, 
followed  by  two  Index  Catalogues ,  designated  IC.  Many  galaxies,  as  well  as 
emission  nebulae  and  star  clusters,  are  still  known  by  their  NGC  and  IC 
numbers. 

Later  on,  the  photographic  Palomar  Sky  Survey,  using  the  wide-angle 
1.2  m  (48  in.)  Schmidt  telescope  on  Palomar  Mountain,  was  used  to  cata¬ 
logue  tens  of  thousands  of  galaxies;  an  observatory  telescope  is  often  des¬ 
ignated  by  a  name  and  its  diameter  in  meters  or  inches.  In  1958,  George 
Abell  (1927—1983)  used  it  to  create  a  catalogue  of  2,712  rich  clusters  of 
galaxies;  the  designation  “A”  followed  by  the  number  in  his  catalogue  is  still 
used  today.  Millions  of  galaxies  were  catalogued  in  the  mid- twentieth  cen¬ 
tury  using  photographs  taken  using  large  telescopes,  but  galaxy  catalogues 
have  now  become  computerized.  An  important  example  is  the  Sloan  Digital 
Sky  Survey,  created  from  a  dedicated,  computer-driven  2.5  m  (98  in.) 
telescope. 

Bright  radio  sources  are  designated  by  “3C”  followed  by  the  number  in 
the  Third  Cambridge  Catalogue  of  Radio  Sources  published  in  1959,  a 
famous  example  is  3C  273,  the  first  quasar  to  be  discovered.  The  most 
intense  radio  and  x-ray  sources  have  been  named  after  the  constellation  they 
appear  in.  For  example,  Cygnus  A  is  a  bright  radio  galaxy,  Cygnus  X-l  is  a 
bright  x-ray  source  and  a  candidate  black  hole,  and  Centaurus  X-3  is  an 
x-ray  pulsar. 

Specific  types  of  objects,  such  as  pulsars,  supernova  remnants,  and  white 
dwarf  stars,  have  their  own  catalogues,  and  are  often  designated  by  letters 
like  “PSR”,  “SNR”,  or  “WD”  followed  by  their  celestial  position. 
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The  international  celestial  reference  frame  is  defined  by  a  catalogue  of 
exceedingly  accurate  positions  for  extragalactic  radio  sources  observed  with 
Very  Long  Baseline  Interferometry.  At  optically  visible  wavelengths,  the 
Tycho-2  catalogue  of  positions  for  more  than  2.5  million  stars,  observed 
from  the  HIPPARCOS  satellite,  is  used.  Accurate  positions  of  major  solar 
system  bodies  are  given  as  a  function  of  time  in  an  Ephemerides  provided  by 
the  Jet  Propulsion  Laboratory. 


As  the  Sun  moves  along  the  ecliptic,  it  crosses  the  celestial  equator  twice,  on  its 
way  north  at  the  Vernal  Equinox,  on  about  March  20,  and  then  at  the  Autumnal 
Equinox  on  about  September  23.  On  either  equinox,  the  Sun  lies  in  the  Earth’s 
equatorial  plane,  so  the  twilight  zone  that  separates  night  and  day  then  cuts  the 
Earth  in  equal  parts  and  the  days  and  nights  are  equally  long.  The  point  at  which 
the  Sun  is  farthest  north,  is  the  Summer  Solstice  (on  about  June  21),  and  its  most 
southerly  point  is  the  Winter  Solstice  on  about  December  22.  The  days  in  the 
northern  hemisphere  are  the  longest  on  the  Summer  Solstice,  and  shortest  on  the 
Winter  Solstice.  So  the  crossing  of  the  Sun  at  the  equinoxes  and  solstices  mark 
the  beginning  of  the  seasons  in  the  Earth’s  northern  hemisphere,  and  the  location 
of  these  points  on  the  celestial  sphere  are  given  in  Table  1.1. 

Right  ascension  and  declination  provide  celestial  bearings  in  the  equatorial 
coordinate  system.  Another  celestial  coordinate  system  is  the  horizon,  or  hori¬ 
zontal,  coordinate  system  that  employs  great-circle  angles  measured  with  respect 
to  the  observer’s  zenith  and  horizon.  The  zenith  is  located  above  your  head, 
directly  away  from  the  center  of  the  Earth.  It  is  the  point  of  intersection  of  the 
celestial  sphere  with  the  upward  prolongation  of  the  observer’ s  plumb  line,  whose 
bob  is  drawn  to  the  terrestrial  center  by  gravity.  If  a  plane  is  extended  outward 
from  the  observer’ s  feet,  perpendicular  to  the  plumb  line,  it  intersects  the  celestial 
sphere  in  a  great  circle  known  as  the  horizon.  Celestial  objects  are  only  visible  if 
they  are  above  the  horizon.  The  altitude  measures  the  angular  distance  from  the 
horizon  to  the  object  in  question,  along  a  great  circle  that  intersects  the  object  and 
the  zenith.  The  azimuth,  an  angle  measured  along  the  horizon,  provides  the  second 
dimension  to  this  coordinate  system.  Angles  are  used  to  designate  celestial  posi¬ 
tions  in  both  types  of  celestial  coordinate  systems. 


Table  1.1  Celestial  positions  of  the  equinoxes  and  solstices 


Position 

Right  ascension  a  (2000.0) 

Declination  S  (2000.0) 

Vernal  (Spring)  equinox 

0  h 

0° 

Summer  solstice 

6  h 

23°  26'  21.4" 

Autumnal  (Fall)  equinox 

12  h 

0° 

Winter  solstice 

24  h 

-23°  26'  21.4" 

1.4  Measuring  Angle  and  Size 
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1.4  Measuring  Angle  and  Size 

Astronomers  measure  angles  in  degrees,  designated  by  the  superscript  °,  and  there 
are  360°  in  a  circle.  They  also  use  the  second  of  arc,  or  arc  second,  denoted  by  the 
symbol  ",  and  the  minute  or  arc,  or  arc  minute,  abbreviated  by  ',  as  a  units  of  angle. 
The  units  mimic  a  clock  with  60  s  in  a  minute,  or  60"  =  V .  A  full  degree  of  angle 
contains  60  min  of  arc,  so  1°  =  60'  =  3,600". 

Mathematicians  use  a  different  unit  of  angular  measurement  called  a  radian. 
The  radian  is  the  ratio  between  the  length  of  an  arc  and  its  radius.  The  ratio  of 
linear  size  to  distance  is  expressed  in  radians,  where  an  angle  of  one  radian,  when 
viewed  from  the  center  of  a  circle,  results  in  an  arc  on  that  circle  equal  to  the  radius 
of  the  circle  (Fig.  1.3).  That  is,  the  radian  unit  of  angular  measure  is  defined  such 
that  an  angle  of  one  radian  subtended  from  the  center  of  a  unit  circle  produces  an 
arc  length  of  one. 

A  full  circle  subtends  In  rad  and  360°,  where  n  =  3.141592654,  so 
1  rad  =  360/(27t)  =  57.2958°,  and  the  conversion  factors  between  seconds  of  arc 
and  radians  are: 

1"  =4.848  x  10-6  rad, 


and 


1  rad  =  2.06265  x  105 "  =  57.2958°. 
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Fig.  1.3  The  Sun’s  angular  size  and  radius  The  solar  radius  can  be  determined  from  the  Sun’s 
angular  size  and  distance.  As  long  as  this  angle  is  small,  the  physical  size  is  only  a  small  arc  of  a 
large  circle,  denoted  by  the  dashed  line,  and  the  angular  size  is  the  ratio  of  the  physical  size  to  the 
distance.  Astronomers  specify  this  angle  as  a  partial  arc  of  a  full  circle  of  360°;  for  the  Sun  it  is 
about  32  min  of  arc,  in  which  there  are  60  min  of  arc  in  1°.  This  angle  has  been  enlarged  to 
display  it  in  this  illustration.  In  mathematics,  the  radian  is  the  standard  unit  of  angular  measure.  It 
describes  the  angle  subtended  by  a  circular  arc  as  the  length  of  the  arc  divided  by  the  radius  of  the 
arc.  When  the  arc  length  is  equal  to  the  arc  radius,  the  angle  is  1  rad.  We  can  convert  between 
the  two  methods  of  describing  angles  by  noting  that  the  circumference  of  a  circle  is  2n  times  its 
radius;  therefore  1  rad  is  equal  to  360°/(27r),  or  57.2958°.  For  the  Sun,  the  angular  size  0Q  = 
2 Rq/Dq  radians,  where  RQ  denotes  the  Sun’s  radius  and  the  mean  distance  of  the  Sun,  D0,  is  1 
AU.  The  observed  angular  size  of  the  Sun  corresponds  to  a  radius  of  695.5  million  meters 
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A  telescope  can  be  used  to  measure  the  angular  size,  0size,  of  a  celestial  source 
if  the  angular  resolution  of  the  telescope  is  smaller  than  the  angular  size  of  the 
source.  If  the  source  distance,  D ,  is  known,  then  one  can  infer  its  linear  extent,  L, 
perpendicular  to  the  line  of  sight,  or  if  spherical  its  radius,  R,  using  the  angular  size 
and  the  relation: 


n  L  2  R 

resize  =  —  =  —  rad. 
D  D 


Example:  Measuring  the  size  of  the  Sun 

The  linear  radius  of  the  Sun,  denoted  by  RQ,  can  be  determined  from  the 
Sun’s  angular  diameter,  denoted  by  0©,  using 


2Rq 

Dq 


rad 


where  the  mean  distance  between  the  Earth  and  the  Sun  has  a  value  of 
Dq  =  1  AU  =  1.496  x  1011  m.  As  illustrated  in  Fig.  1.3,  this  expression 
uses  the  mathematician’s  radian  unit  of  angular  measurement,  and  it  has  to 
be  converted  to  astronomical  measurements  of  angle.  The  radian  is  the  ratio 
between  the  length  of  an  arc  and  its  radius. 

The  mean  equatorial  angular  diameter  of  the  Sun  is  0©  =  31.97  min  of 
arc  =  31.97'  =  1,918.2",  and  this  is  equal  to  0.009299  rad,  since 
1  rad  =  206265  s  of  arc.  So  the  radius  of  the  Sun  is  given  by 


*0  = 


9qDq 


=  6.956  x  108  m, 


which  is  109  times  the  radius  of  the  Earth. 


By  the  way,  the  angular  diameter  of  the  Sun  is  about  the  same  angle  as  that 
subtended  by  the  thumb  when  viewed  at  arm’s  length.  In  one  of  those  fascinating 
coincidences,  the  angular  diameter  of  the  Sun  is  also  about  the  same  as  the  angular 
diameter  of  the  Moon,  which  is  much  closer  to  us  and  smaller  in  radius  than  the 
Sun.  Because  of  this  similarity  in  angular  size,  the  Moon  can  pass  in  front  of  the 
Sun  during  a  total  solar  eclipse,  blocking  out  the  sunlight. 


1.5  The  Locations  of  the  Stars  are  Slowly  Changing 

While  most  stars  sweep  by  as  the  Earth  rotates,  a  star  that  is  aligned  with  our 
planet’s  rotation  axis,  at  the  north  celestial  pole,  seems  to  remain  placed  in  an 
unchanging  location  at  90°  north  declination.  The  Earth’s  northern  rotation  axis, 
for  example,  now  points  close  to  Polaris,  also  known  as  the  North  Star  or  the  Pole 


1.5  The  Locations  of  the  Stars  are  Slowly  Changing 
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Star,  which  would  lie  approximately  overhead  when  viewed  from  the  Earth’s 
geographic  North  Pole.  The  latitude  of  any  location  in  the  Earth’s  Northern 
Hemisphere  is  equal,  within  about  1°,  to  the  angular  altitude  of  Polaris.  The 
uncertainty  is  due  to  the  fact  that  Polaris  is  not  exactly  at  the  north  celestial  pole, 
where  the  north  end  of  the  Earth’s  rotation  axis  pierces  the  night  sky. 

We  can  locate  Polaris  by  following  the  line  joining  the  two  stars  farthest  from 
the  handle  of  the  Big  Dipper,  which  accounts  for  the  phrase  “follow  the  drinking 
gourd”  used  by  southern  slaves  escaping  to  the  northern  parts  of  the  United  States. 
Mariners  have  also  used  the  North  Star  for  navigation,  to  find  the  direction  of  north 
and  the  latitude  of  their  ship. 

Nevertheless,  everything  in  the  universe  is  in  a  state  of  perpetual  change,  and 
the  locations  of  the  so-called  fixed  stars  on  the  celestial  sphere  are  no  exception. 
Their  change  in  position  is  related  to  the  Earth’s  elongated  shape  (Focus  1.2), 
which  has  sent  the  Earth  into  a  wobbling  rotation  that  resembles  a  spinning  top. 
This  causes  a  very  slow  change  of  the  celestial  positions  of  the  north  celestial  pole, 
the  Pole  Star  and  all  the  other  stars,  called  precession.  The  changing  positions  of 
bright  stars  on  the  celestial  sphere  were  first  observed  by  Hipparchus,  a  Greek 
astronomer  who  lived  in  the  second  century  BC  (Hipparchus,  125  BC);  the  tele¬ 
scope  was  not  invented  until  17  centuries  after  Hipparchus  established  the  stellar 
positions  using  his  eyes. 

Focus  1.2  The  elongated  shape  of  the  Earth 

The  Earth  isn’t  precisely  spherical  in  shape.  It  has  a  slight  bulge  around  its 
equatorial  middle  and  is  flattened  at  its  poles,  with  a  shape  more  like  an  egg 
than  a  marble  or  billiard  ball.  This  elongated,  oblate  shape  is  caused  by  the 
Earth’s  rapid  rotation.  The  outward  force  of  rotation  opposes  the  inward 
gravitational  force,  and  this  reduces  the  pull  of  gravity  in  the  direction  of 
spin.  Since  this  effect  is  most  pronounced  at  the  equator,  and  least  at  the 
poles,  the  solid  Earth  adjusts  into  an  oblate  shape  that  is  elongated  along  the 
equator. 

An  ellipse  of  eccentricity,  e ,  and  major  axis,  ae,  which  is  rotated  about  the 
polar  axis,  defines  the  Earth’s  reference  ellipsoid  at  sea  level.  The  planet’s 
equatorial  radius  is  ae,  and  its  polar  radius,  ap.  They  are  given  by: 

ap  =  ae{  1  -/)  =  ae(  1  -  e2)1/2, 

where  the  flattening  factor  f  =  (ae  —  ap) / ae  is  related  to  the  eccentricity,  e , 
by  e2  =  2f  —  f2. 

The  mean  surface  radius  of  the  Earth,  (a),  is  given  by: 

(a)  =  6.371  x  106m, 

which  is  the  radius  of  a  sphere  of  volume  equal  to  the  Earth  ellipsoid. 
Geophysicists  use  another  definition  of  mean  radius  given  by  (2 ae  +  ap)/ 3. 
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The  radius,  r,  of  the  surface  of  the  Earth  geoid  at  any  latitude  /  is  given  by 

r  =  ae(  1  —/sin2  /). 

Two  of  the  primary  constants  of  the  International  Astronomical  Union  are 
(Kaplan  2005): 

z: 

Equatorial  radius  of  the  Earth  =  ae  =  6.3781366  x  10°  m, 
and 

Flattening  factor  for  Earth  =  /  =  0.0033528197  =  1/298.25642. 

These  values  of  ae  and  /  give  a  polar  radius  for  the  Earth  of 

/T 

ap  =  6.356752  x  10  m,  and  the  difference  between  the  equatorial  and  polar 
radius  is  21,385  m  or  about  21  km. 

The  world  geodetic  system,  which  is  the  basis  of  terrestrial  locations 
obtained  from  the  Global  Positioning  System,  or  GPS  for  short,  uses  an 
Earth  ellipsoid  with  ae  =  6.378137  x  106  m  and/=  1/298.257223563. 


The  changing  locations  of  celestial  objects  are  caused  by  the  gravitational 
action  of  the  Moon,  Sun  and  planets  on  the  spinning,  oblate  Earth.  As  a  result  of 
this  gravitational  torque,  the  Earth’s  rotation  axis  is  constantly  changing  with 
respect  to  a  space-fixed  reference  system. 

The  precessional  motion  of  the  Earth’s  rotation  axis  is  caused  by  the  tidal  action 
of  the  Moon  and  Sun  on  the  spinning  Earth.  That  is,  because  the  Moon  and  the  Sun 
lie  in  the  ecliptic  plane,  which  is  inclined  by  23.5°  to  the  plane  of  the  Earth’s 
Equator,  they  exert  a  gravitational  force  on  the  Earth’s  equatorial  bulge.  This 
causes  the  rotation  axis  to  sweep  out  a  cone  in  space,  centered  at  the  axis  of  the 
Earth’s  orbital  motion  and  completing  one  circuit  in  about  26,000  years  (Fig.  1.4). 

So  the  Earth  is  not  placed  firmly  in  space;  instead  it  wobbles  about  causing  the 
identity  of  the  Pole  Star  to  gradually  change  over  time  scales  of  thousands  of  years. 
The  northern  projection  of  the  Earth’s  rotation  axis  is  currently  within  about  0.75° 
of  Polaris  and  will  move  slowly  toward  it  in  the  next  century.  After  that,  the  north 
celestial  pole  will  move  away  from  Polaris  and,  in  about  12,000  years,  the  Earth’s 
rotation  axis  will  point  to  within  5°  of  the  bright  star  Vega. 

The  slow  conical  motion  of  precession  carries  the  Earth’s  Equator  with  it;  as 
that  Equator  moves,  the  two  intersections  between  the  celestial  equator  and  the 
Sun’s  path,  or  ecliptic,  move  westward  against  the  background  stars.  One  of  these 
intersections  is  the  Vernal,  or  Spring,  Equinox,  from  which  right  ascension  is 
measured.  This  equinox  point  moves  forward  (westward)  along  the  ecliptic  at  the 
rate  of  about  50  s  of  arc,  denoted  50",  per  year,  which  is  equivalent  to  3.33  s  per 
year. 

As  the  Earth’s  rotational  axis  precesses,  declinations  also  change,  through  a 
range  of  47°,  or  twice  23.5°,  over  26,000  years. 


1.5  The  Locations  of  the  Stars  are  Slowly  Changing 
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Fig.  1.4  Precession  The  Earth’s  rotation  axis  traces  out  a  circle  on  the  sky  once  every 
26,000  years,  sweeping  out  a  cone  with  an  angular  radius  of  about  23.5°.  The  Greek  astronomer 
Hipparchus  (c.  146  BC)  discovered  this  precession  in  the  second  century  BC.  The  north  celestial 
pole,  which  marks  the  intersection  of  this  rotation  axis  with  the  northern  half  of  the  celestial 
sphere,  now  lies  near  the  bright  star  Polaris.  However,  as  the  result  of  precession,  the  rotational 
axis  will  point  toward  another  bright  north  star,  Vega,  in  roughly  13,000  years.  This  motion  of  the 
Earth’s  rotational  axis  also  causes  a  slow  change  in  the  celestial  coordinates  of  any  cosmic  object 


Because  the  long  period,  26,000  year,  conical  motion  of  the  Earth’s  rotation 
axis  is  caused  by  the  gravitational  action  of  the  Moon  and  Sun,  it  is  called  lunisolar 
precession.  Simon  Newcomb  (1835  —  1909)  derived  the  detailed  theory  for  com¬ 
puting  the  corrections  to  astronomical  coordinates  for  this  precession  (Newcomb 

1895). 

In  addition  to  this  steady,  progressive  motion,  there  are  small,  periodic  varia¬ 
tions  in  both  precessional  speed  and  axial  tilt  caused  by  the  gravitational  action  of 
the  planets  on  the  Earth’s  equatorial  bulge.  The  most  important  term  in  this 
nutation,  first  observed  by  James  Bradley  (1693—1762),  induces  an  18.6  year 
periodic  wobble  in  the  precessional  motion  with  a  size  of  17"  in  the  direction  of 
precession  and  9"  perpendicular  to  it  (Bradley  1748). 

Because  of  positional  changes  caused  by  precession  and  nutation,  the  equinox,  or 
reference  date,  must  be  given  when  specifying  the  right  ascension  or  declination  of 
any  cosmic  object.  The  standard  epoch  that  is  now  in  used  for  celestial  positions  is: 

J2000.0  =  2000  January  1.5  =  JD245 1545.0, 

where  JD  denotes  Julian  Date  and  the  prefix  J  denotes  the  current  system  of 
measuring  time  in  Julian  centuries  of  exactly  36,525  days  in  100  years,  with  each 
day  having  a  duration  of  86,400  s. 

The  combination  of  lunisolar  and  planetary  precession  is  called  general  pre¬ 
cession,  and  the  astronomical  constants  at  standard  epoch  J2000.0  include  (Kaplan 

2005): 

General  precession  in  longitude  =  p  =  5,028.796 195"  per  Julian  century. 
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The  nutation  term,  N,  for  that  epoch  is 

Constant  of  nutation  =  N  =  9.205233 1". 

Kaplan  (2005)  provides  modern  formulas  for  precession  and  nutation.  They 
describe  the  transformation  of  celestial  coordinates  from  one  date  to  another,  as  a 
function  of  time  since  a  reference  epoch. 

The  astronomical  constants  also  include  an  aberration  constant,  k ,  which 
accounts  for  the  observed  position  shift  of  an  astronomical  object  in  the  direction 
of  the  Earth’s  motion  (Focus  1.3).  The  aberration  constant  at  the  standard  epoch  J 
2000.0  is 


Constant  of  aberration  —  k  —  20.49552//. 


Focus  1.3  Stellar  aberration 

As  the  Earth  orbits  the  Sun,  the  stars  all  appear  to  be  shifted  in  the  direction 
of  motion,  a  phenomenon  called  stellar  aberration,  described  by  James 
Bradley  (1693—1767)  in  1728.  Because  the  speed  of  light  is  finite,  the 
apparent  direction  of  a  celestial  object  detected  by  a  moving  observer  is  not 
the  same  as  its  geometric  direction  at  the  time.  For  stars,  the  normal  practice 
is  to  ignore  the  correction  for  the  motion  of  the  celestial  object,  and  to 
compute  the  stellar  aberration  due  to  the  motion  of  the  observer.  This  also 
gave  Bradley  a  means  to  improve  on  the  accuracy  of  previous  estimates  for 
the  speed  of  light  (Bradley  1728). 

The  magnitude  A  6  of  stellar  aberration  depends  on  the  ratio  of  the 
velocity  of  the  observer,  V,  to  the  speed  of  light,  c,  and  the  angle,  6 ,  between 
the  direction  of  observation  and  the  direction  of  motion.  The  displacement, 
A  6 ,  in  the  sense  of  apparent  minus  mean  place,  is  given  by 


Ad 


—  sind  — 
c 


1 

2 


As  the  Earth  orbits  the  Sun,  it  is  moving  at  a  velocity  of  approximately 
30  km  s"1,  and  the  speed  of  light  c  «  300,000  km  s_1,  so  the  term  of  order 
V/c  is  10  rad  or  20  s  of  arc,  denoted  20",  and  the  term  (V/c)  has  a 
maximum  value  of  0.001  s  of  arc.  Bradley  (1728)  used  aberration  obser¬ 
vations  to  determine  the  speed  of  light  as  approximately  183,000  miles  per 
second  or  294,500  km  s_1. 


The  constant  of  aberration,  k ,  at  standard  epoch  J2000.0  is  given  by 


2tt^2  ,  ^ . j_  17  _ • 

Constant  of  aberration  =  k  = - ( 1  —  c  )  2  =  —  =  0.9365  x  10  rad, 

Pc  K  J  c 
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which  is  equivalent  to: 


k  =  20.49552" 

Here  the  constant  n  «  3.14592654,  the  mean  distance  between  the  Earth  and  the 
Sun  is  a  =  1  AU  =  1.49598  x  1011  m,  known  as  the  astronomical  unit, 

7 

e  =  0.01671  is  the  mean  eccentricity  of  the  Earth’s  orbit,  P  =  3.1558  x  10  sis 

o  _ 1 

the  length  of  the  sidereal  year,  c  =  2.997925  x  10°  m  s  is  the  speed  of  light, 
and  one  radian  =  2.062648  x  105  //. 

When  the  observer  is  moving  directly  at  the  star,  6  is  zero  and  there  is  no 
aberration  shift  at  all.  The  shift  achieves  its  greatest  value  of  about  20.5"  when  the 
observer’s  motion  is  perpendicular  to  the  direction  of  the  star,  with  6  =  90°. 


1.6  What  Time  is  It? 

There  are  two  ways  of  keeping  time  in  common  use  today.  One  is  atomic  time,  the 
basis  of  the  Systeme  International,  abbreviated  SI,  second,  and  the  other  is  based 
on  the  rotation  of  the  Earth  (Kaplan  2005;  Seidelmann  2005).  The  SI  second  is  the 
fundamental  unit  of  atomic  time,  which  is  specified  by  atomic  clocks  that  use 
cesium-beam  and  other  atomic  frequency  standards  to  an  accuracy  of  1.5  x  10-14, 
or  to  the  fourteenth  decimal  place  (Essen  1969).  The  frequency  standards  form  a 
standard  timescale  known  as  International  Atomic  Time,  abbreviated  TAI  for 
Temps  Atomique  International.  The  time  distributed  by  the  Global  Positioning 
System,  or  GPS,  remains  at  a  constant  offset  with  International  Atomic  Time.  On 
21  November  2010,  TAI-GPS  =  19  s. 

The  clocks  we  use  in  daily  life  are  set  to  the  Earth’s  rotation  with  respect  to  the 
Sun.  It  establishes  our  daily  rhythm,  from  sunrise  to  sunset  and  back  to  sunrise 
again.  The  24  h  solar  day  is  the  time  it  takes  for  the  Sun  to  make  one  circuit  around 
the  local  sky. 

By  definition: 


1  solar  day  =  24  h  =  1,440  min  =  86,400  s. 

This  is  known  as  the  unit  of  Sun  time,  or  solar  time.  Also  by  definition,  the  Julian 
century  has  36,525  solar  days  and  one  day  is  defined  as  86,400  s  of  International 
Atomic  Time,  or  TAI.  So: 

1  Julian  year  =  365.25  solar  days  =  8,766  h  =  525,960  min  =  31,557,600  s 

Solar  time  is  the  basis  of  Universal  Time,  abbreviated  UT,  which  has  been 
defined  to  be  as  uniform  as  possible  despite  variation  in  the  Earth’s  rotation.  The 
worldwide  system  of  Civil  Time,  and  the  clock  on  your  wall  or  the  watch  on  your 
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arm,  are  synchronized  to  Coordinated  Universal  Time,  denoted  UTC,  which  has 
been  corrected  for  small  variations  in  the  Earth’s  rotation,  but  the  atomic  clocks  in 
satellites  and  the  time  in  the  Global  Positioning  System,  abbreviated  GPS,  are  not 
corrected  in  this  way. 

Because  of  irregularities  in  the  Earth’s  rotation  and  the  lengthening  of  its 
rotation  period  due  to  Moon-induced  tidal  friction,  this  Sun  time  does  not  advance 
at  a  uniform  rate,  and  it  increasingly  lags  behind  the  Si-second  time  scale.  UTC  is 
therefore  a  hybrid  time  scale  using  the  SI  second  on  the  spinning  Earth  as  a 
fundamental  unit,  but  subject  to  occasional  1  s  adjustments.  The  difference 
between  atomic  time  and  universal  time,  or  TAL  —  UTC  is  an  integral  number  of 
seconds,  which  increases  by  1  whenever  a  leap  second  is  introduced  into  UTC,  so 
the  two  kinds  of  clock  share  the  same  seconds  tick.  When  necessary,  the  adjust¬ 
ments  to  UTC  are  introduced  at  the  end  of  June  or  December,  by  international 
agreement. 

Universal  Time  is  equivalent  to  the  standard  Civil  Time  for  0°  longitude,  which 
is  defined  to  be  the  Prime  Meridian  at  Greenwich,  England.  By  specifying  the 
longitude  of  any  other  location  of  the  terrestrial  globe,  we  can  exactly  infer  the 
local  Sun  time.  For  example,  the  Sun  will  be  overhead  at  noon  in  Boston  about  4  h 
44  min  later  than  noon  in  Greenwich;  because  the  longitude  of  Boston  is 
71.06°  =  4.733  h,  where  360°  equals  24  h.  And  in  the  same  way,  noon  in  New 
York  City  will  occur  about  10  min  later  than  in  Boston,  because  New  York  City  is 
slightly  west  of  Boston. 

The  world  has  been  divided  into  standard  time  zones  based  on  about  1  h,  or  15°, 
increments  in  longitude,  so  our  watches  differ  from  others  in  hourly  increments 
and  are  slightly  out  of  synchronism  with  the  Sun.  Standard  time  is  the  result  of 
synchronizing  clocks  in  different  geological  locations  within  a  time  zone.  Blaise 
(2000)  has  described  the  creation  of  standard  time. 

Terrestrial  Time  (TT)  is  the  modern  time  standard  used  for  time  measurements 
of  astronomical  observations  from  the  surface  of  the  Earth.  The  unit  of  TT  is  the  SI 
second.  The  Astronomical  Almanac  uses  TT  in  the  tables  of  positions,  or  Ephe- 
merides ,  of  the  Sun,  Moon  and  planets  as  seen  from  the  Earth.  TT  is  slightly  ahead 
of  atomic  time,  and  can  be  approximated  by 

TT  «  TAI  +  32.184  s, 

which  is  equivalent  to 

TT  «  GPS  +  51.184  s 

With  the  advent  of  atomic  clocks  and  the  exact  targeting  of  planetary  space¬ 
craft,  measurements  of  time  were  further  refined  with  the  introduction  of  the 
Barycentric  Dynamical  Time,  or  TDB  for  short.  It  was  adopted  to  take  into 
account  the  relativistic  time  dilation  (Sect.  2.3)  when  calculating  orbits  and 
astronomical  ephemeris,  and  it  applies  to  the  solar-system-barycentric  reference 
frame.  The  barycenter  is  the  center  of  mass  of  two  or  more  orbiting  bodies.  The 
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difference  between  TDB  and  yet  another  B  ary  centric  Coordinate  Time,  or  TCB,  is 
about  16.6  s,  and 

Mean  ratio  of  the  TCB  second  to  the  TDB  second  =  1  —  LB , 

o 

where  LB  =  1.550519767  x  10  ,  an  exceedingly  small  number. 

In  every  day  life  we  use  Sun  time,  based  on  the  solar  day  that  is  exactly  24  h 
long,  in  solar  time.  This  solar  day  is  the  interval  between  two  successive  passages 
of  the  Sun  across  an  observer’s  meridian.  Astronomers  also  use  star  time,  or 
sidereal  time.  A  sidereal  day  is  the  time  between  successive  passages  of  a  star 
across  the  local  meridian,  and  this  star  day  is  about  4  min  less  than  a  solar  day. 


1.7  Telling  Time  by  the  Stars 

Astronomers  use  another  sort  of  time,  called  sidereal  time,  to  know  when  and  how 
to  point  their  telescopes  to  view  a  particular  star  or  any  other  cosmic  object.  The 
term  sidereal  is  derived  from  the  Latin  sidus  meaning  “star.”  As  with  solar  time, 
this  star  time  is  based  on  the  Earth’s  rate  of  rotation,  but  measured  relative  to  the 
fixed  stars  rather  than  the  Sun. 

The  sidereal  day  is  the  time  it  takes  for  a  star  -  or  any  other  celestial  objects  -  to 
proceed  from  its  highest  point  in  the  sky  one  day  to  its  highest  point  the  next  day. 
The  Earth  makes  one  rotation  about  its  axis  in  a  sidereal  day,  but  during  that  time  it 
moves  a  short  distance  along  its  orbit  around  the  Sun.  At  the  end  of  a  sidereal  day, 
the  Earth  therefore  needs  to  rotate  a  little  more  before  the  Sun  reaches  its  highest 
point  (Fig.  1.5).  A  solar  day  is  therefore  about  4  min  longer  than  the  sidereal  day. 
To  be  exact: 

1  sidereal  day  =  23  h  56m  4.09  s  =  23.93447  h  =  86,164.1  s, 

where  the  hours,  minutes  and  seconds  are  in  solar  time. 

Observatories  have  two  kinds  of  clocks  that  tell  either  the  local  solar  time  or  the 
local  star  time.  The  two  kinds  of  time  can  also  be  determined  using  clocks  or  time 
simulators  on  the  web. 

At  any  moment,  the  Local  Sidereal  Time  equals  the  right  ascension,  designated  a, 
of  a  celestial  object  on  the  local  meridian,  so  this  time  tells  an  observer  when  a  given 
celestial  object  with  a  particular  right  ascension  can  be  seen,  provided  its  declination 
is  in  the  observable  range  for  the  observer’s  latitude. 

Example:  When  is  a  celestial  object  visible  in  your  part  of  the  sky? 

For  an  object  with  an  observable  declination,  the  time  at  which  it  can  be 
observed  depends  upon  the  object’s  right  ascension,  a,  and  the  local  sidereal 
time,  abbreviated  LST.  The  object  crosses  the  local  meridian  when 
a  =  LST,  and  may  be  visible  for  several  hours  before  and  after  this  time. 
However,  a  terrestrial  clock  or  watch  is  geared  to  the  Sun  rather  than  the 
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stars,  so  it  does  not  keep  local  sidereal  time.  A  terrestrial  clock  is  equal  to  the 
local  sidereal  time  only  at  midnight  of  the  Autumnal  Equinox,  about  Sep¬ 
tember  23,  and  thereafter  the  local  sidereal  time  gains  2  h  on  the  terrestrial 
clock  for  each  succeeding  month. 


The  geographic  longitude,  denoted  2,  of  the  observer  relates  the  Local  Sidereal 
Time  (LST)  to  the  Greenwich  Sidereal  Time,  or  GST  for  short,  at  the  Prime 
Meridian  by 


LST  =  GST— ^(observer), 

where  the  longitude  is  measured  positive  westward  and  can  be  converted  into  time 
using  24  h  =  360°.  The  local  meridian  is  an  imaginary  half  circle  stretching  from 
the  horizon  due  north,  through  the  zenith,  to  the  horizon  due  south.  The  zenith  is 
directly  overhead,  an  extension  of  a  plumb  line  from  the  center  of  the  Earth 


Fig.  1.5  Sun  time  and  star  time  The  Sun  reaches  its  highest  point  in  the  daytime  sky,  its 
culmination,  at  noon,  and  this  happens  every  24  h  in  a  solar  day.  A  distant  star  returns  to  its 
highest  point  in  the  night  sky  every  sidereal  day  of  23  h  56  min  04  s,  which  is  the  unit  of  star 
time.  The  Earth  rotates  once  around  its  axis  in  one  sidereal  day,  but  during  that  time  the  Earth  has 
moved  along  its  orbit  around  the  Sun  (bottom).  The  star  ( small  circle,  top)  and  the  Sun  ( below 
star)  are  at  culmination,  crossing  the  local  meridian,  at  just  one  time  (left).  After  a  sidereal  day, 
the  Earth  has  to  rotate  for  another  3  min  56  s  before  the  Sun  reaches  its  highest  point  (right). 
A  solar  day  is  therefore  nearly  4  min  longer  than  a  sidereal  day 
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through  the  observer  to  the  celestial  sphere.  The  Prime  Meridian  is  the  local 
meridian  for  the  old  Royal  Observatory  in  Greenwich,  England. 

We  define  the  Local  Hour  Angle  (LHA)  of  any  object  on  the  celestial  sphere  to 
be  the  time  since  it  last  crossed  the  meridian,  or 

LHA  (object)  =  LST  —  a  (object)  =  GST  —  ^(observer)  —  a(object). 

The  Local  Sidereal  Time  is  equal  to  zero  when  the  Vernal,  or  Spring,  Equinox 
is  on  the  local  meridian. 

The  solar  and  star  times  are  related  by: 

One  mean  solar  day  =  24  h  03  min  56.555  s  of  mean  sidereal  time 
One  mean  solar  day  =  1.0027379  mean  sidereal  days, 

One  mean  sidereal  day  =  23  h  56  min  04.09  s  of  mean  solar  time 
One  mean  sidereal  day  =  0.99726957  mean  solar  days. 


1.8  Optical  Telescopes  Observe  Visible  Light 

Telescopes  collect  and  magnify  electromagnetic  radiation  from  a  cosmic  object, 
and  bigger  telescopes  provide  two  advantages.  They  gather  more  radiation  than  a 
smaller  telescope,  permitting  the  detection  of  fainter  objects  and  providing  a 
brighter  image  of  any  cosmic  object  for  analysis.  Big  telescopes  also  provide 
greater  angular  resolution,  which  is  the  ability  to  see  the  separation  between 
objects  that  are  close  together.  Better  resolution  permits  observation  of  finer  detail 
on  the  object  emitting  the  radiation.  Kitchin  (2013)  provides  a  thorough  discussion 
of  telescopes  and  the  techniques  of  using  them  to  observe  the  cosmos. 

Regardless  of  what  cosmic  object  a  telescope  is  pointed  at,  the  object’s  radi¬ 
ation  that  carries  information  to  the  Earth  travels  in  rays  that  are  parallel  to  one 
another.  A  telescope’s  lenses  and/or  mirrors  are  used  to  focus  and  collect  visible 
radiation,  or  light.  They  are  described  by  the  science  of  optics;  therefore  the  study 
of  visible  light  from  cosmic  objects  is  called  optical  astronomy.  There  are  two 
types  of  optical  telescopes,  the  refractor  and  the  reflector,  which  respectively  use  a 
lens  and  a  mirror  to  gather  and  focus  optically  visible  light  (Lig.  1.6).  A  tele¬ 
scope’s  lens  bends  the  incoming  rays  by  refraction,  focusing  them  to  a  point  where 
they  meet,  called  the  focal  point.  A  curved  mirror  reflects  the  incoming  rays, 
sending  them  to  the  focal  point. 

In  a  refractor,  light  is  bent  by  refraction  at  the  curved  surface  of  a  lens,  called  an 
objective,  toward  a  focal  point  where  the  different  rays  of  light  meet.  If  we  place  a 
detector  at  the  focal  point,  in  the  plane  parallel  to  the  lens,  we  can  record  an  image 
of  whatever  the  telescope  is  observing.  The  distance  from  the  lens  to  the  focal 
point  is  called  the  focal  length ,  which  determines  the  overall  size  of  the  image. 
The  critical  thing  is  the  diameter,  or  aperture,  of  the  light-gathering  lens. 


20 


1  Observing  the  Universe 


Sun 


Sun 


Refractor  Reflector 


Fig.  1.6  Telescopes  Light  waves  that  fall  on  the  Earth  from  a  distant  object  are  parallel  to  one 
another,  and  are  focused  to  a  point  by  the  lens  or  mirror  of  a  telescope.  Early  telescopes  were 
refractors  (left).  The  curved  surfaces  of  the  convex  objective  lens  bend  the  incoming  parallel  light 
rays  by  refraction  and  bring  them  to  a  focus  at  the  center  of  the  focal  plane,  where  the  light  rays 
meet  and  an  image  is  created.  A  second  smaller  lens,  called  the  eyepiece,  was  used  to  magnify  the 
image  in  the  early  refractors;  later  versions  placed  photographic  or  electronic  detectors  at  the 
focal  plane.  In  1670,  the  English  physicist  Isaac  Newton  (1643—1727)  constructed  the  first 
reflecting  telescope  (right),  which  used  a  large  concave,  or  parabolic,  primary  mirror  to  collect 
and  focus  light.  A  small  flat  secondary  mirror,  inclined  at  an  angle  of  45°  to  the  telescope  axis, 
reflected  the  light  sideways,  at  a  place  now  known  as  the  Newtonian  focus.  Other  light-deflecting 
mirror  arrangements  can  be  used  to  obtain  any  desired  focal  length,  which  varies  with  the 
curvature  and  position  of  small  convex  mirrors 
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The  larger  the  aperture,  the  more  light  is  gathered  and  the  finer  the  detail  that  can 
be  resolved. 

The  other  type  of  telescope,  the  reflector,  uses  a  concave  mirror  with  a  parabolic 
shape  to  gather  and  focus  the  light.  The  prime  focus  is  back  in  the  path  of  the 
incoming  light;  so  secondary  mirrors  are  sometimes  used  to  reflect  the  light  to 
another  place  of  observation.  There  are  three  types  of  secondary  mirrors  called  the 
Cassegrain,  Coude,  and  Newtonian  mirrors,  which  can  focus  light  to  different 
locations. 

Professional  astronomers  place  electronic  detectors  at  the  focal  point  of  tele¬ 
scopes.  These  detectors  generate  digitized  signals  that  are  analyzed,  manipulated 
and  recorded  in  a  computer.  A  Charge-Coupled  Device,  or  CCD  for  short,  might 
be  used  to  efficiently  detect  the  radiation  and  form  an  image.  Nowadays,  CCDs  are 
used  in  this  way  in  everything  from  digital  cameras  to  the  Hubble  Space  Tele¬ 
scope.  In  some  cases,  diffraction  gratings  are  used  to  separate  the  incoming 
radiation  into  its  component  wavelengths,  dispersing  it  into  fine  wavelength 
intervals  to  form  a  spectrum  that  can  then  be  recorded  by  a  CCD. 

The  diameter  of  this  primary  mirror  determines  the  telescope’s  light-gathering 
ability  and  resolution.  The  amount  of  radiation  that  can  be  collected  is  proportional 
to  the  area  of  the  mirror,  and  the  square  of  its  diameter. 

Optical  telescopes  are  even  named  by  the  diameter  of  their  mirror.  The  2.5  m 
(100  in.)  Hooker  Telescope  at  the  Mount  Wilson  Observatory,  California  and  the 
nearby  5  m  (200  in.)  Hale  Telescope  at  the  Palomar  Observatory,  California  are 
of  great  historical  importance.  Recent  large  optical  telescopes  include  the  four 
Very  Large  Telescopes,  each  of  8.2  m  (323  in.)  effective  aperture,  located  at  the 
Paranal  Observatory,  Chile,  the  two  10  m  (400  in.)  Keck  telescopes  at  the 
Mauna  Kea  Observatory  in  Hawaii,  the  10.4  m  (410  in.)  Gran  Telescopio 
Canarias  on  the  Canary  Islands  and  the  Large  Binocular  Telescope  located  on 
Mount  Graham  in  Arizona.  It  consists  of  two  8.4  m  (330  in.)  mirrors  on  a 
binocular  mount. 

The  ability  to  resolve  details  is  called  the  resolving  power  of  a  telescope.  It  is 
specified  by  the  angular  resolution,  0res,  a  quantity  that  depends  on  the  diameter, 
Dt ,  of  the  telescope  lens  or  mirror  and  the  wavelength,  2,  of  observation.  The 
mathematical  expression  is: 


rad, 


or 


c  2 

0res  =  2.06265  x  105  —  s  of  arc, 

Dj 

where  1  rad  is  equivalent  to  57.2957795°  and  206265//  (2.06265  x  105  s  of  arc), 
and  there  are  3,600  s  of  arc  in  a  degree.  This  equation  tells  us  that  a  bigger  lens  or 
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mirror  provides  finer  angular  resolution  at  a  given  wavelength,  and  that  longer 
wavelengths  require  larger  telescopes  to  achieve  the  same  resolution  as  a  smaller 
telescope  at  shorter  wavelength. 

Example:  What  is  the  smallest  source  detected  on  the  Sun  in  optically 
visible  light? 

Due  to  atmospheric  obscuration,  the  effective  angular  resolution  of  a  typical 

.  .  _  S' 

ground-based  optical  telescope  is  about  1  =  4.848  x  10  rad.  That  is  pretty 
good,  for  its  comparable  to  seeing  the  details  on  a  coin  with  a  diameter  of 
0.5  cm  from  1,000  m  away.  At  the  Sun’s  mean  distance  of  1.496  x  1011  m, 
this  corresponds  to  structures  that  are  748  km  across,  about  the  distance  from 
Boston  to  Washington,  D.C.  and  about  three-quarters  the  size  of  France. 


The  resolving  power  of  a  telescope  operating  at  the  wavelengths  that  we  can  detect 

’—j 

with  our  eye,  at  a  yellow  wavelength  of  about  6  x  10  m,  is  about  0. 1 24 /Dr"  ifDT 
is  in  meters.  By  way  of  comparison,  the  typical  angular  resolution  of  the  unaided 
human  eye  is  about  60",  so  the  eye  acts  like  a  lens  with  a  diameter  of  about  0.002  m, 
or  only  2  cm.  However,  some  people  have  sharper  vision  than  the  average. 

Turbulence  in  the  atmosphere  limits  the  resolution  of  any  telescope  operating  at 
visible  wavelengths  to  about  1  s  of  arc;  therefore,  the  angular  resolution  cannot  be 
improved  by  building  an  optical  telescope  larger  than  about  0.12  m  in  diameter. 
Similar  atmospheric  variations  cause  the  stars  to  “twinkle”  at  night.  This  atmo¬ 
spheric  limitation  to  angular  resolution  at  visible  wavelengths  is  called  seeing.  The 
best  seeing,  of  0.2"  in  unusual  conditions,  is  found  at  only  a  few  sites  in  the  world, 
and  optical  observatories  are  located  in  most  of  them.  Better  visible  images  with 
even  finer  detail  can  be  obtained  from  the  unique  vantage  point  of  outer  space, 
using  satellite-borne  telescopes  unencumbered  by  the  limits  of  the  atmosphere. 

Optical  astronomy  began  about  four  centuries  ago,  in  1609,  when  Galileo 
Galilei  (1564—1642)  turned  the  newly  invented  spyglass,  or  telescope,  toward  the 
night  sky,  and  discovered  four  previously  unknown  moons  that  circle  Jupiter.  He 
also  resolved  small  craters  on  the  Moon,  and  detected  numerous  stars  in  the  Milky 
Way  that  cannot  be  seen  by  the  unaided  eye  (Galilei  1610).  His  rudimentary 
telescope  was  a  refractor  with  a  lens  whose  diameter  was  only  0.04  m,  or 

1.6  inches  and  a  little  smaller  than  your  hand.  The  angular  resolution  of  his 

r! 

telescope  at  a  visual  wavelength  of  6  x  10“'  m  was  3.1",  and  an  angular  reso¬ 
lution  more  than  about  10  times  better  than  this  cannot  be  achieved  with  any 
optical,  or  visible  light,  telescope  on  the  planet. 

A  bigger  lens  or  mirror  also  collects  more  light  than  a  smaller  one,  permitting 
the  detection  of  fainter  sources.  The  human  eye,  for  example,  is  severely  limited  by 
its  inability  to  gather  light.  The  eye,  or  rather  the  brain  fed  by  the  eye,  can  store  the 
images  for  no  more  than  a  few  tenths  of  a  second.  That’s  fortunate  because  if  it 
stored  an  image  for  much  longer,  we  couldn’t  watch  movies.  Light  collected  using 
telescopes  can  be  stored  for  hours  or  more.  This  was  first  done  using  photographic 
plates  and  more  recently  by  using  electronic  chips. 
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1.9  Telescopes  that  Detect  Invisible  Radiation 

Telescopes  of  different  designs  are  used  to  detect  cosmic  radiation  outside  the 
optically  visible  wavelengths.  They  are  used  to  observe  otherwise  invisible  x-ray, 
ultraviolet,  infrared  and  radio  wavelengths. 

Relatively  long  radio  waves  are  detected  by  radio  telescopes,  also  known  as 
radio  antennas,  whose  shapes  of  are  similar  to  the  reflecting  mirror  of  an  optical 
telescope.  The  main  reflector,  called  a  dish,  is  a  parabolic  metal  surface  that 
gathers  the  incoming  radio  waves,  reflecting  and  focusing  them  to  an  electronic 
receiver  at  the  reflector  focus.  This  receiving  system  converts  the  intensity  of  the 
incoming  radio  signal  to  numbers  that  are  then  transmitted  to  a  computer.  The  data 
is  stored  in  the  computer  as  a  matrix  of  numbers  and  is  then  manipulated  to  form 
images. 

Our  angular  resolution  equation  also  applies  at  radio  wavelengths  where  very 
big  telescopes  are  required  to  achieve  significant  angular  resolution.  At  a  radio 
wavelength  of  0.1  m,  an  angular  resolution  of  l"  requires  a  telescope  with  a 
diameter  of  20  km.  The  advantage  of  radio  signals  is  that  the  atmosphere  does  not 
distort  them,  or  limit  the  angular  resolution.  We  can  observe  the  radio  universe  on 
a  cloudy  day,  and  during  the  day  or  night,  just  as  radio  signals  are  used  to  com¬ 
municate  with  satellites  at  any  time,  even  when  it  rains  or  snows  outside. 

Since  radio  waves  are  millions  of  times  longer  than  those  of  light,  a  radio  telescope 
needs  to  be  at  least  a  million  times  bigger  than  an  optical  telescope  to  obtain  the  same 
resolving  power.  For  this  reason,  the  first  radio  telescopes  provided  a  very  myopic, 
out-of-focus  view.  But  this  limitation  was  soon  overcome  when  radio  astronomers 
built  successively  larger  telescopes,  culminating  in  the  100  m,  fully  steerable  par¬ 
abolic  dish  at  Effelsberg,  West  Germany.  Its  best  angular  resolution  is  about  10". 
This  may  be  the  largest  steerable  radio  dish  that  can  be  built,  but  a  novel  way  of 
building  an  even  larger  dish  was  to  cover  the  floor  of  a  valley  with  metal  screen, 
producing  a  305  m  dish  in  Arecibo,  Puerto  Rico.  This  antenna  relies  on  the  rotation  of 
the  Earth  to  bring  different  regions  of  the  sky  into  view. 

Nowadays,  relatively  small  radio  telescope  separated  by  large  distances  (called 
baselines)  are  combined  and  coordinated  electronically.  This  results  in  radio 
images  that  are  as  sharp  as  optical  ones  (Fig.  1.7).  Because  it  is  spread  out,  an 
array  of  small  telescopes  has  the  property  that  is  crucial  for  high  resolving  power  - 
namely,  great  size  relative  to  wavelength.  The  technique  is  known  as  interfer¬ 
ometry  because  it  analyzes  how  the  waves  detected  at  the  telescopes  interfere 
when  they  are  added  together,  so  the  interferometer  is  an  interference  meter. 
A  simple  example  combines  the  signals  from  a  pair  of  telescopes  with  a  computer 
to  reconstruct  the  waves  and  create  the  image. 

The  sensitivity  of  an  interferometric  array  is  determined  by  the  combined  areas  of 
the  individual  elements,  and  not  by  their  separations.  For  example,  the  two-element 
interferometer  can  resolve  details  that  are  much  finer  than  its  component  radio 
telescopes  can  by  themselves.  But  the  interferometer’s  collecting  area  is  only  twice 
that  of  the  two  individual  components,  so  its  sensitivity  is  just  twice  as  great.  Many 
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Fig.  1.7  Interferometer  When  incoming  radiation  approaches  the  Earth  at  an  angle,  the  crests 
will  arrive  at  two  separated  telescopes  at  slightly  different  times.  This  delay  in  arrival  time  is  the 
distance  X  divided  by  the  speed  of  light.  If  X  is  an  exact  multiple  of  the  wavelength,  then  the 
waves  detected  at  the  two  telescopes  will  be  in  phase  and  add  up  when  combined.  If  not,  they  will 
be  out  of  phase  and  interfere.  The  angular  resolution  of  such  an  interferometer ,  or  interference 
meter,  is  equal  to  the  wavelength  divided  by  the  effective  baseline.  When  the  object  being 
observed  is  directly  overhead,  the  effective  baseline  is  equal  to  the  distance  between  the  two 
telescopes 


two-element  telescope  pairs  are  therefore  combined  in  a  full-fledged  radio  array  to 
gather  more  radio  radiation.  The  Very  Large  Array,  abbreviated  VLA,  is  an  example 
of  a  modern  interferometric  radio  array  that  is  used  to  observe  the  cosmos.  Quir- 
renbach  (2001)  provided  a  review  of  optical  interferometry  at  visible  wavelengths. 

Even  better  angular  resolution  is  obtained  using  radio  signals  recorded  at 
widely  separated  radio  telescopes  when  they  are  observing  the  same  cosmic  object 
at  the  same  time.  The  recorded  signals  can  be  combined  to  effectively  turn  the 
Earth  into  a  giant  radio  telescope  with  transcontinental  interferometer  baselines 
and  the  sharpest  vision  of  any  telescope  on  the  Earth  or  in  space. 

Example:  Very  long  baseline  interferometry 

At  a  radio  wavelength  of  2  =  0.1  m,  the  diameter,  DT ,  of  a  telescope  needed  to 
obtain  a  resolution  of  dres  =  l"  is  DT  =  2.063  x  105  2/0res  ~  20,000  m 
=  20  km.  But  an  interferometer  with  radio  telescopes  placed  on  opposite  sides 
of  the  Earth  will  have  a  baseline  of  up  to  twice  the  planet’s  radius,  or  about 
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10  m.  That  would  give  it  an  angular  resolution  of  about  0.001 6"  at  a  wave¬ 
length  of  0.1  m. 


Radio  telescopes  do  not  provide  the  only  window  on  the  cosmos.  There  are  also 
invisible  gamma  rays,  x-rays,  ultraviolet  and  infrared  telescopes.  Radiation  com¬ 
ing  from  celestial  objects  at  these  wavelengths  is  absorbed  in  our  atmosphere  and 
must  be  collected  by  telescopes  in  satellites  that  orbit  the  Earth  above  its 
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atmosphere.  All  of  these  space  telescopes  measure  the  intensity  of  the  incoming 
signal  and  convert  these  measurements  into  radio  transmissions  that  are  sent  to 
radio  telescopes  and  receivers  on  the  ground.  NASA  has,  for  example,  launched 
four  large  Space  Telescopes ,  the  Great  Observatories  named  Hubble ,  Compton , 
Chandra  and  Spitzer ,  which  respectively  operate  at  visible,  gamma  ray,  x-ray  and 
infrared  wavelengths  (Focus  1.4).  The  European  Space  Agency,  abbreviated  ESA, 
has  built  and  launched  the  large  infrared  Herschel  telescope  that  is  used  at  far 
infrared  and  sub-millimeter  wavelengths.  The  American,  European  and  Japanese 
space  agencies  have  sent  a  host  of  satellites  into  space  with  telescopes  designed  to 
observe  specific  cosmic  phenomena,  such  as  activity  on  the  Sun  that  is  observed 
from  telescopes  on  SOHO ,  Hinode ,  STEREO ,  and  SDO.  The  distances  and  motions 
of  stars  have  been  observed  from  HIPPARCOS ,  and  the  cosmic  microwave 
background  radiation  delineated  from  COBE ,  WMAP  and  PLANCK. 

Focus  1.4  The  great  observatories 

NASA  has  sent  four  large  telescopes  into  orbit  around  the  Earth,  designed  to 
study  the  universe  in  both  optically  visible  light  and  non-visible  forms  of 
radiation.  The  first  in  the  series  was  the  Hubble  Space  Telescope ,  or  HST  for 
short,  launched  on  April  24,  1990  and  operating  mainly  in  visible  light,  the 
second  was  the  Compton  Gamma  Ray  Observatory  ( CGRO ),  launched  into 
Earth  orbit  on  April  5,  1991,  the  third  was  the  Chandra  X-ray  Observatory , 
or  CXO ,  launched  on  July  23,  1999,  and  the  fourth  was  the  infrared  Spitzer 
Space  Telescope ,  or  SST,  launched  on  August  23,  2003. 

The  HST  is  the  largest  visible-light,  astronomical  telescope  ever  put  into 
space,  with  a  2.4  m  (94.5  in.)  primary  mirror.  The  angular  resolution  of 
optically  visible  light  telescopes  in  space  is  not  limited  by  the  Earth’s 
atmosphere,  so  the  HST  has  an  angular  resolution  as  good  as  0.05".  It  has 
observed  newly  formed  galaxies  when  the  universe  was  less  than  half  its 
present  age,  contributing  to  our  understanding  of  the  age  and  evolution  of  the 
Cosmos,  watched  super-massive  black  holes  consuming  the  material  around 
them,  and  helped  astronomers  determine  how  a  mysterious  “dark  energy” 
has  taken  over  the  expansion  of  the  universe.  The  telescope  is  named  for  the 
American  astronomer  Edwin  Hubble  (1889—1953)  who  demonstrated  that 
spiral  nebulae  are  galaxies  like  our  own  Milky  Way  and  found  that  galaxies 
move  away  from  us  at  speeds  that  increase  with  their  distance. 

Chandra  investigates  the  high-energy,  x-ray  regions  of  the  universe  from 
objects  such  as  active  galactic  nuclei,  black  holes,  clusters  of  galaxies,  dark 
matter,  galaxies,  neutron  stars,  pulsars,  quasars,  supernova  remnants,  su¬ 
pernovae,  and  white  dwarfs.  It  is  named  after  the  Indian- American  astro¬ 
physicist  and  Nobel  laureate,  Subrahmanyan  Chandrasekhar  (1910—1995). 

Instruments  aboard  Compton  detected  thousands  of  energetic,  brief 
gamma-ray  bursts,  as  well  as  gamma  rays  from  black  holes,  pulsars,  quasars, 
and  supernovae.  The  gamma  ray  observatory  was  named  after  the  American 
physicist  Arthur  H.  Compton  (1892—1962). 
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The  Spitzer  telescope  obtains  images  and  spectra  of  the  infrared  energy, 
or  heat,  radiated  by  cosmic  objects  at  wavelengths  between  3  and  180 
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microns,  where  1  micron  is  10  m  or  one-million  of  a  meter.  The  telescope 
has  a  0.85  m  (33.5  in.)  primary  mirror.  It  provides  information  on  the  for¬ 
mation,  composition  and  evolution  of  planets,  stars  and  galaxies,  and  is 
named  after  the  American  astrophysicist  Lyman  Spitzer,  Jr.  (1914—1997). 
Most  satellite  telescopes,  including  Hubble ,  circle  our  planet  outside  the 
Earth’s  atmosphere  while  remaining  nearby  to  send  observations  down  to  the 
ground  by  radio  signals,  but  Spitzer  revolves  around  the  Sun,  trailing  behind 
the  Earth  in  its  orbit,  to  avoid  heat  from  the  Earth  and  the  Moon.  A  supply  of 
liquid  helium  initially  cooled  the  instrument  to  almost  absolute  zero  so  the 
telescope’s  heat  radiation  would  not  interfere  with  its  detectors. 

NASA’s  next  Great  Observatory,  the  James  Webb  Space  Telescope ,  or 
JWST  for  short,  is  working  on  a  launch  date  of  2018.  It  will  have  a  large 
mirror,  6.5  m  (21.5  feet)  in  diameter,  and  will  operate  at  infrared  wave¬ 
lengths  that  will  permit  the  observations  of  distant  galaxies  that  formed  in 
the  early  universe,  as  well  as  nearby  planet-forming  regions. 

In  the  meantime,  on  May  14,  2009,  the  European  Space  Agency,  abbre¬ 
viated  ESA,  launched  the  Herschel  infrared  telescope;  its  primary  mirror  is 
3.5  m  (138  in.)  in  diameter.  It  is  named  for  the  English  astronomer  Sir 
William  Herschel  (1738—1822),  the  discoverer  of  the  infrared  spectrum  and 
the  planet  Uranus,  and  his  sister  and  collaborator  Caroline  (1750—1848). 
Herschel  is  used  to  see  deep  into  star-forming  regions,  galactic  centers  and 
planetary  systems. 


Astronomy  from  space  has  several  advantages  over  ground-based  observations. 
The  weather  in  space  is  always  perfectly  clear  and  the  atmosphere  does  not  blur 
images  obtained  from  telescopes  in  space.  Furthermore,  a  large  telescope  is  not 
needed  to  observe  the  short  ultraviolet  and  x-ray  wavelengths  with  high  angular 
resolution.  The  aperture  must  be  only  0.002  m  across  to  achieve  an  angular  res- 
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olution  of  1  at  an  extreme  ultraviolet  wavelength  of  10  m  and  only  0.00002  m 
for  the  same  resolution  at  soft  x-rays  of  10  keV  in  energy  and  a  wavelength  of 
10“ 10  m.  Moreover,  in  space  the  sky  is  truly  dark,  and  observations  do  not  need  to 
be  limited  to  the  night. 

Example:  The  size  of  optical,  x-ray  and  radio  telescopes,  and  resolving 
features  on  the  Sun  or  Moon 

The  angular  resolution,  6 ,  of  a  telescope  with  a  diameter  DT  operating  at  a 
wavelength  2  is  given  by  6  =  X/DT  rad,  where  1  rad  =  2.06265  x  105  "  and 
"  denotes  second  of  arc  with  1"  =  4.848  x  1 0  6  rad.  The  angular  resolution 
of  most  ground-based,  visible  light,  or  optical,  telescopes  is  limited  by  tur¬ 
bulence  in  the  Earth’s  atmosphere  to  about  1",  or  under  excellent  seeing 
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conditions  at  remote  mountain  tops  to  perhaps  0.25  =  1.212  x  10  rad, 
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which  is  also  the  angular  resolution  of  the  Solar  Optical  Telescope  on  the 
Hinode  mission.  The  size  of  a  telescope  required  to  give  this  angular  resolution 
at  a  yellow  wavelength  of  X  =  580  nm  is  DT  =  X/6  «  0.5  m.  The  Hinode 
mission  also  has  an  x-ray  instrument,  operating  at  a  wavelength  of 
X  =  1.24  x  10-10  m,  corresponding  to  10  keV  in  photon  energy  (see  Sect.  2.6 
for  definition  of  photon  energy).  The  size  of  the  x-ray  telescope  for  the  same 
angular  resolution  is  DT  =  X/6  «  0.0001  m.  A  ground-based  radio  telescope 
is  not  limited  in  angular  resolution  by  the  atmosphere,  but  to  achieve  an  angular 
resolution  of  0.25"  at  a  radio  wavelength  of  X  =  1  m,  the  telescope  diameter 
would  have  to  be  DT  =  X/6  «  106  m  or  a  million  meters,  comparable  in  size 
to  a  large  country  and  about  one  sixth  the  radius  of  the  Earth. 

The  smallest  linear  size,  L,  that  can  be  resolved  on  the  Sun  with  any  tele¬ 
scope  with  this  angular  resolution  is  L  =  D  x  6,  where  the  mean  distance 
between  the  Earth  and  the  Sun  is  D  =  1  AU  =  1.496  x  1011  m,  and  therefore 
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L  «  181  km  for  an  angular  resolution  of  0.25  =  1.212  x  10  rad.  By 
way  of  comparison,  the  angular  resolution  of  the  human  eye  is  about  60"  so  the 
smallest  sunspot  that  the  eye  can  resolve  has  a  size  of  L  =  AU  x 
60"  x  4.848  x  10-6  «  4.35  x  107  m  «  6.8  RE  where  the  radius  of  the 
Earth  RE  =  6,378  km.  The  tallest  mountain  that  can  be  observed  at  the  limb, 
or  edge,  of  the  Moon  with  the  unaided  human  eye  will  have  a  height 
H  =  D  x  6  where  the  mean  distance  of  the  Moon  is  3.844  x  108  m  and  for 
0  =  60"  we  have  a  mountain  height  of  H  =  1.12  x  105  m,  much  taller  than 

o 

Mount  Everest,  whose  elevation  is  8.848  x  10  m.  In  other  words,  we  could 
not  see  mountains  on  the  Moon  until  telescopes  were  used. 


1.10  Units  Used  by  Astronomers  and  Astrophysicists 

By  any  terrestrial  standard,  the  scale  of  astronomical  objects  is  enormous  in  mass, 
luminosity,  distance,  size  and  age.  Astronomers  and  astrophysicists  use  the  Sun’s 
values  of  these  quantities  as  benchmark  units  that  reflect  their  large  amount.  Any 
solar  value  is  denoted  by  a  subscript  symbol  ©,  a  circle  with  a  dot  in  the  center. 

The  mass  of  the  Sun,  denoted  by  the  symbol  M0,  is  often  used  as  the  unit  of 
celestial  mass.  Its  value  is 


Me  =  1.989  x  1030  kg, 

where  kg  denotes  a  kilogram,  or  1,000  g.  The  mass  of  most  stars  lies  in  a  narrow 
range  of  between  0.1  M0  and  100  M0,  just  as  the  mass  of  newborn  children  varies 
by  a  relatively  small  amount.  Galaxies  typically  contain  about  100  billion,  or  100 
thousand  million,  stars,  so  the  stellar  mass  of  a  galaxy  is  about  1011  M0,  but  a 
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galaxy  often  contains  even  more  mass  in  invisible  dark  matter  that  lies  beyond  the 
visible  stars. 

The  rate  at  which  radiation  carries  energy  away  from  a  cosmic  object  is  known 
as  its  luminosity,  designated  by  L.  Luminosity  has  the  units  of  energy  per  unit 
time,  which  is  also  the  unit  of  power.  The  SI  unit  of  luminosity  is  a  joule  per 
second  (J  s-1),  where  joule  is  the  unit  of  energy,  and  one  watt  of  power  is  equal  to 
one  joule  per  second,  or  1  J  s  1  =  1  W. 

The  unit  of  luminosity  used  by  astronomers  is  often  the  Sun’s  luminosity, 
denoted  L0.  Its  value  is: 


Lq  =  3.828  x  1026  Js-1, 

where  J  s-1  denotes  joule  per  second.  Stars  vary  by  many  orders  of  magnitude  in 
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their  luminosity,  from  0.001  L0  to  a  million  L0  or  from  10  L0  to  10  L0.  The 
luminosity  of  a  galaxy  is  roughly  1011  L0. 

In  astronomy  and  astrophysics,  temperatures  are  measured  on  the  kelvin  scale, 
named  after  Lord  Kelvin  (1824—1907)  who  proposed  it  (Kelvin  1848).  This 
temperature  unit  is  written  kelvin,  with  a  lower  case  k,  and  assigned  the  unit 
symbol  capital  K.  The  freezing  temperature  of  water  is  273.15  K  and  the  boiling 
temperature  of  water  at  sea  level  on  Earth  is  373.15  K.  The  kelvin  scale  is  an 
absolute,  thermodynamic  temperature  scale  where  absolute  zero  is  the  temperature 
at  which  all  thermal  motion  ceases.  Nothing  can  move  at  a  temperature  of  0  K.  For 
conversion  to  the  degrees  Centigrade,  denoted  by  C,  and  degrees  Fahrenheit, 
abbreviated  by  F,  we  have  K  =  C  +  273.15  =  (5F/9)  +  255.22,  with  C  =  K  — 
273.15  and  F  =  (9  K/5)  -  459.67  =  (9C/5)  +  32. 

Astronomers  use  the  astronomical  unit  (AU)  as  the  unit  of  distance  within  the 
solar  system.  It  is  the  mean  distance  between  the  Earth  and  the  Sun,  with  a  value  of 

1  AU  =  149597870691  m  «  1.496  x  10nm. 

To  be  exact,  astronomers  now  use  the  speed  of  light,  denoted  by  the  lower  case 
letter  c,  as  a  defining  constant  for  distance,  with 

Speed  of  light  =  c  =  299792458  m  s-1  «  2.9979  x  108m  s-1, 
with  the  derived  value  of  the  light  travel  time,  ta,  for  1  AU 

ta  =  499.0047863852  s  «  499  s. 

This  is  the  time  it  takes  for  radiation  to  travel  from  the  Sun  to  the  Earth. 

The  unit  of  stellar  size  is  the  radius  of  the  Sun,  denoted  Rq,  given  by 

Rq  =  6.955  x  108  m. 

The  supergiant  stars  can  be  as  large  as  1,000  Rq  in  radius,  giant  stars  are  about 
10  times  smaller,  and  the  smallest  stars  that  shine  by  nuclear  reactions  are  about 
0.1  Rq  in  radius.  Collapsed  white  dwarf  stars  are  about  as  big  as  the  Earth,  whose 
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radius  is  6.378  x  10  m  and  about  0.01  Rq. 
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The  distances  between  stars  are  expressed  in  a  unit  called  the  parsec,  or  pc  for 
short,  where 

lpc  =  3.08567758128  x  1016  m  =  206265  AU  «  3.086  x  1016m, 
which  is  also  equivalent  to 

lpc  =  3.261564 light-year  «  3.26 light-year, 
where  a  light-year  is  the  distance  light  travels  in  one  year  at  the  speed  of  light,  c,  or 

1  light-year  =  9.4607304726  x  1015  m  «  9.461  x  1015  m. 

As  we  shall  subsequently  see,  the  term  parsec  is  derived  from  the  parallax 
method  of  determining  distance,  where  one  parsec  is  a  parallax  of  one  second  of 
arc. 

Example:  How  far  away  and  long  ago  was  starlight  emitted? 

The  Sun  is  located  at  a  mean  distance  of  1  AU  =  1.496  x  1011  m.  Traveling 
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at  the  speed  of  light  c  =  2.9979  x  10  m  s  ,  it  takes  a  time 
t  =  AU /c  =  499  s  for  radiation  to  travel  from  the  Sun  to  the  Earth,  so  the 
sunlight  we  see  this  very  moment  was  emitted  499  s  ago.  The  nearest  star 
other  than  the  Sun  is  Proxima  Centauri,  and  it  takes  4.286  light-years  for 
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starlight  to  travel  from  this  star  to  the  Earth.  Since  1  year  =  3.156  x  10  s, 
the  ratio  of  the  distance  to  Proxima  Centauri  and  the  distance  to  the  Sun  is 
about  4.286  x  3.156  x  107/499  «  271,000.  The  first  stars  were  formed 
shortly  after  the  big  bang,  which  occurred  about  13.7  billion  years  ago.  So 
these  first  stars  are  located  at  a  distance  of  about  13.7  billion  light-years. 
Using  the  conversion  of  1  parsec  =  1  pc  =  3.26  light-years  and  1  light-year 
=  9.461  x  1015  m,  these  first  stars  are  located  at  a  distance  of  about 
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4.2  x  10  pc  and  1.3  x  10“  m,  almost  1  million  billion  times  further  away 
than  the  Sun. 


The  extent  of  a  galaxy  is  measured  in  units  of  kiloparsec,  or  kpc  for  short,  where  1 
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kpc  =10  pc.  The  distance  between  the  Sun  and  the  center  of  our  Galaxy  is,  for 
example,  about  8.5  kpc.  Nearby  galaxies  are  separated  by  about  a  million  parsecs, 

/r 

denoted  as  a  megaparsec  and  abbreviated  Mpc,  where  1  Mpc  =10  pc  = 
3.0857  x  10““  m.  The  nearest  large  spiral  galaxy,  Andromeda  or  M  3 1 ,  is  located  at 
a  distance  of  0.78  Mpc,  while  a  very  remote  galaxy  might  be  at  a  distance  of  a  billion 
parsec,  denoted  as  a  gigaparsec  and  abbreviated  as  Gpc,  where  1  Gpc  =  109  pc. 

Astronomers  use  the  second,  abbreviated  by  the  lower  case  letter  s,  for  small 
time  scales  and  the  year,  or  yr  for  short,  for  large  ones.  The  orbital  period  of  the 
Earth  around  the  Sun  is  one  year,  with  a  value  of 

lyr=  3.156  x  107s  =  365.25  d, 
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Table  1.2  Principal  SI  units  and  their  conversion  to  corresponding  c.g.s.  units 


Quantity 

SI  units 

Conversion  to  c.g.s.  units 

Lengtha 

Meter  (m) 

100  centimeters  (cm) 

Mass’* 

Kilogram  (kg) 

1,000  grams  (g) 

Speedc 

Meter  per  second  (m  s_1) 

100  centimeters  per  second  (cm  s-1) 

Energy  d 

Joule  (J) 

10,000,000  erg  =  107  erg 

Power 

Watt  (W)  =  J  s_1 

10,000,000  erg  s-1  =  107  erg  s-1 

Temperature 

Kelvin  (K) 

degrees  Centigrade  =  C  =  K— 273 

Magnetic  flux  density 

Tesla  (T) 

10,000  gauss  (G)  =  104  G 

Force 

Newton  (N)  =  kg  m  s  ' 

100,000  dyn  =  105  dyn 

Pressure 

Pascal  (Pa)  =  N  m-2 

10  dyn  cm 
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One  nanometer  (nm)  is  1  nm  =10  m;  1  A  =  1  A  =  10  m;  1  mile  =  1.609  km; 
and  1.0  in.  =  2.54  cm 


b  1  ton  =  2,240  lb  =  1.016  047  x  103  kg  «  103  kg 

c  Speed  is  the  magnitude  of  velocity.  1ms-1  =  3.600  km  h-1  =  2.237  miles  per  hour,  and  1 
mile  per  hour  =  44.704  cm  s-1 

d  The  energy  of  high-energy  particles  and  x-ray  radiation  are  often  expressed  in  units  of  kilo- 
electron  volts,  or  keV,  where  1  keV  =  103  eV  =  1.602  x  10-16  J,  or  MeV  =  1,000  keV,  with 
1  eV  =  1.602  176  487  x  10“19  J  «  1.602  x  10“19  J 

and  1  day  =  1  d  =  86,400  s.  A  pulsar  might  rotate  with  a  period  of  1  s,  the  Earth 
is  4.6  billion  years,  or  4.6  Gyr,  old  and  the  expanding  universe  originated  about 
14  billion  years  ago.  The  International  System  of  Units  (Systeme  International, 
abbreviated  SI,  is  used  in  this  book.  It  includes  the  length  unit  of  meter  (m),  the 
mass  unit  of  kilogram  (kg),  and  the  time  unit  of  second  (s).  The  SI  units  of  energy, 
luminosity,  temperature,  and  magnetic  field  strength,  are  joule  (J),  watt  (W,  or 
J  s'1),  kelvin  (K),  and  tesla  (T),  respectively. 

Many  astronomers  and  astrophysicists  have  often  used,  and  still  use,  the  c.g.s. 
units  of  centimeter  (cm),  gram  (g),  second  (s)  in  their  professional  papers. 
Conversions  from  the  SI  units  to  the  c.g.s.  units  are  given  in  Table  1.2. 


1.11  Physical  Constants 

Since  the  fundamental  laws  of  physics  apply  throughout  the  universe,  the  physical 
constants  used  in  the  equations  that  describe  these  laws  are  thought  to  be  universal 
and  unvarying  in  space  or  time.  These  constants  include  the  speed  of  light,  c,  the 
Newtonian  gravitational  constant,  designated  G,  the  Boltzmann  constant,  denoted 
by  k  and  the  Planck  constant,  designated  h. 

The  speed  of  light,  c,  is  independent  of  the  frame  of  reference  in  space  or  time. 
It  provides  an  upper  speed  limit  affecting  any  object  in  the  entire  universe,  and  has 
a  value  of 

c  =  speed  of  light  =  299,792,458  m  s-1  «  2.9979  x  108  m  s^1 
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The  gravitational  constant  G  enters  into  Newton’s  universal  law  of  gravitation, 
expressing  the  force  of  gravity  F  =  GMm/D  between  two  bodies  of  mass  M  and 
m  separated  by  a  distance  D.  If  the  law  of  gravitation  is  universal,  applying  to  all 
objects  in  the  universe,  than  G  must  be  independent  of  time,  position,  mass  and  the 
nature  of  the  bodies.  It  has  a  value  of 

G  =  gravitational  constant  =  6.67428  x  10  11  N  m2  kg  2 
~  6.674  x  10-11  N  m2  kg-2. 

The  Boltzmann  constant  k  and  the  Planck  constant  h  are  quantum  constants 
used  to  describe  the  macroscopic  properties  of  exceedingly  small  things.  Atoms  in 
“thermal”  equilibrium  are  characterized  by  a  single  temperature  T,  and  an  energy 
E  «  kT,  where 

k  =  Boltzmann  constant  =  1.3806504  x  10  23  J  K  1  «  1.3806  x  10  23  J  K_1. 

Astrophysicists  measure  temperature  on  the  kelvin  scale,  where  the  symbol  K 
denotes  degrees  kelvin.  The  Boltzmann  constant  k  appears  in  the  statistical 
description  of  the  velocities  of  atoms  that  are  in  thermal  equilibrium  and  in  the 
ideal  gas  law  that  specifies  the  pressure  of  a  gas  at  a  given  temperature. 

The  Planck  constant  h  is  used  to  describe  the  particle,  or  photon,  nature  of 
radiation,  specifying  the  photon  energy  E  =  hv  for  radiation  of  frequency  v,  where 

h  =  Planck  constant  =  6.626069  x  10  34  J  s  ~  6.6261  x  10  34  J  s. 

This  constant  appears  in  the  description  of  thermal  (blackbody)  radiation,  and 
when  describing  the  interaction  of  radiation  with  matter. 

Additional  universal  constants,  listed  at  http://physics.nsit.gov/,  include  the 
electric  and  magnetic  constants: 

=  electric  constant  =  permittivity  of  vacuum  =  (10~9/(367c) 

=  8.8542  x  10“12F  m_1 

_ n 

fi0  =  magnetic  constant  =  permeability  of  vacuum  =  4tz  x  10 
=  1.2566  x  10-6N  A-1. 


Chapter  2 

Radiation 


2.1  Electromagnetic  Waves 

The  physical  perception  of  the  universe  is  governed  almost  solely  by  the  elec¬ 
tromagnetic  radiation  received  from  cosmic  objects.  This  radiation  carries  energy 
and  moves  through  space  in  periodic  waves  at  the  speed  of  light,  designated  by  the 
lower  case  letter  c.  The  speed  of  light  in  empty  space  is  a  universal  constant, 
independent  of  reference  in  space  and  time.  The  radiation  is  called  electromagnetic 
because  it  propagates  by  the  interplay  of  oscillating  electric  and  magnetic  waves. 

Our  understanding  of  electricity  and  magnetism  is  founded  upon  the  experi¬ 
mental  investigations  of  the  English  scientist  Michael  Faraday  (1791-1867),  who 
invented  the  first  rotating  electric  motor  and  discovered  electromagnetic  induction, 
the  principle  behind  the  electric  transformer  and  generator  (Faraday  1843).  His 
experiments  led  Faraday  to  propose  that  electromagnetic  forces  extend  into  empty 
space  around  charged  bodies,  electrical  conductors,  and  magnets;  these  invisible 
forces  are  now  called  electromagnetic  fields. 

The  Scottish  mathematician  and  theoretical  physicist  James  Clerk  Maxwell 
(1831-1879)  was  able  to  express  Faraday’s  results  in  a  precise  mathematical  form, 
now  known  as  Maxwell’s  equations  (Maxwell  1865).  These  four  partial  differential 
equations  depend  on  variations  of  the  force  fields  in  four  dimensions  -  three  for 
space  and  one  for  time. 

In  regions  with  no  charge  or  currents,  such  as  a  vacuum,  Maxwell’s  equations 
describe  sinusoidal  electromagnetic  waves  (Focus  2.1).  The  waves  described  by 
this  electromagnetic  wave  equation  have  a  speed  equal  to  the  speed  of  light, 
leading  Maxwell  to  comment,  “light  is  an  electromagnetic  disturbance  propagated 
through  the  field  according  to  electromagnetic  laws.”  The  changing  magnetic  field 
creates  a  changing  electric  field  that,  in  turn,  creates  a  changing  magnetic  field. 
The  electric  and  magnetic  field  directions  are  orthogonal  to  each  other  and  to  the 
direction  of  travel. 
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2  Radiation 


Focus  2.1  Plane  waves  of  electromagnetic  radiation 

For  the  electric  field,  E ,  and  the  magnetic  field,  B ,  in  free  space,  Maxwell’s 
equations  take  the  form  (Maxwell  1865): 


V  E  =  0 

V  B  =  0 


Vx£  =  - 
X7  x  B  =  /us 


dB 

dt 

dE 

Qt' 


where  V-  is  the  divergence  operator,  with  units  of  m  \  Vx  is  the  curl 
operator,  with  units  of  s  \  and  d /dt  is  the  partial  derivative  with  respect  to 
time,  t.  These  equations  can  be  written  as  second-order  partial  differential 
equations: 


V2E~  — 


1  d2E 

2 


VAB~  — 


dt 2 

1  d2B 


dt2 


=  0 


0 


(2.2) 


that  describes  the  propagation  of  electromagnetic  waves  through  a  medium 
or  a  vacuum.  Here  c  is  the  speed  of  light  in  the  medium.  In  a  vacuum: 

c  =  —^—  =  2.99792458  x  108 ms"1,  (2.3) 

\/IkX> 

where  the  electric  constant,  or  vacuum  permittivity,  So  =  8.854187817  x 

1  O  1  1  O  1 

10_  F  m~  «  8.854  x  10  “  F  m  ,  and  the  magnetic  constant,  or 
vacuum  permeability,  /i0  =  1.256632061  x  10-6  N  A-2  «  1.257  x 
10  N  A  .  In  a  medium  with  refractive  index  n: 


1  _  1 

^ \J  ^0  Bo  y/T^ 


where  s  is  the  electric  permittivity  of  the  medium  and  jjl  is  the  magnetic 
permeability  of  the  medium. 

There  are  sinusoidal,  plane- wave  solutions  of  these  equations  written  as: 


r,  t)  =  E0  cos  (cot  —  k  •  r) 

B(r,  t )  =  Bq  cos  (cot  —  k  •  r), 

where  t  is  the  time  variable,  and  the  angular  frequency,  co,  is  related  to  the 
wavenumber,  k,  by  the  dispersion  relation 
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c 

where  A  is  the  wavelength,  v  =  co/2n  =  c/A  is  the  frequency  in  s  \  and  the 
constant  n  3.154159. 

The  energy  flux,  S,  and  energy  density,  U,  in  the  plane  wave  are 


which  is  directed  along  the  direction  of  wave  propagation,  and 


U 


An 


Maxwell  realized  that  c  equals  the  speed  of  light,  which  others  had  pre¬ 
viously  measured,  and  concluded  that  light  is  a  form  of  electromagnetic 
radiation. 


In  common  with  any  wave,  electromagnetic  radiation  has  a  wavelength,  usually 
denoted  by  the  lowercase  Greek  letter  lambda,  A.  The  wavelength  is  the  distance 
between  successive  crests  or  successive  troughs  (Fig.  2.1).  Different  types  of 
electromagnetic  radiation  differ  in  their  wavelength,  although  they  propagate  at  the 
same  speed.  Like  waves  on  water,  electromagnetic  waves  have  crests  and  troughs; 
but,  unlike  water  waves,  electromagnetic  waves  can  propagate  in  vacuous  empty 
space. 

In  SI  units,  the  wavelength  is  measured  in  meters,  abbreviated  m.  Other  units  of 
wavelength  are  the  nanometer,  or  nm  for  short,  where  1  nm  =  10-9  m,  the 

O  O  O  _  1  A 

Angstrom,  abbreviated  A  where  1  A  =  10  m  =  0.1  nm,  and  the  micron, 

_ z 

denoted  j i  where  1  ji  =  10  m.  Radio  astronomers  might  specify  the  wavelength 

_ o 

in  meters  or  centimeters,  abbreviated  cm  where  1  cm  =10  ~  m. 


Fig.  2.1  Electromagnetic  waves  All  forms  of  radiation  consist  of  electrical  and  magnetic  fields 
that  oscillate  at  right  angles  to  each  other  and  to  the  direction  of  travel.  They  move  through  empty 
space  at  the  speed  of  light.  The  separation  between  adjacent  wave  crests  is  called  the  wavelength 
of  the  radiation  and  usually  is  designated  by  the  lowercase  Greek  letter  lambda,  A 
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Sometimes  radiation  is  described  by  its  frequency,  denoted  by  the  lower  case 
Greek  letter  nu  written  v.  The  frequency  indicates  how  fast  the  radiation  oscillates, 
or  moves  up  and  down.  The  frequency  of  a  wave  is  the  number  of  wave  crests 
passing  a  stationary  observer  every  second,  measured  in  Hertz,  abbreviated  Hz. 
One  Hertz  is  equivalent  to  one  cycle  per  second,  or  1  Hz  =  1  s_1.  Radio 
astronomers  use  a  frequency  unit  of  megahertz,  abbreviated  MHz,  where 

S' 

1  MHz  =10  Hz.  Radio  stations  that  transmit  frequency  modulated,  or  FM,  sig¬ 
nals,  are  denoted  by  their  call  letters  and  the  frequency  of  their  broadcasts  in  MHz. 
The  frequency  range  of  FM  radio  broadcasts  is  88-108  MHz.  Amplitude  modu¬ 
lated,  or  AM,  radio  signals  are  broadcast  in  several  bands  of  frequency  between 
0.150  and  30  MHz. 

Electromagnetic  waves  all  travel  through  empty  space  at  the  same  constant 
speed  -  that  is,  the  speed  of  light  in  a  vacuum  c  =  299,792,458  m  s  \  or  about 
2.9979  x  108  m  s  '.  The  product  of  wavelength,  2,  and  frequency,  v,  is  equal  to 
the  speed  of  light,  c,  or 


2  x  v  =  c.  (2.9) 

So,  radiation  at  shorter  wavelengths  has  a  higher  frequency  and  a  longer 
wavelength  corresponds  to  a  lower  frequency.  Any  electromagnetic  wave, 
regardless  of  wavelength  or  frequency,  travels  though  empty  space  at  the  speed  of 
light,  and  it  is  the  maximum  speed  possible  (Focus  2.2). 

Focus  2.2  Light,  the  fastest  thing  around 

It  was  once  thought  that  light  moves  instantaneously  through  space.  But  we 
now  know  that  it  travels  at  a  very  fast  but  finite  speed.  This  was  first  inferred 
from  observations  of  Jupiter’s  moon  Io  in  the  17th  century.  The  King  of 
France  had  directed  Giovanni  Domenico  Cassini  (1625-1712),  director  of 
the  Paris  Observatory,  to  use  such  observations  to  improve  knowledge 
of  terrestrial  longitude  and  maps  of  France.  Both  the  Danish  astronomer  Ole 
Rpmer  (1644-1710),  who  also  worked  at  the  observatory,  and  Cassini 
noticed  a  varying  time  between  eclipses  of  Io  by  the  Jupiter  (Rpmer  1677). 
Although  the  time  between  Io  eclipses  was  approximately  42  h,  it  varied  by 
an  amount  of  up  to  22  min. 

Both  astronomers  concluded  that  it  was  not  the  orbit  of  Io  around  Jupiter 
that  changed,  but  the  time  it  took  Jupiter’ s  light  to  travel  from  Io  to  the  Earth, 
which  depended  on  the  Earth’s  position  in  its  orbit  around  the  Sun.  When  the 
Earth  was  on  the  side  of  its  orbit  that  is  closest  to  Jupiter,  the  observed 
eclipse  period  for  Io  was  shortest,  and  when  the  Earth  was  on  the  opposite 
side  of  its  annual  orbit  around  the  Sun,  Io’s  apparent  eclipse  period  was 
largest. 

Neither  astronomer  gave  a  value  for  the  speed  of  light,  which  would  have 
been  equal  to  the  diameter  of  the  Earth’s  orbit  divided  by  the  time  difference 
between  the  longest  and  shortest  observed  Io  period,  or  a  velocity  of 
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c  =  2  AU/22  min  and  approximately  2.27  x  10  m  s  ,  where  1 
AU  =  1.496x  10u  m  is  the  mean  distance  between  the  Earth  and  the  Sun.  At 
Cassini’s  time  this  distance  was  not  well  known  (see  following  Sect.  2.5). 

Jupiter  orbits  the  Sun  at  a  mean  distance  of  5.203  AU  and  Jupiter’s  natural 
satellite  Io  orbits  Jupiter  with  a  period  of  1.769  Earth  days.  It  is  eclipsed  by 
the  planet  with  that  period.  Observations  of  changes  in  the  eclipse  period 
were  interpreted  as  differences,  At,  in  the  time,  t  that  light  takes  to  travel 
from  Jupiter  to  Earth.  When  Jupiter  is  furthest  from  Earth,  its  distance  will 
be  6.203  AU,  since  the  Earth  is  1.00  AU  from  the  Sun,  and  when  Jupiter  is 
closest  to  the  Earth,  the  giant  planet’s  distance  will  be  4.203  AU.  So  the  total 
change  in  Io’s  apparent  orbital  period,  from  longest  to  shortest,  will  be 
At  =  (6.203-4.203)/c  =  2  AU/c  =  998  s  =  16.63  min  =  0.0116  Earth 

O  _  1 

days,  where  c  =  2.9979  x  10°  m  s  is  the  speed  of  light.  This  is  a  rela¬ 
tively  small  change  in  the  Io’s  actual  orbital  period  of  1.769  Earth  days. 

The  English  astronomer  James  Bradley  (1693-1762)  unexpectedly  dis¬ 
covered  the  aberration  of  starlight  about  half  a  century  later.  It  is  a  change  in 
the  observed  position  of  a  star  that  depends  on  the  ratio  of  the  velocity  of  the 
Earth  and  the  speed  of  light.  Using  then  current  estimates  for  the  Earth’s 
orbital  motion  around  the  Sun,  Bradley  used  his  aberration  measurements  to 
infer  a  speed  of  light  of  about  294,500  m  s-1  (Bradley  1728). 

More  refined  laboratory  measurements  during  subsequent  centuries 
indicated  that  light  is  always  moving  at  a  constant  speed  with  the  precise 
velocity  of  c  —  299,792,458  ms-1.  Light  emitted  by  any  star  moves  at  this 
speed  through  empty  space  for  all  time.  It  never  stops  or  slows  down,  and  it 
never  comes  to  rest.  Nothing  outruns  light;  it  is  the  fastest  thing  around. 


Electromagnetic  radiation  has  no  way  of  marking  time,  and  it  can  persist  for¬ 
ever.  As  long  as  its  rays  pass  through  empty  space  and  encounter  no  atoms  or 
charged  particles  like  electrons,  it  will  survive  unchanged.  Radiation  emitted  from 
any  star  or  galaxy  today  might  therefore  travel  for  all  time  in  vacuous  space, 
bringing  its  message  forward  to  the  end  of  the  universe.  Astronomers  on  Earth 
intercept  just  a  small  part  of  this  radiation,  which  is  streaming  away  from  both 
known  and  unknown  objects  located  throughout  the  cosmos. 


2.2  The  Electromagnetic  Spectrum 

Most  of  us  remember  the  colorful  display  of  a  rainbow,  which  is  sunlight  bent  into 
separate  wavelengths  by  droplets  of  water.  In  the  mid- 17th  century,  the  English 
scientist  Isaac  Newton  (1642-1727)  showed  that  sunlight  could  also  be  broken  into 
its  colors  using  a  prism  -  a  specially  cut  chunk  of  glass  (Newton  1671,  1704). 
Furthermore,  each  color  could  not  be  divided  into  other  colors.  A  crystal 
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Table  2.1  Approximate 
wavelengths  of  colorsa 


Color 

Wavelength  (nm  =10  9  m  =  10  A) 

Violet 

420 

Blue 

470 

Green 

530 

Yellow 

580 

Orange 

610 

Red 

660 

a  Approximate  wavelengths  good  to  about  10  nm 


chandelier  or  compact  disk  also  displays  the  spectrum  of  visible  light,  arranging 
the  colors  by  their  different  wavelengths. 

From  short  to  long  waves,  the  colors  in  the  spectrum  of  visible  light  correspond 
to  violet,  blue,  green,  yellow,  orange  and  red  (Table  2.1).  Their  wavelengths  might 
be  specified  in  nanometers,  abbreviated  nm,  where  1  nm  =  10-9  m  or  in 

o  o  o  _ i  n 

Angstroms,  abbreviated  A,  where  1  A  =  0.1  nm  =10  m.  Light  from  the  Sun 
or  an  incandescent  light  bulb  often  is  called  white  light,  because  it  contains  all  of 
the  colors,  whereas  black  denotes  the  absence  of  color  when  we  see  no  light. 

The  electromagnetic  spectrum  describes  the  types  and  wavelengths  of  elec¬ 
tromagnetic  radiation  (Fig.  2.2).  From  short  wavelengths  to  long  ones,  this  spec¬ 
trum  includes  gamma  rays,  x-rays,  ultraviolet  radiation,  visible  light,  infrared 
radiation  and  radio  waves  (Table  2.2). 

Our  eyes  detect  a  narrow  range  of  wavelengths,  which  include  the  visible 
colors.  It  comprises  just  one  small  segment  of  the  much  broader  electromagnetic 
spectrum.  This  band  of  light  is  also  termed  visible  radiation,  to  distinguish  it  from 
invisible  radiation  that  cannot  be  seen  with  the  eye.  The  radiation  we  can  see  is 
also  known  as  optically  visible  radiation,  since  the  science  of  optics  is  used  to 
describe  the  lenses  and  mirrors  used  to  detect  the  light.  The  most  intense  radiation 
of  the  Sun  and  many  other  stars  is  emitted  at  these  optically  visible  wavelengths, 
and  our  atmosphere  permits  it  to  reach  the  ground.  Other  types  of  radiation,  like 
the  invisible  x-rays,  are  absorbed  in  our  atmosphere  and  do  not  reach  the  Earth’s 
surface. 

The  invisible  domains  include  infrared  and  radio  waves  -  with  wavelengths 
longer  than  that  of  red  light  -  and  the  ultraviolet  (UV)  rays,  x-rays,  and  gamma  (y) 
rays,  whose  wavelengths  are  shorter  than  violet  light.  They  all  are  electromagnetic 
waves  and  part  of  the  same  family,  and  they  all  move  in  empty  space  at  the  speed 
of  light,  but  we  cannot  see  them. 

Gamma  rays  are  the  shortest  and  most  energetic  electromagnetic  waves.  Their 
wavelengths  are  as  small  as  the  nucleus  of  an  atom,  or  about  10“ 15  m,  and  their 
waves  are  so  energetic  that  they  can  pass  through  a  thick  iron  plate. 

The  x-ray  region  of  the  electromagnetic  spectrum  extends  from  a  wavelength  of 
100  billionth  (10-11)  of  a  meter,  which  is  about  the  size  of  an  atom,  to  the  short- 
wavelength  side  of  the  ultraviolet.  The  German  physicist  Wilhelm  Rontgen 
(1845-1923)  discovered  x-rays,  producing  them  with  an  electrical  discharge  in  a 
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Fig.  2.2  Electromagnetic  spectrum  Radiation  from  cosmic  objects  can  be  emitted  at 

_  1  O  A 

wavelengths  from  less  than  10  m  to  greater  than  10  m,  where  m  denotes  meters.  The 
visible  spectrum  that  we  see  with  our  eyes  is  a  very  small  portion  of  the  entire  range  of 
wavelengths.  Lighter  shading  indicates  a  greater  transparency  of  the  Earth’s  atmosphere  to 
cosmic  radiation.  It  only  penetrates  the  Earth’s  atmosphere  at  visible  and  radio  wavelengths, 
respectively  represented  by  the  narrow  and  broad  white  areas.  Electromagnetic  radiation  at  short 
gamma  ray,  X-ray  and  ultraviolet  wavelengths,  represented  by  the  dark  areas,  is  absorbed  in  our 
atmosphere.  The  universe  is  now  observed  in  these  spectral  regions  from  above  the  atmosphere  in 
Earth-orbiting  satellites 


Table  2.2  The 

electromagnetic  spectrum 


Region 

Wavelength  range  (m) 

Radio 

10  3-103 

Microwave 

10“3-1 

Infrared 

7  x  10~7-10  3 

Visible 

4  x  10-7-7  x  10-7 

Ultraviolet 

t" 

1 

O 

H 

X 

xl- 

1 

00 

1 

O 

y— H 

x-ray 

o 

1 

H- 

H— 

1 

o 

1 

00 

Gamma  ray 

Less  than  10  11 

glass  vacuum  tube  (Rontgen  1896).  He  used  the  energetic  x-rays  to  penetrate  skin 
and  muscle,  detecting  human  bones  and  revolutionizing  medicine. 

The  wavelength  of  ultraviolet  radiation ,  abbreviated  UV,  is  just  a  bit  longer, 

_ o  _ ’~1 

between  10  and  4  x  10  m,  with  extreme  ultraviolet  radiation,  denoted  EUV, 
lying  in  the  short  wavelength  part  of  this  range.  Most  of  the  ultraviolet  radiation 
from  the  Sun  is  absorbed  in  our  air,  but  prolonged  exposure  to  the  amount  that 
reaches  the  ground  can  burn  your  skin. 

The  infrared  part  of  the  electromagnetic  spectrum  is  located  at  wavelengths 
between  7  x  10  and  10  m.  The  German-born  English  astronomer  William 
Herschel  (1738-1822)  discovered  infrared  radiation  when  he  put  a  beam  of  sun¬ 
light  through  a  prism  to  spread  it  into  its  spectral  components.  He  noticed  that  an 
unseen  portion  of  sunlight  warmed  a  thermometer  placed  beyond  the  red  edge  of 
the  visible  spectrum  (Herschel  1800).  The  thermometer  recorded  higher 


40 


2  Radiation 


temperatures  in  the  invisible  infrared  sunlight  than  in  normal  visible  sunlight. 
Herschel  called  them  calorific  rays  because  of  the  heat  they  generated.  The  term 
infrared  did  not  appear  until  the  late  19th  century. 

Humans  “glow  in  the  dark,”  emitting  infrared  radiation,  but  we  cannot  see  the 
heat;  it  is  outside  our  range  of  vision.  Soldiers  can  locate  the  enemy  at  night  by 
using  night-vision  goggles  with  infrared  sensors  that  detect  their  heat,  and  spy 
satellites  use  infrared  telescopes  to  detect  heat  radiated  by  rocket  exhaust  and  by 
large  concentrations  of  troops  and  vehicles. 

Atmospheric  molecules  such  as  carbon  dioxide  and  water  vapor  absorb  infrared 
radiation.  So  the  air  that  looks  so  transparent  to  our  eyes  is  opaque  to  much  of  the 
infrared  radiation  coming  from  outer  space.  Telescopes  located  above  part  of 
the  atmosphere,  on  the  tops  of  mountains  in  dry  climates,  can  catch  some  of  the 
incoming  infrared  radiation  before  it  is  completely  absorbed.  The  atmosphere 
similarly  blocks  the  heat  radiation  from  the  Earth’s  surface,  keeping  it  warmer  than 
it  would  otherwise  be.  This  warming  of  the  ground  is  known  as  the  greenhouse 
effect. 

The  atmosphere  effectively  absorbs  most  of  the  ultraviolet  and  infrared  radia¬ 
tion  from  cosmic  objects  and  all  of  their  x-rays  and  gamma  rays,  which  never 
reach  the  ground.  To  look  at  the  universe  at  these  invisible  wavelengths,  we  must 
loft  telescopes  above  the  atmosphere.  This  was  done  first  by  using  balloons  and 
sounding  rockets,  followed  by  Earth-orbiting  satellites  with  telescopes  that  view 
the  cosmos  at  invisible  ultraviolet,  infrared,  x-ray  and  gamma  ray  wavelengths. 

Radio  waves  are  between  0.001  and  1,000  m  long,  too  long  to  enter  the  eye  and 
not  energetic  enough  to  affect  vision.  The  German  physicist  Heinrich  Hertz 
(1857-1894)  discovered  radio  waves  by  building  equipment  to  both  produce  and 
detect  the  invisible  electromagnetic  signals  (Hertz  1887).  The  unit  of  frequency 
v  =  c/ X  is  now  named  the  Hertz  in  his  honor;  this  unit  is  abbreviated  Hz. 
Microwaves  have  wavelengths  in  the  short  part  of  the  radio-wave  region,  between 
0.001  and  1.0  m. 

Radio  waves  are  the  only  type  of  invisible  radiation  that  is  not  absorbed  in  the 
Earth’s  atmosphere.  Radio  waves  even  can  pass  through  rain  clouds;  therefore,  the 
radio  universe  can  be  observed  on  cloudy  days  and  in  stormy  weather,  just  as  a 
home  or  car  radio  works  even  when  it  is  raining  or  snowing.  Cosmic  radio  waves 
that  are  longer  than  about  10  m  are  nevertheless  reflected  by  an  ionized  layer  in  the 
Earth’s  atmosphere,  called  the  ionosphere;  so  these  longer  radio  waves  cannot 
reach  the  ground  and  must  be  observed  from  space. 


2.3  Moving  Perspectives 

Motion  changes  our  perspective,  and  observations  depend  on  our  relative  motion 
with  respect  to  the  object  being  observed.  These  moving  perspectives  are  descri¬ 
bed  using  inertial  frames  of  reference,  which  move  at  a  constant  velocity,  never 
accelerating  or  decelerating.  The  Dutch  physicist  Hendrik  A.  Lorentz  (1853-1928) 
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derived  the  coordinate  transformation  of  Maxwell’s  equations  from  one  inertial 
system  to  another,  showing  that  the  equations  are  invariant  when  subjected  to  this 
transformation.  The  Lorentz  transformation  utilizes  a  parameter  y,  now  known  as 
the  Lorentz  factor,  which  is  given  by  (Lorentz  1904): 


(2.10) 


where  V  is  the  relative  velocity  of  the  two  inertial  frames  of  reference,  f  =  V/c 
and  c  =  2.99792458  x  108  m  s”1  is  the  speed  of  light. 

The  German-born  physicist  Albert  Einstein  (1879-1955)  generalized  the 
Lorentz  transformation  in  the  Principle  of  Relativity  (Einstein  1905a,  b),  which 
states  that  the  laws  of  nature  and  the  results  of  experiments  performed  in  an  inertial 
frame  are  independent  of  the  uniform  velocity  of  the  system.  Einstein  additionally 
proposed  that  there  exists  in  nature  a  limiting,  invariant  speed,  the  speed  of  light,  c, 
now  known  as  a  universal  constant. 

The  unvarying  speed  of  light  was  first  demonstrated  in  the  late  19th  century  by 
the  American  physicist  Albert  A.  Michelson  (1852-1931),  assisted  by  his  friend 
the  chemist  Edward  W.  Morley  (1838-1923),  when  they  attempted  to  precisely 
measure  how  the  speed  of  light  depends  on  the  Earth’s  motion  through  a  hypo¬ 
thetical,  space-filling  medium,  the  ether,  in  which  light  waves  were  supposed  to 
propagate  and  vibrate. 

As  the  Earth  moves  through  the  stationary  ether,  a  wind  would  be  generated, 
and  the  observed  speed  of  light  would  vary,  like  the  speed  of  a  sailboat  going  with 
or  against  the  wind.  But  Michelson  and  Morley  found  that  there  was  no  detectable 
difference  in  the  speed  of  light  measured  in  the  direction  of  the  Earth’s  motion  or 
at  right  angles  to  it  (Michelson  and  Morley  1887).  So  the  experiment  meant  that 
there  was  no  light-carrying  ether.  It  also  implied  that  the  speed  of  light  is  constant, 
exactly  the  same  in  all  directions  and  at  all  seasons,  and  independent  of  the  motion 
of  the  observer  (Focus  2.3). 

Focus  2.3  The  Michelson-Morley  experiment 

Many  experiments  have  been  carried  out  to  confirm  the  unvarying  speed  of 
light,  but  the  most  famous  one  was  conducted  in  a  basement  laboratory  at  the 
Case  School  of  Applied  Science  in  Cleveland,  Ohio  in  1887,  when  the 
American  scientists  Albert  A.  Michelson  and  Edward  W.  Morley  attempted 
to  use  an  interferometer  to  precisely  measure  how  the  speed  of  light  depends 
on  the  Earth’s  motion  through  space. 

Scientists  of  that  time  firmly  believed  in  an  imaginary  luminiferous  ether, 
an  invisible,  frictionless,  and  unmoving  medium  that  was  supposed  to  per¬ 
meate  all  of  space.  Its  presence  explained  how  light  waves  could  travel  at 
high  speed  through  the  apparent  emptiness  of  space,  providing  the  medium 
in  which  they  propagate.  Light  was  supposed  to  be  transmitted  in  space  by 
the  vibrations  of  the  hypothetical,  invisible  ether. 
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If  the  Earth  moved  through  the  stationary  ether,  a  wind  would  be  gen¬ 
erated  and  the  observed  speed  of  light  would  vary,  like  the  speed  of  an 
airplane  moving  with  or  against  the  wind.  But  Michelson  and  Morley  found 
that  there  was  no  detectable  difference  in  the  interference  pattern  produced 
when  a  beam  of  light  was  sent  into  the  ether  wind  in  the  direction  of  the 
Earth’s  motion  or  directed  at  right  angles  to  it.  Moreover,  there  was  no 
difference  in  the  measured  speed  of  light  when  the  Earth  was  traveling 
toward  the  Sun  and  away  from  it  half  a  year  later.  That  is,  Michelson  and 
Morley  could  measure  no  difference,  Ac,  in  the  speed  of  light,  c,  in  two 
perpendicular  paths  of  equal  length,  in  the  direction  of  the  Earth’s  motion  or 
transverse  to  it,  with  a  precision  of  Ac/c<  0.0001  (Michelson  1881; 
Michelson  and  Morley  1887).  Roy  J.  Kennedy,  at  the  California  Institute  of 
Technology,  subsequently  refined  the  experiment  and  improved  the  mea¬ 
surement  precision  by  a  factor  of  ten  (Kennedy  1926;  Kennedy  and 
Thorndike  1932). 

So  the  Michelson-Morley  experiment  meant  that  there  was  no  light-car¬ 
rying  ether.  It  also  meant  that  the  speed  of  light  is  always  constant  and 
everywhere  the  same.  In  1907  Michelson  was  awarded  the  Nobel  Prize  in 
Physics  for  his  optical  precision  instruments  and  the  spectroscopic  and 
metrological  investigations  carried  out  with  their  aid. 

The  speed  of  light,  c,  enters  into  Albert  Einstein’s  (1879-1955)  Special  Theory 
of  Relativity  through  the  Lorentz  factor  y  =  [1  —  (V/c)  ]_  for  an  object  moving 
at  an  observed  velocity,  V.  We  normally  regard  time  as  absolute  and  immutable, 
with  nothing  disturbing  its  relentless,  steady  tick.  But  for  Einstein,  time  was  rel¬ 
ative  and  variable.  In  rapid  travel,  the  rate  at  which  time  flows  decreases,  so 
moving  clocks  run  slower  by  the  factor  y.  Lengths  are  diminished  at  high  speed, 
shrinking  in  the  direction  of  motion  by  the  amount  y.  At  very  high  velocities,  mass 
is  also  relative,  and  it  increases  with  the  speed  by  the  same  infamous  y  factor. 

In  the  Special  Relativity ,  motions  and  events  are  described  by  coordinates  in 
space  (x,  y,  z)  and  time,  t,  within  an  inertial  frame  of  reference  that  moves  at  a 
constant  velocity.  The  length  of  an  object  moving  with  the  reference  frame  of  an 
observer  is  called  the  proper  length,  and  the  time  read  in  a  clock  in  that  frame  is  the 
proper  time. 

If  proper  time,  t,  and  time  interval,  At,  between  two  events  at  one  location  are 
measured  in  system  K,  then  the  time  interval,  At' ,  between  the  events  as  measured 
in  system  K'  moving  with  uniform  velocity  V  is: 


(2.11) 


A  moving  clock  will  therefore  appear  to  go  slower  to  an  observer  in  the  moving 
system,  which  is  known  as  time  dilation. 
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Time  dilation  can  prolong  the  decay  time  of  fast-moving,  unstable  cosmic  ray 
particles  by  several  orders  of  magnitude,  and  noticeably  lengthen  the  lifetime  of 
elementary  particles  produced  in  man-made  particle  accelerators. 

Atomic  clocks  have  been  flown  around  the  world,  first  eastward  and  then 
westward,  and  compared  with  the  time  recorded  by  a  reference  atomic  clock  on  the 
ground.  As  predicted  by  Special  Relativity ,  the  flying  clocks  lost  time  (aged 
slower)  during  the  eastward  trip,  in  the  direction  of  the  Earth’s  rotation,  and  gained 
time  (aged  faster)  during  the  westward  trip  (Hafele  and  Keating  1972). 

Lengths  are  also  diminished  at  high  speed,  shrinking  in  the  direction  of  motion. 
For  proper  spatial  separation  or  length,  Ax,  in  system  K ,  there  is  a  Lorentz  con¬ 
traction  or  shortening,  Ax',  in  the  moving  K'  system  given  by: 


Ax 


Ax 

7 
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Thus,  both  space  and  time  are  relative  in  the  Special  Theory  of  Relativity . 
Mass  is  also  relative,  for  it  increases  with  the  speed.  If  a  particle  or  object  has 
rest  mass,  m0,  in  a  non-moving  frame,  the  mass  increases  in  the  moving  one  to  m' 
given  by: 


(2.13) 
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The  rest-mass  energy  E  =  m0c  increases  in  the  moving  frame  to  the  energy  E' 
given  by: 


=  yniQC 
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(2.14) 


This  expression  has  been  verified  in  high-energy  particle  experiments  that 
demonstrate  that  the  energy  of  a  subatomic  particle  can  increase  with  its  speed. 
The  equation  also  shows  that  an  infinite  amount  of  work  would  be  required  to 
accelerate  a  particle  to  the  speed  of  light,  with  V  =  c,  implying  that  no  physical 
object  can  move  faster  than  the  speed  of  light  in  an  inertial  frame.  The  mass  grows 
without  bound  when  an  object  moves  as  fast  as  light,  and  there  is  nothing  that  can 
propel  it  so  fast. 

Light  is  especially  difficult  to  describe  using  this  theory,  for  any  specification  of 
mass,  size,  or  time  intervals  are  undefined  when  moving  at  light’s  speed. 

In  the  Special  Theory  of  Relativity,  which  applies  to  the  non-accelerating  and 
non-gravitational  laws  of  physics,  distance  is  measured  by  a  metric,  or  line 
element,  ds,  that  combines  space,  x,  y,  z,  and  time,  t.  It  was  first  proposed  by 
Einstein’s  former  teacher,  Hermann  Minkowski  (1864-1909)  and  is  given  by 
Minkowski  (1908): 


ds 2  =  — c2dt 2  +  dx 2  +  dy2  +  dz 2 


5 


(2.15) 
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where  the  speed  of  light,  c,  is  used  to  give  the  units  of  space  from  time  and  the  all- 
important  negative  sign  indicates  time  passing.  In  spherical  coordinates  r,  d,  <fi  the 
metric  ds  is  written 

ds 2  =  — c2dt 2  +  dr 2  +  r2d02  +  r2  sin2  6d(j)2.  (2.16) 

In  this  description,  two  events  don’t  have  a  uniquely  defined  separation  in  either 
space  or  time.  Instead,  they  are  separated  in  space-time.  So,  the  concepts  of  space 
and  time  are  interwoven. 

Space  and  time  manage  to  join  together  in  a  detectable  way  when  objects  move 
exceptionally  fast,  approaching  the  speed  of  light.  In  these  special  circumstances, 
there  is  no  space  without  time  and  no  time  without  space;  they  are  fused  together. 
If  the  motion  is  fast  enough,  it  will  change  the  size  and  shape  of  things,  or  slow  the 
passing  of  time.  But  these  effects  only  become  significant  at  exceptionally  high 
speeds,  close  to  the  speed  of  light.  We  never  encounter  these  experiences  in 
normal  circumstances,  and  they  are  not  directly  applicable  to  our  everyday  lives. 


2.4  Thermal  (Blackbody)  Radiation 

An  ideal  thermal  radiator  is  known  as  a  blackbody.  By  definition,  a  blackbody 
absorbs  all  the  radiation  that  falls  upon  it  and  reflects  none  -  hence  the  term  black. 
A  black  shirt  will  similarly  absorb  most  of  the  visible  sunlight  falling  on  it  and 
reflects  no  colors. 

Thermal  radiation  is  emitted  by  a  gas  in  thermal  equilibrium,  and  arises  by 
virtue  of  an  object’s  heat,  or  temperature.  A  single  temperature  characterizes 
thermal  radiation. 

Any  hot  gas  that  is  in  thermal  equilibrium,  with  a  temperature  above  absolute 
zero,  will  attempt  to  radiate  its  energy  away.  The  emission  from  such  a  thermal 
radiator  is  found  at  all  wavelengths,  or  frequencies,  but  with  a  varying  intensity 
that  depends  on  the  temperature  (Fig.  2.3).  As  the  temperature  increases,  more 
energy  is  radiated  at  all  wavelengths.  Moreover,  the  wavelength  of  maximum 
radiation  shifts  toward  the  shorter  wavelengths  when  the  temperature  rises. 

Since  the  emission  of  thermal  radiation  is  present  at  all  wavelengths,  astrono¬ 
mers  say  it  emits  a  continuum  spectrum.  A  display  of  its  radiation  intensity  as  a 
function  of  wavelength,  known  as  the  spectrum,  shows  no  gaps,  breaks  or  sudden 
increases  or  decreases.  It  is  an  unbroken  continuum  ascending  to  peak  intensity 
and  then  dropping  again  as  the  wavelength  increases. 

No  real  object  emits  a  perfect  thermal,  or  blackbody,  spectrum,  but  the  Sun 
shines  with  roughly  such  a  spectrum.  It  closely  matches  the  radiation  spectrum  of  a 
blackbody  at  a  temperature  of  5,780  K. 

The  German  physicist  Max  Planck  (1858-1947)  derived  the  formula  for  the 
spectrum  of  a  perfect  absorber,  or  blackbody,  introducing  the  idea  that  it  radiates 
energy  in  fundamental  indivisible  units,  which  he  called  quanta ,  whose  energy  is 
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Fig.  2.3  Blackbody  radiation  The  spectral  plot  of  blackbody  radiation  intensity  as  a  function  of 
wavelength  depends  on  the  temperature  of  the  gas  emitting  the  radiation.  The  German  physicist 
Max  Planck  (1858-1947)  derived  the  formula  that  describes  the  shape  and  peak  of  this  spectrum 
in  1900.  He  proposed  that  the  radiation  energy  was  quantized,  which  provided  a  foundation  for 
quantum  theory.  At  higher  temperatures  the  wavelength  of  peak  emission  shifts  to  shorter 
wavelengths,  and  the  thermal  radiation  intensity  becomes  greater  at  all  wavelengths.  At  a 
temperature  of  6,000  degrees  on  the  kelvin  scale,  or  6,000  K,  the  thermal  radiation  peaks  in  the 
visible,  or  V,  band  of  wavelengths.  A  hot  gas  with  a  temperature  of  100,000  K  emits  most  of  its 
thermal  radiation  at  ultraviolet,  or  UV,  wavelengths,  whereas  the  emission  peaks  in  X-rays  when 
the  temperature  is  1  million  to  10  million  K 


proportional  to  the  frequency  of  the  radiation  (Planck  1901,  1910,  1913).  The 
constant  of  proportionality  between  the  frequency  and  energy  of  the  radiation  is 
now  known  as  the  Planck  constant,  designated  by  the  lower  case  letter  h.  It  has  a 
value  of  h  —  6.626  069  57  x  10“ 34  J  s,  or  about  h  «  6.626  x  10“ 34  J  s.  This 
marked  the  beginning  of  quantum  physics,  whose  history  is  discussed  by  Kragh 
(2002). 

Planck  found  that  a  blackbody  with  temperature  T  emits  a  continuum  spectrum 
of  radiation  characterized  by  a  brightness  distribution,  BV(T ),  which  depends  only 
on  the  frequency  v  and  temperature  T  and  is  given  by: 


BV(T)  = 


2  hv3 


1 


J  s  1  m  z  Hz  1  steradian 


l 


c2  exp[hv/(kT)  —  1] 


(2.17) 
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where  the  Planck  constant  h  «  6.626  x  10  J  s,  the  Boltzmann  constant 

'O  o  1 

k  «  1.381  x  10  J  K  ,  and  a  steradian  is  the  dimensionless  SI  unit  of  solid 
angle,  which  is  related  to  the  area  an  angle  cuts  out.  The  solid  angle  of  a  full  sphere 
is  471  and  that  of  a  hemisphere  is  2n,  where  n  =  3.13159. 

The  Planck  distribution  can  also  be  written  per  unit  wavelength,  B^(T),  where 
the  wavelength  2  =  c/v  and: 
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The  blackbody  spectrum  is  markedly  asymmetric.  It  falls  off  very  rapidly  with 
decreasing  wavelength  on  the  short  wavelength  side  of  the  maximum  and 
decreases  gradually  with  increasing  wavelength  at  long  wavelengths.  At  short 
wavelengths,  or  high  frequencies,  we  have  the  so-called  Wien  tail  of  the  distri¬ 
bution,  derived  by  the  German  physicist  Wilhelm  Wien  (1864-1928)  near  the  end 
of  the  19th  century.  It  is  given  by  (Wien  1893): 
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Two  English  physicists,  Lord  Rayleigh  (John  Strutt,  1842-1919)  and  James 
Jeans  (1877-1946)  derived  an  expression  for  the  brightness  of  thermal  radiation  at 
long  wavelengths,  or  low  frequencies.  For  wavelength  X  this  Rayleigh- Jeans  law  is 
(Rayleigh  1900,  1905;  Jeans  1905,  1909): 


or  at  frequency  v: 
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BJT )  =  — —  for  hv  <  kT,  (2.21) 
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These  equations  are  applicable  at  radio  wavelengths  or  frequencies  for  most 
temperatures. 

The  Rayleigh- Jeans  law  agrees  with  experimental  results  at  large  wavelengths, 
with  X  hc/(kT ),  or,  equivalently,  at  low  frequencies  v  <C  kT/h,  but  strongly 
disagrees  at  the  short  ultraviolet  wavelengths  (or  high  frequencies).  This  incon¬ 
sistency  between  observations  and  the  predictions  of  classical  physics  is  com¬ 
monly  known  as  the  ultraviolet  catastrophe ;  Planck  (1901)  explained  the 
inconsistency  when  he  introduced  radiation  quanta. 

The  blackbody,  or  thermal,  spectrum  has  a  maximum  intensity  at  a  wavelength, 
2max,  which  can  be  found  by  taking  the  derivative  of  the  Planck  distribution  and 
setting  the  equation  to  zero,  or  from  dBx(T)/dX  =  0,  giving: 


„  b  0.00289777  0.0029 

Anax  =  -  = - — - meters  «  — — — meters, 


(2.22) 


where  b  is  the  Wien  displacement  constant  and  T  is  the  temperature  on  the  kelvin 
scale,  abbreviated  K.  This  expression  is  called  the  Wien  displacement  law ,  after  the 
German  physicist  Wilhelm  Wien  (1864-1928)  who  formulated  the  relationship 
based  on  a  thermodynamic  argument  (Wien  1893).  It  is  known  as  a  displacement 
law  because  the  wavelength  peak  2max  is  displaced  when  the  temperature,  Ty  is 
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changed.  The  expression  indicates  that  colder  objects  radiate  most  of  their  energy 
at  longer  wavelengths,  and  that  hotter  objects  are  most  luminous  at  shorter 
wavelengths.  In  other  words,  as  the  temperature  of  a  gas  increases,  most  of  its 
thermal  radiation  is  emitted  at  shorter  and  shorter  wavelengths. 

Example:  The  most  intense  thermal  radiation  at  different  temperatures 

The  Sun  radiates  its  most  intense  radiation  in  the  visible  colors.  At  an  orange 
wavelength  of  2  =  500  nm,  the  effective  temperature  of  the  Sun’s  photo¬ 
sphere  is  T  =  0.0029/(5.00  x  10-7)  «  5,800  K.  The  average  body  tem¬ 
perature  of  a  human  is  about  T  =  310  K.  From  the  Wien  displacement  law, 
the  wavelength  of  maximum  thermal  radiation  at  this  temperature  is 
2max  =  0.0029/310  «  9.35  x  10  6  m,  corresponding  to  infrared  wave¬ 
lengths.  This  heat  radiation  can  be  detected  by  rattlesnakes  and  by  night- 
vision  goggles.  The  primary  mirror  of  the  Spitzer  Space  Telescope  has  a 
diameter  of  DT  =  0.85  m,  and  its  angular  resolution  6  at  this  infrared 
wavelength  is  6  =  1/DT  ^  1.1  x  10-5  rad  «  2.27" ,  where  1  rad  = 
2.06265  x  105 ".  Suppose  this  telescope  was  pointed  down  at  the  ground  to 
act  as  a  spy  satellite  from  a  geosynchronous  orbit  where  the  orbital  period  is 
equal  to  the  Earth’s  rotation  period  of  24  h.  The  semi-major  axis  of  such  an 
orbit  is  equal  to  a  =  42,164  km,  so  the  altitude  H  above  the  ground  in 

7 

H  =  a  —  RE  =  3.579  x  10  m,  where  the  radius  of  the  Earth  is 
Re  =  6,371  km  (see  Sect.  4.1).  The  smallest  feature  this  telescope  could 
resolve  on  the  ground  would  have  a  linear  size  of  L  =  H  x  6  «  394  m, 
bigger  than  a  human  but  comparable  to  a  convoy  of  vehicles.  An  x-ray 
telescope  operating  at  a  wavelength  of  2  =  1.24  x  10-9  m  would  detect  the 
thermal  radiation  of  a  gas  at  a  temperature  of  T  —  0.0029/2  ^2.3  x  106  K, 
or  about  2  million  K.  In  contrast,  the  cosmic  microwave  background  radi¬ 
ation  has  a  temperature  of  T  =  2.725  K,  and  the  wavelength  at  which  its 
emission  is  most  intense  is  2max  =  0.0029/2.725  «  0.001  m  or  1  mm. 


The  Wien  displacement  law  helps  explain  why  stars  have  different  colors.  Since 
red  wavelengths,  at  about  660  nm,  are  longer  than  blue  wavelengths,  at  around 
470  nm,  you  would  expect  that  the  visible  disk  of  a  red  star  would  be  cooler  than 
the  disk  of  a  blue  star.  The  Wien  displacement  law  yields  effective  disk  temper¬ 
atures  of  about  4,400  K  for  the  red  star  and  roughly  6,200  K  for  the  blue  star. 

However,  the  radiation  from  exceptionally  hot  stars,  which  peaks  at  short, 
invisible  ultraviolet  wavelengths,  also  enhances  the  radiation  intensity  at  adjacent 
blue  wavelengths.  A  star  that  is  most  intense  at  an  unseen  ultraviolet  wavelength  of 
30  nm  might  have  a  disk  temperature  as  great  as  100,000  K,  and  such  a  star  will 
also  emit  more  radiation  in  blue  visible  light  than  a  cooler  star  (Fig.  2.4).  Careful 
spectral  calibration  of  stellar  colors  indicates  that  blue  stars,  of  spectral  class  O, 
can  indeed  have  disk  temperatures  as  high  as  280,000  K,  while  the  red  stars  of 
spectral  class  M  can  be  about  100  time  cooler,  at  2,800  K  (Sect.  10.10). 
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Fig.  2.4  Ultraviolet  overflow  The  continuum  spectrum  of  a  star’s  thermal  radiation  changes 
with  the  effective  temperature  of  the  stellar  disk,  and  this  results  in  different  star  colors  within  the 
visible  range  of  wavelengths,  from  400  to  700  nm  (middle).  They  range  from  blue  stars,  at 
relatively  short  visible  wavelengths  to  red  stars,  at  the  longer  wavelengths  detected  by  our  eyes. 
The  thermal  radiation  of  a  star  with  an  effective  disk  temperature  of  about  4,000  degrees  on  the 
kelvin  scale,  denoted  K,  peaks  at  the  red  wavelengths,  and  a  hotter  star  with  a  temperature  of 
about  6,000  K  emits  its  most  intense  emission  at  blue  wavelengths.  A  much  hotter  star  at  100,000 
K  will  be  most  intense  at  invisible  ultraviolet  wavelengths  (left),  but  because  the  total  energy 
emitted  by  a  star  increases  dramatically  with  temperature,  the  very  hot  star  will  also  appear  bright 
at  blue  wavelengths 


In  terms  of  frequency,  Wien’s  displacement  law  for  the  maximum  frequency, 
vmax,  is  determined  from  dBv(T)/dv  =  0  and  is  given  by: 
vmax  ~  2.8  kT/h  «  5.8  x  1010THz. 

Because  the  spectrum  of  blackbody  radiation  per  unit  frequency  interval,  BV(T), 
differs  from  the  Planck  distribution  per  unit  wavelength,  B^(T),  the  vmax  does  not 
equal  c  /  2max. 

The  energy  density,  uv(T),  of  blackbody  radiation,  per  unit  frequency  interval,  is 


uv(T) 


87ivz 


hv 


c3  exp[(^)-l] 


(2.23) 
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Table  2.3  Radiation  constants 

a  =  Radiation  density  constant  =  8n5k4 / (15c3h3)  =  Aajc  =  7.5657  x  10-16  J  K-4  m-3 
a  =  Stefan-Boltzmann  constant  =  2n5k4 / (15 c2h3)  =  ac/A  =  5.6704  x  10-8  J  s_1  m-2  K-4 
Ci  =  First  radiation  constant  =  2nhc 2  =  3.741771  x  10-16  J  s_1  m2 
c2  =  Second  radiation  constant  =  hc/k  =  0.0143877  m  K 

b  —  kmaxT  =  Wien  displacement  law  constant  =  0.002897768  m  K  %  0.002898  m  K 


_ Q  _  1 

The  energy  density  has  SI  units  of  J  m  Hz  .  The  radiation  is  isotropic,  or  the 
same  in  all  directions,  and  the  solid  angle  of  a  full  sphere  is  An  sr,  where  the 
constant  n  «  3.14159.  When  this  expression  is  integrated  over  all  frequencies,  we 
obtain  the  total  energy  density,  u,  of  a  blackbody: 

oo 

u  =  J  uv(T)dv  =  aT4,  (2.24) 

o 

where  the  radiation  constant  a  is  given  by 

Rir5k4 

a  =  — -t-t  «  7.57  x  10“16  J  K-4  m"3  (2.25) 

15 c5h5 

The  radiant  flu x,fv(T),  of  energy  flowing  out  of  the  blackbody  over  n  sr,  or  over 
the  hemisphere  facing  an  observer,  is 

fv(T)  =  7iBv(T)ls-lm~2Hz-\  (2.26) 

in  units  of  energy  per  unit  time  per  unit  area  per  unit  frequency  interval.  The  radiant 
flux  is  what  is  observed  from  astronomical  objects.  When  integrating  the  flux  over  all 
frequencies  one  obtains  the  total  radiant  output  per  unit  area,/,  given  by: 


/ 


J  nB,.(T)  dv  =  jT4 
o 


(2.27) 


.  _ O 

where  the  Stefan-Boltzmann  constant,  a,  is  given  by:  a  =  ac/A  =  5.6704  x  10 
J  s_1  m~2  K~4. 

This  and  other  radiation  constants  are  given  in  Table  2.3. 

We  can  add  up,  or  integrate,  the  contributions  to  the  blackbody  spectrum  at 
every  wavelength  to  obtain  the  total  luminosity  of  a  thermal  radiator.  This  results 
in  the  Stefan-Boltzmann  law  in  which  the  luminosity  increases  with  the  square  of 
the  radius  and  the  fourth  power  of  the  effective  temperature.  Luminosity  is 
intrinsic  to  a  star,  establishing  its  power  and  energy  output  per  unit  time. 

The  Stefan-Boltzmann  law  states  that  the  total  power,  or  intrinsic  luminosity  L, 
at  the  visible  disk  of  a  star  or  other  astronomical  object  with  radius,  R ,  and 
effective  temperature,  Teff,  is: 


L  =  Ana  R2  T4, 


(2.28) 
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where  71  =  3.1416  and  the  Stefan-Boltzmann  constant  a  =  2n5k4 /(I5c2h3)  = 
5.6704  xlO  8Js  1m  2K4.  The  effective  temperature,  Te ff,  is  the  disk  tempera¬ 
ture  that  the  object  would  have  if  it  were  a  perfect  blackbody  radiating  at  lumi¬ 
nosity  L. 

The  unit  of  energy  is  the  joule,  and  the  unit  of  luminosity  is  joule  per  second, 
abbreviated  J  s-1.  Power  is  often  expressed  in  units  of  watts,  where  1  watt  = 
1  W  =  1  J  s_1. 

The  Stefan-Boltzmann  law  indicates  that  at  a  given  effective  temperature, 
bigger  stars  have  a  greater  luminosity  than  smaller  stars,  and  at  the  same  size, 
hotter  stars  are  intrinsically  more  luminous  than  cooler  stars.  The  Austrian  phys¬ 
icist  Joseph  Stefan  (1835-1893)  obtained  this  law  using  experimental  measure¬ 
ments  made  by  the  English  physicist  John  Tyndall  (1820-1893),  and  Stefan’s 
student  Ludwig  Boltzmann  (1844-1906)  derived  it  from  theoretical  consider¬ 
ations,  using  thermodynamics  (Stefan  1879;  Boltzmann  1872). 

The  intensity  of  radiation  striking  a  unit  area  decreases  as  the  radiation  spreads  out 
into  an  increasing  volume.  The  area  of  an  imaginary  sphere  located  at  a  distance,  D , 
from  the  Sun  or  any  other  star  is  given  by  4nD2,  so  the  intensity  per  unit  area, 
designated  by  /,  is  given  by  l —  L)  (47lD2),  which  falls  off  as  the  inverse  square  of  the 
distance.  You  can  notice  this  effect  when  watching  the  increased  brightness  of  a  car’ s 
headlight  when  the  car  approaches  you  and  its  distance  decreases,  or  when  watching 
the  car’s  taillights  dim  as  it  moves  away  to  greater  distance. 

The  radiant  flux,  /  o  fa  blackbody,  thermal  radiator  of  radius  R  and  absolute 
luminosity,  L,  and  temperature  T,  observed  at  a  distance,  D,  is 

f_  L  ff 

7  4  nD2  D2 


2.5  How  Far  Away  is  the  Sun,  and  How  Bright,  Big 
and  Hot  is  it? 

2.5.1  Distance  of  the  Sun 

How  far  away  is  the  Sun?  The  mean  distance  separating  the  Earth  and  the  Sun  is 
known  as  the  astronomical  unit,  abbreviated  AU,  and  it  provides  the  crucial  unit  of 
planetary  distance.  Yet,  for  a  very  long  time  no  one  knew  exactly  how  big  it  was. 
We  now  know  that  it  is  about  149.6  million  km. 

By  the  end  of  the  17th  century,  astronomers  and  other  scientists  had  a  good 
understanding  of  how  the  planets  move  around  the  Sun,  but  they  could  produce  a 
scale  model  of  the  solar  system  that  only  provided  relative  distances  of  the  planets 
from  the  Sun.  The  true  distances  and  speeds  of  motion  of  the  planets  remained 
unknown. 
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Fig.  2.5  Distance  to  the  Sun  Values  of  the  solar  parallax  obtained  from  measurements  of  the 
parallaxes  of  Venus,  Mars,  and  the  asteroid  Eros  between  1850  and  1970.  The  solar  parallax, 
designated  by  nQ,  is  half  the  angular  displacement  of  the  Sun  viewed  from  opposite  sides  of  the 
Earth.  The  error  bars  denote  the  probable  errors  in  the  determination;  the  points  for  1941,  1950 
and  1965  all  have  errors  smaller  than  the  plotted  points.  In  the  1960s,  the  newly  developed  radar 
(i.e.,  radio  detection  and  ranging)  technology  enabled  determination  of  the  Sun’s  distance  with  an 
accuracy  of  about  1,000  m.  The  radar  value  of  the  solar  parallax  is  8.79405  s  of  arc 


It  is  no  wonder  then  that  obtaining  a  precise  value  for  the  Sun— Earth  distance 
played  an  important  role  in  the  astronomy  of  the  18th  and  19th  century.  The  quest 
for  accurately  measuring  that  distance  involved  hundreds  of  trips  to  remote 
countries,  tens  of  thousands  of  observations  and  photographs,  and  the  lifetime 
work  of  several  astronomers.  They  first  determined  the  separations  of  the  Earth 
and  a  nearby  planet,  such  as  Venus  or  Mars,  and  then  used  this  planetary  distance 
to  infer  the  separation  of  the  Earth  and  the  Sun. 

The  distance  of  a  nearby  planet  can  be  estimated  by  measuring  the  angular 
separation  in  the  apparent  direction  of  the  planet  when  observed  simultaneously 
from  two  widely  separated  locations  on  the  Earth.  This  angle  is  known  as  the 
parallax ,  from  the  Greek  parallaxis ,  meaning  the  “value  of  an  angle.”  If  both  the 
parallax  and  the  separation  between  the  two  observers  are  known,  then  the  distance 
of  a  planet  can  be  determined  by  triangulation.  This  is  based  on  the  geometric  fact 
that  if  we  know  the  length  of  one  side  of  a  triangle  and  the  angles  of  the  two 
corners,  then  all  of  the  other  dimensions  can  be  calculated. 

Since  angular  measurements  were  involved,  the  astronomical  unit  was  naturally 
specified  by  an  angle  called  the  solar  parallax,  which  is  defined  as  half  the  angular 
separation  of  the  Sun  as  viewed  from  opposite  sides  of  the  Earth.  More  than  a 
century  of  estimates  for  the  solar  parallax  are  shown  in  Fig.  2.5  and  discussed  in 
Focus  2.4  -  also  see  Hirshfeld  (2001)  and  Van  Helden  (1985). 
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Focus  2.4  The  solar  parallax  and  the  Sun’s  distance 

The  distance  separating  the  Earth  and  the  Sun,  known  as  the  astronomical 
unit  or  AU  for  short,  is  determined  by  first  estimating  the  distance  between 
the  Earth  and  a  nearby  planet.  This  planetary  distance  then  can  be  used  to 
specify  the  AU.  The  distance  of  Venus  from  the  Sun,  for  example,  is  equal  to 
one  half  of  the  distance  between  the  Earth  and  Venus  when  it  is  closest  and 
farthest  away,  on  the  other  side  of  the  Sun.  When  the  Venus— Sun  distance  is 
known,  we  can  infer  the  distance  of  any  other  planet  from  the  Sun  using 
Kepler’s  third  law  (see  Sects.  3.1,  3.2),  which  relates  the  orbital  periods  and 
orbital  distances  of  the  planets. 

For  more  than  a  century,  the  distances  of  Venus  and  Mars  were  deter¬ 
mined  by  tri angulation  from  different  points  on  the  Earth.  It  involved 
measurements  of  the  parallax,  or  angular  difference  in  the  apparent  direction 
of  the  planet,  as  observed  from  widely  separated  locations.  The  solar  par¬ 
allax,  designated  by  the  symbol  7i0  was  then  inferred.  It  is  defined  mathe¬ 
matically  by  sin7r0  =  RE/AU ,  where  the  equatorial  radius  of  the  Earth  is 

o 

Re  =  6.378  x  10  m.  The  ratio  of  RE  and  the  AU  provided  an  angle  in 
radian  units,  and  one  radian  is  equivalent  to  2.06265  x  105  "  where  the 
symbol  //  denotes  a  second  of  arc  or  an  arc  second. 

In  1672,  Giovanni  Domenico  Cassini  (1625-1712),  an  Italian  astronomer 
and  the  first  director  of  the  Paris  Observatory,  obtained  an  early  triangulation 
of  Mars,  combining  his  observations  from  Paris  with  those  taken  by  his 
colleague  Jean  Richer  (1630-1696)  from  Cayenne,  French  Guiana.  The 
planet  was  then  in  opposition,  at  its  closest  approach  to  the  Earth.  From  the 
two  sets  of  observations  of  Mars,  made  from  opposite  sides  of  the  Earth  and 
about  7,200  km  apart,  it  was  possible  to  estimate  the  distance  to  Mars  and  to 
infer  an  approximate  value  of  9.5"  for  the  solar  parallax  (Van  Helden  1985). 

Astronomers  in  the  18th  and  19th  century  attempted  to  improve  the 
measurement  accuracy  of  the  Sun’s  distance  during  the  rare  occasions  when 
Venus  crossed  the  face  of  the  Sun  in  1761  and  1769,  with  an  estimate  for  the 
solar  parallax  of  8.57//  =b  0.04"  and  in  1874  and  1882  with  a  wide  range  of 
results  between  8.76"  and  8.88"  from  world-wide  observations.  The  method 
also  involved  comparison  of  observations  from  widely  separated  locations  to 
determine  the  distance  by  triangulation. 

In  1877,  David  Gill  (1843-1914),  an  unemployed  Scottish  astronomer 
with  no  university  degree,  traveled  to  the  small  island  of  Ascension  near  the 
equator  where  he  could  use  the  Earth’s  rotation  to  view  the  near  approach  of 
Mars  from  different  directions,  obtaining  a  solar  parallax  of  8.78"  =L  0.01". 

Subsequent  determinations  of  the  distance  to  the  nearby  asteroid  named 
433  Eros,  during  its  closest  approaches  to  the  Earth  in  1900-1901  and 
1930-1931,  resulted  in  respective  estimates  for  the  solar  parallax  of 
8.807"  ±  0.0027"  and  8.790"  ±  0.001". 
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Significant  improvements  in  the  precision  of  planetary  distances  came  in 
the  late  1960s  by  bouncing  pulsed  radio  waves  off  of  Venus  and  timing  the 
echo.  The  round-trip  travel  time  -  about  276  s  when  Venus  is  closest  to  the 
Earth  -  was  measured  using  atomic  clocks,  and  a  precise  distance  to  Venus 
then  was  obtained  by  multiplying  half  of  the  round-trip  time  by  the  speed  of 
light. 

The  distance  of  Venus  from  the  Sun  is  equal  to  one  half  of  the  difference 
between  the  Earth  and  Venus  when  it  is  closest  and  farthest  away,  on  the 
other  side  of  the  Sun.  The  resulting  radar  value  for  the  solar  parallax  was 
8.79405//.  The  corresponding  value  of  the  astronomical  unit,  inferred  from 
the  radar  determination  of  the  distance  of  Venus,  is  149,597,870  km,  with  an 
accuracy  of  about  1  km,  or  for  the  accuracy  required  in  most  astronomical 
calculations  1  AU  =  1.496  x  1011  m  (Ash  et  al.  1967;  Muhleman  1969). 

Nowadays  the  accuracy  of  the  mean  Earth— Sun  distance  is  fixed  by  the 
exact  value  for  the  speed  of  light.  The  Earth-Sun  light  travel  time,  tau  -  or 
the  time  for  light  to  travel  across  1  AU  -  is  given  as  a  primary  astronomical 
constant  and  has  the  value  tau  =  499.0047863852  s,  with  a  derived  value 
for  the  mean  Earth-Sun  distance  of  1AU  =  ctau  =  1.495978707  x  1011  m, 
where  the  speed  of  light  c  =  299792458  ms-1.  The  derived  value  of  the 
solar  parallax  is  tzq  =  8.794 1433"  where  //  denotes  second  of  arc. 


The  time  for  light  to  travel  from  the  Sun  to  the  Earth  is  used  now  as  a  primary 
astronomical  constant.  It  is  approximately  499  s,  which  corresponds  to  an  AU  of 
about  149.6  million  km  or  1.496  x  1011  m,  and  approximately  10,000  times  the 
diameter  of  the  Earth.  Once  scientist’s  determined  the  Sun’s  distance,  they  could 
determine  the  Earth’ s  mean  orbital  velocity,  by  assuming  -  to  a  first  approximation  - 
a  circular  orbit  and  dividing  the  Earth’s  orbital  circumference  by  its  orbital  period  of 
PE  =  one  year  =  3.1557  x  107  s.  The  Earth’s  velocity  is  2n  AU /PE  = 
29,800  ms-1,  which  is  equivalent  to  approximately  107,000  km  per  hour,  much 
faster  than  the  fastest  airplane  or  car. 

By  way  of  comparison,  the  light  travel  time  from  the  next  nearest  star  other  than 

the  Sun  to  the  Earth  is  4.24  years,  or  approximately  134  million  s.  This  star  is 

1  6 

called  Proxima  Centauri  and  it  is  located  at  a  distance  of  4.01  x  10  m  and 
268,000  AU. 

Therefore,  the  Sun  is  about  a  quarter  million  times  closer  to  the  Earth  than  the 
next  nearest  star.  Because  of  this  closeness,  the  Sun  is  approximately  100  billion 
times  brighter  than  any  other  star.  This  brilliance  and  proximity  permit  detailed 
investigations  that  are  not  possible  for  any  other  star.  As  a  result,  studies  of  the  Sun 
provide  the  foundation  and  benchmark  for  an  understanding  of  other  stars. 

Although  our  unaided  eyes  can  see  about  six  thousand  stars  in  the  night  sky,  and 
telescopes  reveal  hundreds  of  billions  of  them  in  the  Milky  Way,  our  own  daytime 
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star,  the  Sun,  is  a  special  star.  It  is  the  source  of  all  our  power.  Its  radiation 
energizes  our  planet,  warms  the  ground  and  sea,  lights  our  days,  strengthens  our 
bodies,  and  sustains  life  on  Earth. 

The  life- sustaining  Sun  also  links  us  to  the  other  stars,  and  to  understand  how 
the  Sun  or  any  other  star  operates,  we  must  examine  their  radiation,  which  spreads 
out  and  carries  energy  in  all  directions. 


2.5.2  How  Big  is  the  Sun? 

Any  incandescent  body  shines  because  it  is  hot.  The  wire  filament  in  an  incan¬ 
descent  light  bulb  is,  for  example,  heated  to  a  white-hot  temperature  of  about 
3,000  K  to  produce  its  luminous  glow.  As  it  turns  out,  the  visible  solar  disk  is  just 
about  twice  that  hot,  and  it  owes  its  much  greater  luminosity  to  its  vastly  larger 
size. 

The  solar  radius,  denoted  RQ,  can  be  determined  from  observations  of  the  Sun’s 
angular  size  and  distance,  and  these  measurements  indicate  that  the  solar  radius 

o 

rq  =  6.955  x  10  m,  which  is  109  times  the  radius  of  the  Earth  (see  Sect.  1.4). 


2.5.3  The  Unit  of  Energy 

The  joule  is  the  unit  of  energy  in  the  International  System  of  units,  abbreviated  by 
SI  from  the  French  Systeme  International  d’ unites.  The  SI  unit  of  energy  is  named 
after  the  English  physicist  James  Prescott  Joule  (1818-1889),  who  described  the 
relationship  of  heat  to  mechanical  work  (Joule  1847),  leading  to  the  theory  of 
conservation  of  energy.  When  an  SI  unit  is  spelled  out  in  English,  it  begins  with  a 
lower  case  letter,  like  joule,  but  it  is  abbreviated  with  a  capital  version  of  the  first 
letter,  such  as  J. 

A  joule  is  the  work  required  to  produce  one  watt  of  power  for  one  second,  so  a 
power  of  1  J  s_1  is  equivalent  to  one  watt. 

One  joule  is  twice  the  kinetic  energy  of  a  mass  of  one  kilogram,  abbreviated 
1  kg,  moving  at  a  speed  of  one  meter  per  second,  or  1  ms-1.  This  amount  of 
energy  is  a  very  small  number  as  far  as  the  mass  and  speed  of  cosmic  objects  are 
concerned.  The  Sun,  for  example,  has  a  mass  of  about  2,000  billion  billion  billion 

OA  _  1 

kg,  or  2  x  10  kg,  and  moves  though  space  at  a  speed  of  about  220,000  m  s  . 

Even  an  ordinary  table  lamp  with  a  100  watt  light  bulb  uses  just  100  J  s  \ 
whereas  the  Sun  liberates  a  lot  more  power,  some  382.8  million  billion  billion 
J  s'1  written  3.828  x  1026  J  s”1. 
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2.5.4  The  Sun’s  Luminosity 

The  Sun  emits  radiation  in  all  directions,  and  as  the  solar  radiation  spreads  out  into 
space,  it  is  dispersed  into  an  ever-increasing  volume.  The  distant  Earth  therefore 
collects  only  a  small  fraction  of  the  total  energy  radiated  by  the  Sun.  The  solar 
constant  specifies  the  amount  of  the  Sun’s  radiation  that  arrives  at  our  planet.  It  is 
denoted  by  the  symbol/©,  and  is  precisely  defined  as  the  total  amount  of  radiant 
solar  energy  per  unit  time  per  unit  area  reaching  the  top  of  the  Earth’s  atmosphere 
at  the  Earth’s  mean  distance  from  the  Sun.  (Any  physical  parameter  of  the  Sun  is 
denoted  by  a  subscript  ©,  a  circle  with  a  dot  at  the  center.) 

Artificial  satellites  have  been  used  to  accurately  measure  the  Sun’s  total  irra- 
diance,  or  radiant  flux,  just  outside  the  Earth’s  atmosphere,  establishing  the  value 
of  the  solar  constant  (Kopp  et  al.  2005): 


/e  =  1,361  Js_1m“2.  (2.30) 

We  can  use  the  solar  constant  and  Earth-Sun  distance  to  determine  the  total 
amount  of  energy  radiated  by  the  Sun  every  second.  At  the  Earth’s  mean  distance 
of  1  AU  from  the  Sun,  the  solar  radiation  per  unit  area  is  diminished  by  An  (AU)  , 
the  surface  area  of  a  sphere  at  this  distance.  We  therefore  infer  the  Sun’s  lumi¬ 
nosity,  denoted  L©,  by  multiplying  the  solar  constant  with  this  area  to  obtain: 

Le  =  471/0 (AU)2  =  3.828  x  1026  Js_1.  (2.31) 

where  1  AU  =  1.496  x  1011  m. 


2.5.5  Taking  the  Sun’s  Temperature 


The  Sun,  like  any  incandescent  body,  shines  because  it  is  hot.  How  hot?  Once  we 
know  the  radius  and  luminosity  of  the  Sun,  we  can  determine  the  temperature  of 
the  Sun’s  visible  disk.  Using  the  Stefan- Boltzmann  law,  the  effective  temperature, 
^eff0>  of  the  visible  solar  disk  is  given  by: 


Lr. 


0 


1  1/4 


AuctRq 


5,780  K, 


(2.32) 


_ o  _ 1  _ 1 

where  the  Stefan- Boltzmann  constant  o  =  5.670  x  10  J  m  K  s  ,  and  the 

o 

Sun’s  radius  is  RQ  =  6.955  x  10  m.  At  this  temperature,  all  elements  in  the  Sun 
are  present  in  gaseous  form.  The  Sun  is  only  about  twice  as  hot  as  the  wire  filament 
in  an  incandescent  light  bulb,  so  its  much  greater  luminosity  is  due  to  its  vastly 
larger  size. 

Astronomers  use  the  kelvin  temperature  scale  that  starts  from  absolute  zero,  the 
temperature  at  which  atoms  and  molecules  cease  to  move.  The  unit  for  this  scale  is 
written  kelvin,  without  a  capital  K,  or  just  denoted  by  a  capital  K.  Water  freezes  at 
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273  K  and  boils  at  373  K,  and  to  convert  to  degrees  Celsius,  abbreviated  by  C,  just 
subtract  273,  or  C  =  K  —  273.  The  conversion  to  degrees  Fahrenheit,  denoted 
by  F,  is  more  complicated,  with  F  =  (9  K/5)  -  459.4. 


2.5.6  How  Hot  are  the  Planets? 


Solar  radiation  warms  a  planet’s  surface,  and  as  we  would  expect,  the  heat  is 
greatest  for  objects  that  are  closest  to  the  Sun.  That  is  because  the  intensity  of 
sunlight  falls  off  as  the  inverse  square  of  distance  from  the  Sun. 

We  can  make  an  initial  estimate  for  the  temperature  of  a  planet  by  assuming 
that  the  surface  of  a  terrestrial  planet  or  the  cloud  tops  of  a  giant  planet  are  not 
noticeably  warmed  by  heat  rising  from  the  planet’s  interior  and  that  there  is  no 
atmosphere  above  them.  The  planet  is  then  heated  solely  by  the  Sun’s  radiation, 
and  we  can  calculate  the  planet’s  effective  temperature,  rep,  from  the  relation 
rep  =  279  (AU /Dp)m  K,  where  Dp  is  the  planet’s  distance  from  the  Sun  and  the 
mean  distance  between  the  Earth  and  the  Sun  is  1  AU  =  1.496  x  1011  m. 

To  derive  this  expression,  notice  that  the  radiant  energy  per  unit  time,  LP  that  a 
planet  of  radius  RP  receives  from  the  Sun  is: 


n  R\ 


D\ 


Js 


-1 


(2.33) 


where  RP  is  the  radius  of  the  planet, /is  the  total  amount  of  radiant  solar  energy 
perunit  time  per  unit  area  reaching  the  top  of  the  planet’s  atmosphere,  DP  is  the 
planet’s  distance  from  the  Sun,  the  Stefan-Boltzmann  constant 
a  =  5.670  x  10-8  J  m-2  K-4  s_1,  the  solar  radius  RQ  =  6.955  x  108  m,  and  the 
effective  temperature  of  the  visible  solar  disk  is  TeQ  =  5,780  K. 

The  Stefan-Boltzmann  law  can  also  be  applied  to  a  planet,  giving  its  radiant 
luminosity,  LP ,  or  the  amount  of  radiation  energy  lost  per  unit  time,  as  a  function 
of  its  radius  and  effective  temperature,  Tep. 

LP  =4naR2pT^Js~\  (2.34) 


Assuming  thermal  equilibrium  between  energy  lost  and  received,  so 
LP  =  7 zRpf,  and  combining  equations  we  obtain: 


y4  _  Lp 
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or 


(A  U\  1/2  , 

7’ep«  279(— J  K,  (2.36) 

where  1  AU  =  1.496  x  1011  m  is  the  mean  distance  of  the  Earth  from  the  Sun. 

Notice  that  the  effective  temperature  is  independent  of  the  planet’s  radius,  and 
that  the  effective  temperature  for  planets  around  other  stars  depends  upon  the  star’ s 
disk  temperature  and  the  square  root  of  the  star’s  radius,  as  well  as  the  planet’s 
distance  from  the  star,  or  on  the  star’s  absolute  luminosity  and  the  planet’s  distance 
from  the  star.  This  is  of  interest  in  determining  the  habitable  zone,  in  which  the 
planet  surface  temperature  might  permit  liquid  water,  at  temperatures  between  273 
and  373  K;  it  is  located  closer  to  a  star  that  is  less  luminous. 

This  expression  assumes  that  all  of  the  sunlight  falling  on  the  planet  is  absor¬ 
bed,  but  some  of  it  is  always  reflected.  The  extent  to  which  a  planet  or  satellite 
reflects  light  from  the  Sun  is  specified  by  its  albedo,  A,  the  percentage  of  reflected 
light.  The  visual  albedo  measures  the  fraction  of  incoming  visible  sunlight  that  is 
reflected  directly  into  space,  on  a  scale  of  0. 0-1.0.  Rocky  bodies  like  the  planet 
Mercury  or  the  Earth’s  Moon  absorb  a  lot  of  incident  sunlight,  while  clouds  or  icy 
surfaces  reflect  it.  Thus,  the  Moon  and  Mercury  have  a  visual  albedo  of  0.12,  while 
cloud-covered  Venus  has  an  albedo  of  0.65,  helping  to  make  it  the  brightest  planet 
we  detect  with  our  eyes. 

Taking  the  albedo,  A,  into  account,  we  have: 


279(1  —  A)1/4 


(2.37) 


There  are  two  kinds  of  albedo,  the  Bond  albedo  (Bond  1863),  which  measures 
the  total  proportion  of  electromagnetic  energy  reflected,  and  the  visual  geometric 


Table  2.4 

planetsa 

Distances,  visual 

albedos,  effective 

temperatures,  and  mean  temperatures  of  the 

Planet 

Average  distance, 
DP  (AU) 

Visual  geometric 
albedo,  A 

Effective  temperature, 

Teff  (K) 

Mean 

Temperature13  (K) 

Mercury 

0.387 

0.106 

436 

440 

Venus 

0.723 

0.65 

252 

730 

Earth 

1.000 

0.367 

249 

281 

Mars 

1.524 

0.150 

217 

210 

Jupiter 

5.203 

0.52 

102 

165 

Saturn 

9.537 

0.47 

77 

134 

Uranus 

19.19 

0.51 

53 

76 

Neptune 

30.07 

0.41 

45 

73 

a  Distance  and  mean  temperature  from  the  Jet  Propulsion  Laboratory.  Effective  temperatures  are 
calculated  from  the  visual  geometric  albedos,  which  are  from  http://ssd.jpl.nasa.gov 
b  The  mean  surface  temperatures  for  the  terrestrial  planets  and  the  mean  cloud-top  temperatures 
for  the  giant  planets 
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albedo  that  refers  only  to  electromagnetic  radiation  in  the  visible  spectrum. 
The  geometric  albedo  of  an  astronomical  body  is  the  ratio  of  its  actual  brightness 
to  that  of  an  idealized  flat,  fully  and  isotropically  reflecting  disk  with  the  same 
cross-sectional  area.  The  Bond  albedos  for  Mercury,  Venus,  Earth  and  Mars  are 
0.119,  0.75,  0.29,  and  0.16,  respectively,  while  their  visual  geometric  albedos  are 
0.106,  0.65,  0.367,  and  0.150.  When  our  formula  is  applied  to  the  Earth  we  obtain 
Te p  (Earth)  ^  256  K  using  the  Bond  albedo  and  rep  (Earth)  «  249  K  using  the 
visual  geometric  albedo.  The  Bond  albedo  for  the  Earth’s  Moon  is  0.123,  so  its 
effective  temperature  would  be  higher,  at  about  270  K. 

The  effective  temperatures  of  the  planets  are  compared  to  their  mean  observed 
surface  or  cloud-top  temperatures  in  Table  2.4.  The  surface  of  Venus  is  much  hotter 
than  expected,  and  the  surface  of  the  Earth  is  somewhat  hotter,  both  a  consequence 
of  the  greenhouse  effect  (Focus  2.5).  The  giant  planets  are  also  hotter,  due  to  the  heat 
left  over  from  their  formation  or  to  helium  raining  down  inside  them. 

Focus  2.5  Global  warming  by  the  greenhouse  effect 

The  surface  temperature  of  a  terrestrial  planet  can  increase  when  its  atmo¬ 
sphere  traps  heat  near  the  surface,  warming  it  to  a  higher  temperature  than 
would  be  achieved  by  the  Sun’s  radiation  in  the  absence  of  an  atmosphere. 
Incoming  sunlight  is  partly  reflected  by  clouds,  but  the  rest  passes  through 
the  atmosphere  to  warm  the  planet’s  surface.  Much  of  the  surface  heat  is 
re-radiated  in  the  form  of  long  infrared  waves  that  are  absorbed  by  atmo¬ 
spheric  molecules  such  as  carbon  dioxide  or  water  vapor.  Some  of  the 
trapped  heat  is  re-radiated  downward  to  warm  the  planet’ s  surface  and  the  air 
immediately  above  it.  The  atmosphere  thus  acts  as  a  one-way  filter,  allowing 
the  warmth  of  sunlight  in,  and  holding  it  close  to  the  planet’s  surface  and 
elevating  the  temperature  there. 

The  idea  that  our  atmospheric  blanket  might  warm  the  Earth  was  sug¬ 
gested  by  the  French  mathematician  Jean-Baptiste  Fourier  (1768-1830)  and 
developed  by  the  Irish  scientist  John  Tyndall  (1820-1893).  Fourier  won¬ 
dered  how  the  Sun’s  heat  could  be  retained  to  keep  the  Earth  hot,  concluding 
that  sunlight  passes  through  the  atmosphere,  which  also  prevents  the  escape 
of  heat  from  the  planet’s  surface  (Fourier  1824,  1827). 

Tyndall  built  an  instrument  to  measure  the  heat-trapping  properties  of 
various  gases,  examining  the  transmission  of  infrared  radiation  through 
them.  He  found  that  the  main  constituents  of  our  atmosphere  -  oxygen  and 
nitrogen  -  were  transparent  to  both  visible  and  infrared  radiation.  Oxygen 
molecules,  denoted  02,  account  for  21  %  of  our  atmosphere,  while  nitrogen 
molecules,  designated  N2,  accounts  for  78  %.  These  diatomic,  or  two-atom, 
molecules  are  incapable  of  absorbing  any  noticeable  amounts  of  infrared 
heat  radiation. 

Tyndall  also  found  that  water  vapor,  designated  H20,  and  carbon  dioxide, 
denoted  C02,  absorb  significant  heat  even  though  they  are  minor  ingredients 
of  the  Earth’s  atmosphere  (Tyndall  1861,  1863).  As  Tyndall  realized,  these 
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gases  are  transparent  to  sunlight,  which  warms  the  ground,  but  partially 
opaque  to  the  infrared  rays,  which  are  trapped  near  the  surface  and  warm  our 
globe.  Water  vapor  and  carbon  dioxide  molecules  consist  of  three  atoms  and 
are  more  flexible  and  free  to  move  in  more  ways  than  diatomic  molecules,  so 
they  absorb  the  heat  radiation. 

Global  warming  by  heat-trapping  gases  in  the  air  is  now  known  as  the 
greenhouse  effect ,  but  this  is  a  misnomer.  The  air  inside  a  garden  greenhouse 
is  heated  because  it  is  enclosed,  preventing  the  circulation  of  air  currents  that 
would  carry  away  heat  and  cool  the  interior.  Nevertheless,  the  term  is  now  so 
common  that  we  continue  to  use  it  to  designate  the  process  by  which  an 
atmosphere  traps  heat  near  a  planet’s  surface. 

As  Tyndall  pointed  out,  our  environment  would  be  much  colder  at 
nighttime  in  the  absence  of  the  greenhouse  effect,  and  the  Earth  might 
otherwise  be  covered  with  frost.  The  warming  is  crucial  to  life  on  Earth.  If 
the  Earth  had  no  atmosphere,  it  would  be  directly  heated  by  the  Sun’s  light  to 
temperatures  below  the  freezing  point  of  water.  Fortunately,  the  extra  heat 
from  the  greenhouse  effect  keeps  the  oceans,  lakes  and  streams  from  turning 
into  ice. 

Nevertheless,  humans  have  increased  global  warming  by  burning  coal, 
oil,  and  gas  and  releasing  carbon  dioxide  into  the  atmosphere.  This  extra 
warming  has  been  rising  ever  since  the  industrial  revolution.  The  effect  was 
suggested  by  the  Swedish  scientist  Svante  Arrhenius  (1859-1927),  realized 
as  an  environmental  threat  by  the  American  scientists  Roger  Revelle 
(1909-1991)  and  Hans  E.  Suess  (1909-1920),  and  documented  by  Charles 
D.  Keeling’s  (1928-2005)  measurements  of  the  atmospheric  carbon  dioxide 
(Arrhenius  1896;  Revelle  and  Suess  1957;  Keeling  1960,  1978,  1992  ).  Weart 
(2008)  describes  the  discovery  of  global  warming. 

The  Nobel  Peace  Prize  was  awarded  in  2007  jointly  to  the  Intergovern¬ 
mental  Panel  on  Climate  Change  and  to  Albert  Arnold  (Al)  Gore  Jr.  (1948-  ) 
for  their  efforts  to  build  up  and  disseminate  greater  knowledge  about  man¬ 
made  climate  change,  and  to  lay  the  foundations  for  the  measures  that  are 
needed  to  counteract  such  change.  The  film  entitled  An  Inconvenient  Truth 
(2006)  documents  Gore’s  campaign  to  make  the  issue  of  global  warming,  by 
human  emissions  of  heat-trapping  gases,  a  recognized  problem. 


2.6  The  Energy  of  Light 

When  radiation  moves  in  space  from  one  place  to  another,  it  will  behave  like  trains 
of  waves.  But  when  radiation  is  absorbed  or  emitted  by  atoms,  it  behaves  not  as  a 
wave  but  as  a  package  of  energy,  or  like  a  particle,  a  photon .  A  photon  is  a  discrete 
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quantity  of  energy  associated  with  electromagnetic  radiation.  Thus,  light  has  a 
wave-particle  duality;  it  can  act  light  a  wave  and  a  particle  depending  on  the 
situation  (De  Broglie  1923). 

Photons  have  no  electric  charge  and  travel  at  the  speed  of  light.  They  are 
created  whenever  a  material  object  emits  electromagnetic  radiation,  and  they  are 
consumed  when  matter  absorbs  radiation.  And  each  atom,  ion,  or  molecule  can 
only  absorb  and  radiate  at  a  very  specific  set  of  photon  energies.  (An  ion  is  an  atom 
that  has  lost  one  or  more  electrons.) 

The  ability  of  radiation  to  interact  with  matter  is  determined  by  the  energy  of  its 
photons. 

Photon  energy  depends  on  the  wavelength  or  frequency  of  the  radiation.  Waves 
with  shorter  wavelengths,  or  higher  frequencies,  correspond  to  photons  with  higher 
energy.  That  is,  the  energy,  E ,  transported  by  a  particular  photon  is  directly  pro¬ 
portional  to  the  radiation  frequency,  v,  and  inversely  proportional  to  the  radiation 
wavelength,  2.  The  photon  energy,  E ,  is  given  by: 

E  =  hv  =  ^-  ,  (2.38) 

A 

_ o  q 

where  h  is  the  Planck  constant  with  the  value  h  =  6.626  x  10  J  s,  the 
frequency  is  given  in  Hz  or  s  \  and  the  wavelength  is  in  m. 

The  idea  that  light  acts  like  a  particle,  the  photon,  when  interacting  with  matter 
originated  when  Albert  Einstein  (1879-1955)  explained  the  photoelectric  effect ,  in 
which  some  metals  release  a  current  of  electrons  when  light  shines  on  them. 
Measurements  of  this  effect  indicated  that  the  kinetic  energy  of  the  individual 
escaping  electrons  increases  with  the  frequency  of  the  incoming  light  wave. 
Einstein  explained  the  observations  by  supposing  that  individual  electrons  are  not 
hit  by  a  continuous  stream  of  light  energy,  but  by  an  individual  photon  of  light 
with  an  energy  hv  (Einstein  1905a,  b). 

Einstein  was  awarded  the  1921  Nobel  Prize  in  Physics  for  his  services  to 
theoretical  physics,  and  especially  for  his  discovery  of  the  law  of  the  photoelectric 
effect.  The  American  scientist  Robert  A.  Millikan  (1868-1953)  subsequently 
endorsed  the  photon  interpretation,  despite  his  initial  reservations,  and  used  the 
effect  to  measure  the  value  of  Planck’s  constant  h  (Millikan  1916);  he  received  the 
1924  Nobel  Prize  in  Physics  for  his  work  on  the  elementary  charge  of  electrons 
and  on  the  photoelectric  effect. 

The  amount  of  energy  transported  by  a  single  photon  is  quite  small.  For  yellow 
light,  the  wavelength  2  =  580  nm,  so  the  frequency  v  =  5.17  x  10 14  Hz,  and  the 
photon  energy,  E ,  is  only  3.42  x  10“ 19  J.  A  hundred-watt  light  bulb  radiates  a 
power  of  100  J  s_1,  so  it  sends  out  an  incredible  2.9  million  million  million,  or 
2.9  x  10  ,  photons  every  second. 

Radio  waves  have  even  smaller  photon  energy,  when  compared  with  the 
photons  of  visible  light.  The  low  energies  of  the  radio  photons  cannot  easily  excite 
the  atoms  of  our  atmosphere,  so  radio  photons  easily  pass  through  the  air.  Visible 
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radiation  can  also  slip  through  the  Earth’s  atmosphere  with  little  trouble.  Its 
photons  are  too  energetic  to  resonate  with  molecular  vibrations  and  they  are  too 
feeble  to  excite  atoms. 

Ultraviolet  photons  are  sufficiently  energetic  to  tear  off  electrons  from  atoms 
and  many  molecules  in  the  Earth’s  atmosphere,  particularly  in  the  ozone  layer. 
That’s  a  good  thing,  since  most  of  these  ultraviolet  photons  cannot  reach  the 
ground.  If  they  did  they  would  cause  lots  of  damage  to  our  skin  and  eyes. 

Astronomers  often  describe  energetic,  short-wavelength  radiation,  such  as 
x-rays  or  gamma  rays,  in  terms  of  their  energy  rather  than  their  wavelength  or 
frequency.  At  the  atomic  level,  the  natural  unit  of  energy  is  the  electron  volt, 
or  eV.  One  electron  volt  is  the  energy  an  electron  gains  when  it  passes  across  the 
terminals  of  a  1-volt  battery.  A  photon  of  visible  light  has  an  energy  of  about  two 
electron  volts,  or  2  eV.  Much  higher  energies  are  associated  with  nuclear  pro¬ 
cesses;  they  are  often  specified  in  units  of  millions  of  electron  volts,  denoted  MeV. 
A  somewhat  lower  unit  of  energy  is  1,000  electron  volts,  called  kilo-electron  volts 
and  abbreviated  keV;  it  is  often  used  to  describe  x-ray  radiation.  For  conversion 
between  energy  units,  1  eV  =  1.602  x  10“ 19  J  and  1  keV  =  1.602  x  10“ 16  J. 

The  x-ray  region  lies  between  1  and  100  keV  of  energy.  There  are  soft  x-rays 
with  relatively  low  energy  and  modest  penetrating  power,  with  energies  of 
1-10  keV.  The  hard  x-rays  have  higher  energy  and  greater  penetrating  power,  at 
10-100  keV.  Gamma  rays  are  even  more  energetic  than  x-rays,  exceeding 
100  keV  in  energy. 


2.7  Radiation  Scattering  and  Transfer 
2.7.1  Why  is  the  Sky  Blue  and  the  Sunsets  Red? 

Our  atmosphere  is  a  colorless  gas,  as  you  can  see  in  looking  at  the  air  in  your 
room,  but  the  sky  is  usually  blue  and  sunsets  are  red.  The  incident  sunlight 
contains  all  colors,  but  molecules  in  our  atmosphere  scatter  blue  light  from  the 
Sun  more  than  they  scatter  red  sunlight.  John  Tyndall  (1820-1893)  discovered 
the  effect  when  passing  light  through  a  clear  fluid  holding  small  particles  in 
suspension  (Tyndall  1861),  and  Lord  Rayleigh  (1842-1919)  derived  the  relevant 
equations  for  atmosphere  molecules  a  decade  later.  When  the  Sun  is  overhead, 
the  light  that  reaches  us  is  mostly  scattered  sunlight,  and  this  causes  the  sky  to 
appear  blue.  When  the  Sun  sets,  its  rays  pass  through  a  maximum  amount  of 
atmosphere,  and  most  of  the  blue  light  is  scattered  out  before  it  reaches  us.  The 
setting  Sun  is  therefore  reddened,  and  atmospheric  dust  also  contributes  to  its 
apparent  red  color. 
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2.7.2  Rayleigh  Scattering 


The  scattering  of  radiation  by  a  particle  depends  on  the  size,  a ,  of  the  particle  and 
the  wavelength,  2,  of  the  radiation.  When  the  particle  is  much  smaller  in  size  than 
the  wavelength,  or  a<2,  then  the  effect  is  known  as  Rayleigh  scattering ,  named 
after  Lord  Rayleigh  (1842-1919).  It  applies  to  gas  molecules  that  scatter  visible 
sunlight,  explaining  why  the  sky  is  blue  and  why  a  sunset  red. 

The  intensity  of  Rayleigh  scattered  radiation,  /,  by  a  spherical  particle  of  radius 
a  at  wavelength  2  for  an  incident  wave  of  intensity  /0  is  (Rayleigh  1871,  1899): 
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(2.39) 


where  D  is  the  distance  from  the  sphere  to  the  observation  point,  the  scattering 
angle  6  is  the  angle  between  the  direction  of  propagation  of  the  incident  wave  and 


the  direction  of  observation,  and  n  is  the  relative  index  of  refraction  n  = 

82ii2/(siiii)]1^2  between  the  sphere,  denoted  by  subscript  2,  and  the  surrounding 
medium,  labeled  with  subscript  1 ,  the  s  denotes  the  dielectric  constant  and  ju  is  the 
magnetic  permeability. 

The  amount  of  Rayleigh  scattering  from  a  single  particle  can  also  be  expressed 
as  a  scattering  cross  section,  as  given  by  (Rayleigh  1871): 
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The  major  molecular  constituent  in  our  atmosphere,  nitrogen,  has 
as  =  5.1  x  10“  m  in  green  light  at  a  wavelength  of  2  =  530  nm. 

The  strong  wavelength  dependence  of  the  Rayleigh  scattering,  which  varies  as 
2-4,  means  that  the  shorter  blue  wavelengths  are  scattered  much  more  than  the 
longer,  red  wavelengths.  Since  the  molecules  in  our  atmosphere  are  much  smaller 
than  the  wavelengths  of  colored  light,  the  blue  component  of  sunlight  is  more 
strongly  scattered  down  to  our  eyes  than  the  other  colors,  creating  our  bright  blue 
sky.  At  sunset  the  Sun’s  rays  pass  through  a  maximum  amount  of  atmosphere; 
most  of  the  blue  sunlight  is  then  scattered  out  of  our  viewing  direction,  and  the 
setting  Sun  is  colored  red.  Dust  in  the  air  also  helps  redden  the  sunset. 

Dust  particles  are  larger  than  molecules,  and  comparable  in  size  to  the  wave¬ 
length  of  visible  light.  The  equations  that  describe  the  scattering  are  then  more 
complicated;  it  is  known  as  Mie  scattering  after  the  German  physicist  Gustav  Mie 
(1869-1957)  who  first  published  its  mathematical  equations.  They  are  used  to 
describe  the  scattering  of  starlight  by  interstellar  dust,  which  reddens  the  light  of 

distant  stars  and  has  a  relatively  weak  dependence  on  wavelength,  varying  as  2_1. 
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2.7.3  Thomson  and  Compton  Scattering 

Electromagnetic  radiation  can  be  scattered  by  a  free  electron,  which  is  unattached 
to  an  atom.  The  electric  field  of  the  incident  wave  accelerates  the  electron,  which 
moves  in  the  direction  of  the  oscillating  electric  field  and  emits  radiation  at  the 
same  wavelength,  or  frequency,  as  the  incident  wave.  The  scattering  is  described 
by  the  Thomson  scattering  cross  section,  which  is  independent  of  the  wavelength, 
or  frequency,  of  the  incident  radiation. 

Thomson  scattering  is  very  important  deep  within  the  Sun,  where  the  temper¬ 
atures  are  high  enough  to  ionize  the  atoms,  producing  numerous  free  electrons  that 
scatter  radiation  produced  by  nuclear  fusion  reactions  in  the  solar  core  and 
determine  how  that  radiation  works  it  way  out  to  the  visible  disk  of  the  Sun  (see 
Sect.  8.5).  It  also  establishes  the  upper  limit  to  the  luminosity  of  a  star,  known  as 
the  Eddington  luminosity,  which  is  related  to  the  largest  mass  a  star  may  have 
(Sect.  10.1,  Focus  10.2). 

The  English  physicist  Joseph  John  Thomson  (1856-1940)  first  provided  the 
expression  for  the  total  scattered  power,  P,  or  the  energy  scattered  per  unit  time  in 
all  directions  (Thomson  1903,  1906),  which  is  given  by: 

P  =  aTcU ,  (2.41) 

where  U  is  the  energy  density  of  the  incident  radiation,  c  is  the  speed  of  light,  and 
the  Thomson  scattering  cross  section,  oT,  is  given  by: 

=  Tr'  =  T  =  6  6525  x  llr  m  ■  (2  42) 

where  the  classical  electron  radius  re  =  2.818  x  10“ 15  m,  the  electron  charge 

e  =  1.602  x  10  19  C,  the  electric  constant  in  vacuum  £o  =  8.854  x  10“12  Fm-1, 

_ Q  1 

the  rest  mass  of  the  electron  me  =  9.1094  x  10  kg,  and  the  speed  of  light 

o  1 

c  =  2.9979  x  10  m  s  .  Notice  that  the  free  electron  acts  as  if  it  had  the  classical 
electron  radius  when  interacting  with  radiation. 

The  Thomson  scattered  radiation  is  polarized  along  the  direction  of  the  elec¬ 
tron’s  motion,  or  along  the  direction  of  the  oscillating  electric  field  of  the  incident 
radiation.  The  power  scattered  per  unit  solid  angle,  dP/dQ,  therefore  depends  on 
the  angle  6  between  the  direction  of  the  electron’s  motion  and  the  direction  of  the 
observer,  or: 


dP  3 

~lF.  =  ^~aTU  sin2  (2-43) 

usl  o7l 

Shortly  after  the  big  bang  origin  of  the  expanding  universe,  it  was  so  hot  that 
the  universe  was  completely  opaque  to  electromagnetic  radiation  as  the  result  of 
Thomson  scattering.  The  cosmic  microwave  background  radiation,  dating  back  to 
shortly  after  the  big  bang,  is  thought  to  be  linearly  polarized  as  a  result  of  Thomson 
scattering  (see  Sect.  15.2). 
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The  Thomson  scattering  cross  section  is  applicable  whenever  the  incident 
photon  energy  is  much  less  than  the  rest  mass  energy  of  the  electron,  for  radiation 
frequency  v  <C  mec2 /h  «  1020Hz.  When  the  photon  energy  of  the  incident  elec¬ 
tromagnetic  radiation  is  comparable  to,  or  larger  than,  the  rest  mass  energy  of  the 
free  electron,  or  for  frequencies  v  >  1020  Hz,  the  incident  radiation  transfers  energy 
to  the  electron,  and  the  scattered  photon  has  less  energy,  or  a  lower  frequency  and 
longer  wavelength,  than  the  incident  one.  The  effect  is  named  Compton  scattering 
or  the  Compton  effect ,  after  the  American  physicist  Arthur  H.  Compton 
(1892-1962)  who  first  observed  and  explained  it,  receiving  the  1927  Nobel  Prize 
in  Physics  for  his  discovery.  The  change  in  wavelength,  A2,  caused  by  Compton 
scattering  from  an  electron  that  is  at  rest,  or  not  moving,  is  given  by  (Compton 
1923a,  b): 


A2  =  ^2  —  = - (1  —  cos  0)  =  2c(  1  —  cos  0),  (2.44) 

mec 

where  k\  is  the  wavelength  of  the  incident  radiation,  ^  is  the  wavelength  of  the 
Compton  scattered  radiation,  h  is  the  Planck  constant,  me  is  the  rest  mass  of  the 
electron,  c  is  the  speed  of  light,  and  6  is  the  scattering  angle,  or  the  angle  by  which 
the  incident  radiation  is  deflected.  The  quantity  2c  =  h/(mec )  =  2.426  x  10-12  m 
is  known  as  the  electron  Compton  wavelength.  The  Compton  wavelength  for  any 
other  subatomic  particle  is  given  by  the  same  expression  with  me  replaced  by  the 
mass  of  the  particle. 

In  the  inverse  Compton  effect ,  the  electrons  are  not  at  rest,  and  may  be  moving 
at  high  speeds.  These  high-energy  electrons  scatter  low  energy  photons,  and  the 
photons  now  gain  energy  in  the  Compton  interaction  and  the  electrons  lose  energy. 
When  the  electron’s  speed  is  large,  approaching  that  of  light,  the  scattered  fre¬ 
quency,  V2,  for  incident  radiation  of  frequency  V\  is  given  by: 

V2  y  V\  fory/nq  mec 2,  (2-45) 

the  scattering  cross  section  is  <7$  =  y2aT  for  Thomson  scattering  cross  section  aT, 
and  the  total  energy  radiated  per  unit  time,  P,  by  an  electron  passing  through 
radiation  of  energy  density  U  is  given  by 

P  «  y2aTcU,  (2.46) 

where  the  energy  of  the  electron  is  ymec 2  and  y  is  the  Lorentz  factor  y  =  [1  — 
(V/c)2]-172  for  an  electron  moving  at  velocity  V. 

The  scattered  radiation  from  high-energy  electrons  with  Lorentz  factors 
y  =  1,000  has  a  frequency  that  is  a  million  times  that  of  the  incident  radiation. 
Thus  radio  radiation  becomes  ultraviolet  radiation,  infrared  radiation  becomes 
x-rays  and  optical  radiation  becomes  gamma  rays. 

When  the  electron  velocity  is  high  and  yhv\  mec 2,  all  of  the  electron  energy 
is  transferred  into  the  scattered  radiation  regardless  of  the  incident  photon 
frequency  and  V2  =  ymec2 /h  for  y/nq  mec2 . 
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Inverse  Compton  scattering  can  quench  the  synchrotron  radiation  of  cosmic 
radio  sources,  which  is  emitted  by  high-speed  electrons,  and  the  effect  can  be 
important  in  the  x-ray  radiation  of  relativistic  electrons  being  accreted  by  a  black 
hole.  Longair  (2011)  has  provided  applications  of  scattering  formulae  to  astro¬ 
nomical  objects  in  high-energy  situations. 


2.7.4  Radiation  Transfer 


Once  radiation  is  emitted  from  an  astronomical  object,  it  must  pass  through  inter¬ 
vening  space  before  it  reaches  the  observer.  The  radiation  can  be  absorbed  when 
passing  through  a  layer  or  cloud  of  matter,  and  the  same  material  can  also  emit 
radiation.  The  material’s  effect  on  the  radiation  is  therefore  characterized  by  an 
absorption  coefficient  per  unit  length  av  at  frequency  v  and  emission  coefficient  sv  at 
frequency  v.  For  matter  in  thermodynamic  equilibrium  at  temperature  T, 

sv  =  gcvBv(T ),  (2.47) 

a  result  derived  by  Gustav  Kirchhoff  (1824-1887)  and  hence  known  as  Kirchhoff’s 
law  (Kirchhoff  1860).  Here  BV(T)  is  the  Planck  distribution  for  thermal,  or  black- 
body,  radiation  discussed  in  the  previous  Sect.  2.4,  and  the  emission  coefficient  sv  is 
the  power  per  unit  volume  per  unit  frequency  interval  per  unit  solid  angle. 

The  optical  depth,  denoted  by  tv,  characterizes  the  absorption  capability  of  the 
intervening  matter  at  radiation  frequency  v.  It  is  given  by: 


Tv 


(2.48) 


for  a  cloud  or  layer  of  thickness  L  in  the  v  direction  and  an  absorption  coefficient 
per  unit  length  av.  For  radiation  of  incident  intensity  7V(0),  the  intensity  of  radi¬ 
ation,  7V(L),  on  leaving  the  cloud  will  be: 


IV(L)  =  If  0)  exp(-Tv)  +  —  [1 

xv 


exp(-Tv)], 


(2.49) 


and  the  intensity  Iv  (cloud)  of  the  thermal  radiation  emitted  by  the  cloud  is  given  by: 


Iv (cloud)  =  /  sv  exp(— otvx)dx  «  Bv(T)[l  —  exp(— tv)],  (2.50) 


o 


with  BV(T)  =  £v/av.  If  the  cloud  or  layer  has  negligible  radiation  at  frequency  v,  its 
emission  coefficient  is  effectively  zero  and: 


IfL)  =  /v(0)exp(— tv). 


(2.51) 
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Fig.  2.6  Looking  through  a  cloud  When  an  interstellar  cloud  happens  to  lie  along  the  line  of 
sight  to  a  star,  the  observed  temperature,  denoted  T0,  can  differ  from  the  star’s  temperature, 
abbreviated  Ts.  That  is  because  the  cloud,  with  temperature  Tc,  will  shine  like  any  hot  gas, 
emitting  its  own  radiation,  and  the  cloud  can  also  absorb  and  scatter  the  star’s  radiation  that  is 
passing  through  it.  A  thick,  dense  cloud  can  absorb  all  the  incident  star’s  radiation,  so  you  don’t 
even  see  the  star;  just  the  cloud  is  detected.  In  contrast,  a  thin,  rarefied  cloud  can  be  transparent; 
therefore,  you  look  right  through  it,  detecting  the  star  as  if  the  cloud  wasn’t  even  there.  The  cloud 
is  characterized  by  its  optical  depth,  denoted  by  the  symbol  tv,  which  depends  on  the  substance  in 
the  cloud,  the  thickness  of  the  cloud  along  the  line  of  sight,  and  the  observation  frequency, 
designated  by  v.  An  optically  thin  cloud  is  transparent  to  the  radiation  at  this  frequency  and  the 
cloud  optical  depth  tv  <C  1 .  An  optically  thick  cloud,  with  rv  1  is  opaque,  and  at  this  frequency 
we  cannot  observe  anything  behind  the  cloud 


The  term  optical  depth  implies  that  we  are  talking  about  radiation  at  the  visible 
wavelengths  we  detect  with  our  eyes.  If  the  optical  depth  tv  1,  along  a  ray  path 
through  a  cloud  or  layer,  then  that  cloud  or  layer  is  known  as  optically  thick.  On 
the  other  hand,  a  transparent  cloud  or  layer  is  known  as  optically  thin  if  tv  <C  1 .  It 
follows  that  an  optically  thick  object  extinguishes  the  light  of  a  source  behind  it, 
whereas  an  optically  thin  object  absorbs  negligible  amounts  of  light  passing 
through  it.  More  generally,  the  terms  optically  thick  and  optically  thin  roughly 
mean  opaque  and  transparent  at  the  wavelength  or  frequency  of  electromagnetic 
radiation  we  are  considering. 

When  the  emission  coefficient  is  not  zero,  the  brightness,  Bcv{T ),  of  the  thermal 
emission  from  the  intervening  cloud  or  layer  is  given  by: 

Bcv(r)  =Bv(r)[l  -exp(-Tv)] 

=  BV(T)  if  tv  1  (optically  thick)  (2-52) 

=  tvBv(7')  if  tv  «  1  (optically  thin), 

where  BCv(T )  denotes  the  cloud  brightness  at  frequency  v  and  temperature  T. 

The  observed  brightness  and  temperature  of  a  source  that  lies  behind  a  cloud 
will  depend  on  the  temperature  of  the  source,  the  temperature  of  the  cloud,  and  the 
optical  depth  of  the  cloud  (Fig.  2.6).  When  the  cloud  is  transparent  at  the  obser¬ 
vation  wavelength  or  frequency,  which  corresponds  to  the  completely  optically 
thin  situation,  the  source  temperature  is  observed.  If  the  cloud  is  opaque,  or 
optically  thick,  the  cloud’s  temperature  is  observed. 

If  a  source  of  brightness,  Bs(T),  at  frequency,  v,  and  temperature,  Ts,  is  irra¬ 
diating  a  cloud  or  layer  of  temperature,  Tc,  the  total  observed  brightness 
B0v(Ttot)  is  given  by 
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Bov(J tot)  —  BSv(Ts )  exp(— tv)  +  BCv(Tc)^ 


(2.53) 


where 


Ttot  =  exp(— tv)  +  Tc[l  —  exp(— tv) 


(2.54) 


For  the  optically  thin  case  iv  <  1  the  total  observed  temperature  would  be 
given  by: 


Ttot  —  Ts(l  —  tv)  +  x  VTC- 


(2.55) 


Chapter  3 

Gravity 


3.1  Ceaseless,  Repetitive  Paths  Across  the  Sky 

Look  up  at  the  Sun  as  it  glides  across  the  bright  blue  sky,  or  watch  the  Moon’s 
nightly  voyage.  On  dark,  moonless  nights  you  also  might  notice  a  bright  planet 
traveling  against  the  stars. 

Ancient  astronomers  thought  that  the  Moon,  Sun,  and  planets  all  moved  in 
circles,  forever  wheeling  around  the  central,  unmoving  Earth,  and  the  Moon  does 
indeed  revolve  about  our  planet.  But  the  Earth  and  other  planets  revolve  about  the 
Sun,  and  the  Sun  does  not  revolve  around  the  Earth. 

So,  motion  is  a  matter  of  perspective.  It  is  always  relative,  perceived  only  in 
relation  to  something  else,  by  comparison  with  another  object  that  is  either  at  rest 
or  moving  in  a  different  way. 

The  earliest  Sun-centered  theories  of  planetary  motion  had  one  fatal  flaw;  they 
also  initially  assumed  that  the  planets  move  in  circular  orbits.  This  explanation 
couldn’t  be  reconciled  with  careful  observations  of  the  changing  positions  of  the 
planets  in  the  sky  that  were  meticulously  carried  out  by  the  Danish  astronomer, 
Tycho  Brahe  (1546-1601).  Johannes  Kepler  (1571-1630),  Brahe’s  assistant  and 
eventual  successor,  found  that  the  architecture  of  the  solar  system  had  to  be 
described  by  noncircular  shapes. 

After  8  years  of  computations,  Kepler  found  in  1609  that  the  observed  plane¬ 
tary  orbits  could  be  described  by  ellipses  with  the  Sun  at  one  focus  (Kepler  1609). 
This  ultimately  became  known  as  Kepler’ s  first  law  of  planetary  motion.  Although 
the  planetary  orbits  are  nearly  circular,  they  are  slightly  elliptical  in  shape. 

At  about  the  same  time,  Kepler  described  how  a  planet  moves  at  different 
speeds  as  it  travels  along  its  elliptical  orbit.  He  was  able  to  state  the  relationship  in 
a  precise  mathematical  form  now  called  Kepler’s  second  law ,  which  can  be 
explained  with  the  help  of  Fig.  3.1.  Imagine  a  line  drawn  from  the  Sun  to  a  planet. 
As  the  planet  swings  about  its  elliptical  path,  the  line  (which  will  increase  and 
decrease  in  length)  sweeps  out  a  surface  at  a  constant  rate.  This  also  is  known  as 
the  law  of  equal  areas.  During  the  three  equal  time  intervals  shown  in  Fig.  3.1,  the 


K.  R.  Lang,  Essential  Astrophysics,  Undergraduate  Lecture  Notes  in  Physics, 
DOI:  10.1007/978-3-642-35963-7_3,  ©  Springer- Verlag  Berlin  Heidelberg  2013 


69 


70 


3 


Gravity 


Planet’s  Orbit 


Fig.  3.1  Kepler’s  first  and  second  laws  The  German  astronomer  Johannes  Kepler  (1571-1630) 
published  his  first  two  laws  of  planetary  orbital  motion  in  1609.  His  first  law  states  that  the  orbit 
of  a  planet  about  the  Sun  is  an  ellipse  with  the  Sun  at  one  focus.  The  other  focus  of  the  ellipse  is 
empty.  According  to  Kepler’s  second  law,  the  line  joining  a  planet  to  the  Sun  sweeps  out  equal 
areas  in  equal  times.  This  is  also  known  as  the  law  of  equal  areas,  and  is  represented  by  the 
equality  of  the  three  shaded  areas  ABS ,  CDS,  and  EFS.  It  takes  as  long  to  travel  from  A  to  B  as 
from  C  to  D  and  from  E  to  F.  A  planet  moves  most  rapidly  when  it  is  nearest  the  Sun,  at 
perihelion;  a  planet’s  slowest  motion  occurs  when  it  is  farthest  from  the  Sun,  at  aphelion 


planet  moves  through  different  arcs  because  its  orbital  speed  changes,  but  the  areas 
swept  out  are  identical. 

So,  a  planet  moves  faster  when  it  is  closer  to  the  Sun,  and  the  modern  expla¬ 
nation  for  this  involves  one  of  the  fundamental  concepts  of  physics,  known  as  the 
conservation  of  angular  momentum  (Focus  3.1). 

Focus  3.1  Moving  along  an  elliptical  trajectory 

According  to  Kepler’s  first  law ,  the  planets  move  in  elliptical  orbits 
(Fig.  3.2).  A  planet’s  closest  point  to  the  Sun,  when  the  planet  moves  most 
rapidly,  is  called  the  perihelion;  and  its  most  distant  point  is  the  aphelion, 
where  the  planet  moves  most  slowly.  The  distance  between  the  perihelion 
and  aphelion  is  the  major  axis  of  the  orbital  ellipse.  Half  that  distance  is 
called  the  semi-major  axis,  designated  by  the  symbol,  a .  The  semi-major 
axis  of  the  Earth’s  elliptical  orbit  about  the  Sun  is  called  the  astronomical 
unit,  abbreviated  AU.  It  sets  the  scale  of  the  solar  system,  and  when  com¬ 
bined  with  the  Earth’s  yearlong  orbital  period  permits  the  determination  of 
the  Sun’s  mass  and  the  Earth’s  orbital  velocity,  but  only  after  astronomers 
had  found  out  how  large  the  AU  is. 


3.1  Ceaseless,  Repetitive  Paths  Across  the  Sky 
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The  distances  to  the  solar  focus  and  the  shape  of  an  ellipse  are  determined 
by  its  eccentricity,  e.  At  perihelion  the  distance  between  the  planet  and  the 
Sun  is  [a  (1  —  e )]  and  at  aphelion  that  distance  is  [a  (1  +  e)].  If  e  =  0  its 
shape  is  a  circle.  The  ellipse  becomes  more  elongated  and  squashed  as  its 
eccentricity  increases  toward  e  =  1.0.  The  eccentricity  of  the  planetary 
ellipse  has  been  greatly  exaggerated  in  Fig.  3.2,  with  an  eccentricity  of  about 
e  =  0.5. 

With  the  exception  of  Mercury,  all  of  the  major  planets  have  orbits  that 
are  nearly  circular,  with  eccentricities  of  less  than  e  =  0.1.  This  means  that 
the  Sun  is  very  near  the  center  of  each  orbital  ellipse.  For  Mercury, 
a  =  0.387  AU  and  e  =  0.206,  so  its  distance  from  the  Sun  is  just  0.307  AU 
at  perihelion  and  quite  a  lot  greater  at  aphelion,  at  0.467  AU. 

Conservation  of  angular  momentum  explains  why  a  planet  keeps  on 
whirling  around  the  Sun,  and  why  its  speed  is  fastest  at  perihelion.  For  a 
planet  of  mass,  M,  orbiting  the  Sun  at  speed  or  velocity,  V,  and  a  distance,  D, 

Angular  momentum  =  M  x  V  x  D.  (3-1) 

By  the  way,  in  physics  velocity  has  an  amount,  its  magnitude,  and  a 
direction.  Speed  is  the  magnitude  of  the  velocity.  In  astronomy  the  velocity 
is  often  just  given  by  its  observed  magnitude  in  a  given  direction,  the  speed, 
so  the  orbital  velocity  is  given  as  its  speed  along  the  orbit. 

The  conservation  law  states  that  as  long  as  no  outside  force  is  acting  on  a 
planet,  its  angular  momentum  cannot  change.  This  means  that  a  planet 
continues  moving  along  without  anything  pushing  or  pulling  it.  The  mass 
does  not  change,  so  when  the  distance  from  the  Sun  decreases,  at  perihelion, 
the  velocity  increases  to  compensate  and  keep  the  angular  momentum 
unchanged;  at  aphelion  the  distance  from  the  Sun  increases  so  the  speed 
must  decrease. 


Kepler’s  third  law  took  another  10  years  of  work  to  discover  (Kepler  1619).  In 
this  musical  pattern,  each  planet  produces  its  own  unique  “note”  as  it  moves 
around  the  Sun,  with  an  orbital  period  that  increases  with  a  planet’ s  distance  from 
the  Sun.  Kepler’s  harmonic  relationship  states  that  the  squares  of  the  planetary 
periods  are  in  proportion  to  the  cubes  of  their  average  distances  from  the  Sun.  If  PP 
denotes  the  orbital  period  of  a  planet  measured  in  years  and  aP  describes  its  semi¬ 
major  axis  measured  in  AU,  then  Kepler’s  third  law  states  that  P2p  =  aP.  This 
expression  is  illustrated  in  Fig.  3.3,  for  the  major  planets  and  for  the  brighter 
moons  of  Jupiter.  It  also  implies  that  a  more  distant  planet  moves  with  a  slower 
speed.  For  a  circular  orbit,  the  planet’s  uniform  velocity  VP  =  2naP/PP  =  con¬ 
stant  x  aP  l^2,  which  falls  off  as  the  inverse  square  root  of  aP. 
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Fig.  3.2  Ellipse  Each  planet  moves  in  an  ellipse  with  the  Sun  at  one  focus.  The  length  of  a  line 
drawn  from  the  Sun,  to  a  planet  and  then  to  the  empty  focus,  denoted  by  the  dashed  line ,  is 
always  2a ,  or  twice  the  semi-major  axis,  a.  The  eccentricity,  or  elongation,  of  the  planetary 
ellipse  has  been  greatly  overdone  in  this  figure;  planetary  orbits  look  much  more  like  a  circle 


Fig.  3.3  Kepler’s  third  law  The  orbital  periods  of  the  major  planets  in  years  are  plotted  against 
the  semi-major  axes  of  their  elliptical  orbits  in  astronomical  units  (AC/),  using  a  logarithmic  scale. 
The  straight  line  that  connects  the  points  has  a  slope  of  3/2,  thereby  verifying  Kepler’s  third  law 
that  states  that  the  square  of  the  orbital  periods  increase  with  the  cubes  of  the  planetary  distances. 
The  German  astronomer  Johannes  Kepler  (157 1-1630)  published  this  third  law  in  1619.  This  type 
of  relation  applies  to  any  set  of  bodies  in  elliptical  orbits,  including  Jupiter’s  four  largest  satellites 
shown  in  the  inset,  with  a  vertical  axis  in  units  of  days  and  a  horizontal  axis  that  gives  the 
distance  from  Jupiter  in  AU  units 


3.1  Ceaseless,  Repetitive  Paths  Across  the  Sky 
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These  were  amazing  discoveries,  but  no  one  yet  had  explained  what  holds  up 
the  Moon  and  planets  in  their  orbits.  The  explanation  awaited  the  discovery  of 
gravity,  a  principle  that  rules  the  universe. 

3.2  Universal  Gravitational  Attraction 

What  moves  the  planets  within  their  well-defined  orbits?  Kepler  supposed  that 
some  invisible  magnetic  force  emanated  from  the  rotating  Sun,  and  that  this  force 
pushed  the  planets  through  space.  The  farther  the  planet  is  from  the  Sun,  the 
weaker  the  solar  force  and  the  slower  a  planet’s  motion  -  as  described  by  Kepler’s 
harmonic  relationship. 

Roughly  half  a  century  later,  the  great  English  scientist  Isaac  Newton 
(1643-1727)  proposed  another  unseen  agent,  the  invisible  gravitational  force  of 
the  Sun.  Newton  showed  that  the  pull  of  gravity  is  universal,  with  an  unlimited 
range  and  capacity  to  act  on  all  matter,  thereby  holding  the  Moon,  comets,  and 
planets  in  their  orbits. 

Gravitation  is  the  driving  and  organizing  force  of  the  universe  (see  Mac  Dougal 
2013);  that  is  why  it  is  known  as  universal  gravitation.  It  binds  stars  and  galaxies 
together  and  is  responsible  for  their  formation.  The  pull  of  gravity  keeps  our  feet 
on  the  ground,  so  we  rotate  with  the  spinning  Earth  and  stay  on  it.  The  atmosphere 
and  oceans  similarly  are  held  close  to  the  planet  by  its  relentless  gravitational  pull. 

It  is  gravity  that  explains  why  and  how  things  fall.  We  might  suppose,  as 
Aristotle  once  did,  that  a  heavy  object  will  fall  faster  than  a  lighter  one,  in  direct 
proportion  to  its  weight,  but  that  is  not  the  case.  Unless  some  outside  force  is 
involved,  such  as  wind,  all  objects  fall  at  the  same  rate,  regardless  of  their  weight. 
Galileo  Galilei  (1564-1642)  stated  the  idea  in  his  Discoursi  (Galilei  1638),  and 
apparently  the  same  idea  was  stated  17  centuries  before  that  by  the  Roman  poet 
Lucretius  (c.  99  BC-c.  55  BC)  in  De  rerum  natura  (Lucretius  55  BC). 

Galileo  also  proposed  that  any  undisturbed  body  will  fall  with  uniform  accel¬ 
eration,  and  he  showed  that  the  distance  traveled  by  an  object  falling  from  rest  is 
proportional  to  the  square  of  the  elapsed  time.  The  distance,  d ,  after  time,  t,  is 
given  by  d  =  gt  / 2,  where  g  «  9.8  m  s  is  the  local  acceleration  of  gravity  on 
the  Earth.  In  these  ways  Galileo  provided  a  scientific  foundation  for  Newton’s 
subsequent  theory  of  universal  gravitation. 

Newton  realized  that  the  power  of  gravity,  whose  pull  influences  the  motion  of 
falling  bodies,  seems  undiminished  even  at  the  top  of  the  highest  mountains.  He 
therefore  argued  that  the  Earth’s  gravitational  force  extends  to  our  Moon,  and 
showed  that  this  force  can  pull  the  Moon  into  its  orbit. 

Newton  showed  that  motions  everywhere,  whether  in  the  celestial  heavens  on 
the  ground,  are  described  by  the  same  concepts  and  that  all  material  objects  are 
subject  to  gravitation.  Therefore,  everything  in  the  observable  universe  moves  in 
predictable  and  verifiable  ways.  The  basic  ideas  are  that  a  moving  body  will 
continue  to  move  in  a  straight  line,  unless  acted  on  by  an  outside  force,  and  that 


74 


3  Gravity 


every  object  attracts  every  other  object  as  the  result  of  universal  gravitation.  These 
insights  resulted  in  Sir  Isaac  Newton  becoming  the  first  person  in  England  to  be 
knighted  for  his  scientific  work. 

It  was  his  friend,  the  English  astronomer  Edmond  Halley  (1656-1742),  who 
persuaded  the  secretive  Newton  to  write  his  greatest  work,  the  Philosophiae 
naturalis  principia  mathematica ,  or  the  Mathematical  Principles  of  Natural  Phi¬ 
losophy ,  commonly  known  as  the  Principia  (Newton  1687).  It  was  presented  to  the 
Royal  Society  of  London  in  1686,  which  withdrew  from  publishing  it  due  to 
insufficient  funds;  Halley,  a  wealthy  man,  paid  for  the  publication  the  following 
year. 

The  enormous  reach  of  gravity  can  be  traced  to  two  causes.  First,  gravitational 
force  decreases  relatively  slowly  with  distance,  which  gives  gravity  a  much  greater 
range  than  other  natural  forces,  such  as  the  strong  force  that  holds  the  nucleus  of  an 
atom  together.  Second,  gravitation  has  no  positive  and  negative  charge,  like 
electricity,  or  opposite  polarities  like  magnets.  This  means  that  there  is  no  grav¬ 
itational  repulsion  between  masses.  That  is,  the  force  of  gravity  acting  between 
two  objects  always  pulls  them  together  and  never  pushes  them  apart.  The  attractive 
forces  among  unlike  electrical  charges  in  an  atom  cancel  one  another,  shielding  it 
from  the  electrical  forces  of  any  other  atom. 

The  gravitational  force  is  mutual,  so  any  two  objects  attract  each  other,  and 
every  atom  in  the  universe  feels  the  gravitational  attraction  of  every  other  atom. 
Their  attraction  is  proportional  to  the  product  of  their  masses,  which  possess 
inertia,  the  tendency  to  resist  any  change  in  motion.  Mass  is  an  intrinsic  aspect  of 
an  object.  It  is  different  from  weight,  which  decreases  with  distance  from  the  main 
source  of  gravity.  An  astronaut,  for  example,  weighs  less  after  leaving  the  Earth, 
but  his  or  her  mass  is  just  the  same. 

As  expected,  the  strength  of  the  gravitational  force  decreases  with  increasing 
distance,  and  Newton  used  Kepler’s  relationship  between  a  planet’s  orbital  period 
and  distance  to  show  that  the  force  of  gravity  falls  off  as  the  inverse  square  of  the 
distance  from  the  center  of  the  main  source  of  gravity,  the  Sun. 

Newton  also  demonstrated  that  the  force  of  gravity  at  the  Earth’s  surface  is  the 
same  as  the  force,  diminished  by  distance,  which  holds  our  Moon  in  place  during 
its  endless  journey  around  the  Earth.  In  effect  our  planet’s  gravity  is  forever 
pulling  on  the  Moon,  so  it  is  perpetually  falling  toward  the  Earth  while  maintaining 
the  same  mean  distance  from  it.  Without  the  Earth’s  gravitational  pull,  the  Moon 
would  not  orbit  our  planet  but  instead  would  travel  out  into  space,  never  returning 
to  Earth.  The  Sun’s  gravity  similarly  deflects  the  moving  planets  into  their  curved 
paths,  so  they  forever  revolve  around  the  Sun  (Newton  1687). 

The  gravitational  power  of  an  individual  object  depends  on  its  mass  and 
diminishes  with  distance  from  it.  Expressed  mathematically,  any  mass,  Mu  pro¬ 
duces  a  gravitational  force,  FG ,  on  another  mass,  M2 ,  given  by  the  expression: 


GM\M2 

D2 


1 
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where  the  universal  gravitational  constant  denoted  G,  has  the  value 

11  o  9 

G  «  6.674  x  10  N  m  kg  ,  and  D  is  the  distance  between  the  centers  of  the 
two  masses.  This  expression  for  the  force  is  sometimes  called  an  inverse  square 
law ,  since  the  force  of  gravity  is  inversely  proportional  to  the  square  of  the  dis¬ 
tance  or  separation.  The  SI  unit  of  force  is  appropriately  called  the  newton, 
abbreviated  N,  and  it  is  equal  to  the  amount  of  net  force  required  to  accelerate  a 

_ o 

mass  of  1  kg  at  a  rate  of  1  m  every  second  squared,  so  1  N  =  1  kg  m  s  ,  and  we 
can  express  the  universal  constant  of  gravitation  with  the  units 
G  x  6.614  x  10“ 11  m3  kg-1  s“2. 

Any  two  masses  attract  one  another  with  a  gravitational  force  that  varies  in 
proportion  to  the  product  of  the  masses  and  the  inverse  square  of  the  separation 
between  their  centers.  The  constant  of  proportionality  -  the  universal  gravitational 
constant  G  -  was  not  measured  until  71  years  after  Newton’s  death,  and  then 
indirectly  by  Henry  Cavendish  (1731-1810).  Cavendish’s  aim  was  to  determine 
the  mass  density  of  the  Earth  and  because  the  planet’s  radius  was  known,  he  could 
effectively  weigh  the  world.  After  nearly  a  year  of  meticulous  observations, 
Cavendish  (1798)  announced  that  the  Earth  has  a  mass  density  of 

_ o 

pE  =  5,488  zb  33  kg  m  (when  corrected  for  a  small  arithmetical  error  in  his 
paper).  His  result  meant  that  the  mass  of  the  Earth  is  ME  =  4nREpE/3  « 
6  x  1024  kg,  where  the  approximate  radius  of  the  Earth,  RE  «  6.4  x  106  m,  was 
known  at  the  time.  (Distances  part  way  around  the  surface  of  the  Earth  had  been 
found  by  the  surveying  technique  of  triangulation,  and  combined  to  determine  the 
Earth’s  circumference  and  a  radius  of  about  6,400  km.)  In  Cavendish’s  time,  mass 
and  weight  were  assumed  to  be  equal  and,  as  he  stated  in  his  correspondence,  he 
succeeded  in  weighing  the  world.  It  weighed  in  at  a  little  more  than  6  billion 
trillion  metric  tons.  (A  metric  ton  is  1,000  kg  or  2,205  pounds). 

Although  he  didn’t  specifically  determine  the  gravitational  constant,  the  value 

_ i  i  o  _ 1  _ 9 

implied  from  Cavendish’s  work  is  G  =  6.754  x  10  m  kg  s  (Brush  and 
Holton  2001).  A  very  precise  value  of  G,  accurate  to  more  than  the  third  decimal 
place,  is  still  unknown,  since  gravity  is  a  relatively  weak  force  when  compared  to 
other  forces  that  might  act  on  the  relevant  experimental  apparatus  (Heyl  1930; 
Rose  et  al.  1969;  Luther  and  Towler  1982;  Gillies  1997;  Fixler  et  al.  2007).  The 
currently  accepted  value  is: 

G  =  6.67428  x  10“u  Nm2  kg“2  =  6.67428  x  10“u  m3  kg-1  s“2,  (3.3) 

with  an  uncertainty  of  1  part  in  10.  For  computations  involving  the  orbits  of  either 
the  natural  or  the  artificial  satellites  around  planets,  and  the  trajectories  of 
spacecraft  visiting  them,  astronomers  use  the  product  of  G  and  the  planet’s  mass 
MP ,  since  GMP  is  known  more  accurately  than  either  term  alone.  The  geocentric 
gravitational  constant,  GME ,  is,  for  example,  a  primary  astronomical  constant: 

GMe  =  3.986004391  x  1014  m3  s“2,  (3.4) 

O  A 

where  ME  =  5.9736  x  10"  kg  is  the  mass  of  the  Earth,  which  is  given  together 
with  other  physical  properties  of  the  planet  in  Table  3.1. 


76 


3  Gravity 


Example:  How  fast  are  the  Moon  and  planets  moving,  and  how  do  we 
measure  the  mass  of  the  planets? 

Assuming  a  circular  orbit  at  a  distance  D  with  period  P,  the  average  orbital 
speed  will  be  V  =  2nD/P ,  where  n  =  3.14159.  The  mean  distance  of  our 
Moon  from  the  Earth  is  D  =  384,400  km  =  3.844  x  108  m  and  its  orbital 
period  around  the  Earth  is  P  =  27.3  Earth  days,  where  1  day  =  86,400  s, 
and  its  average  orbital  speed  is  about  E02  km  s-1.  For  the  Earth’s  orbit 
around  the  Sun,  D  =  1  AU  =  1.496  x  1011  m  and  P  =  1  yr  =  3.156 

'—i  - * 

x  10  s,  with  an  average  orbital  speed  of  V  =  2nD/P  =  29.78  km  s_  ; 
since  one  hour  3,600  s,  the  Earth  is  moving  at  about  107,200  km/h,  a  lot 
faster  than  a  vehicle  on  the  highway.  Jupiter  is  located  at  a  distance  of  about 
5.2  AU  from  the  Sun,  so  from  Kepler’s  third  law,  in  which  the  square  of  the 
orbital  periods  scale  as  the  cubes  of  the  planetary  distance,  the  orbital  period 
Pj  of  Jupiter  about  the  Sun  will  be  Pj  =  (5.2  AU/E0  AU)3^2  =  1 1.86  years. 
Its  average  orbital  speed  is  about  13  km  s_1,  which  is  about  three  times 

slower  than  the  Earth’s  orbital  speed. 

We  can  estimate  the  mass  of  a  planet,  M,  from  the  motion  of  one  of  its  moons, 

2  3  2 

or  natural  satellites,  using  Kepler’s  third  law,  M  =  An  D  /{GP  ),  where  the 
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Newtonian  constant  of  gravitation  G  =  6.674  x  10  m  kg  s  .  For  our 
Moon,  with  the  distance  D  and  orbital  period  P  given  just  above,  we  infer  a  mass 
of  the  Earth  ME  k,  6.0  x  10"  kg.  The  orbital  parameters  for  Jupiter’s  natural 
satellite  Io  are  D  =  421,700  km  =  4.217  x  108  m  and  P  =  1.77  Earth 
days  =  1.53  x  105  s,  and  with  these  parameters  we  obtain  the  mass  of  Jupiter 
Mj  «  1.9  x  1027  kg  «  318  Me. 


Any  object  has  a  gravitational  potential  stored  within  it  due  to  its  efforts  at 
overcoming  relentless  gravity.  Two  separated  objects,  for  example,  have  worked 
against  the  gravitational  attraction  that  pulls  them  together,  achieving  a  reserve  of 
energy  and  a  potential  for  future  action. 

According  to  the  conservation  of  energy,  a  fundamental  law  of  physics,  energy 
cannot  be  created  or  destroyed,  just  transformed.  So  the  energy  that  went  into 
overcoming  the  pull  of  gravity  is  stored  in  any  object,  and  this  stored  potential 
energy  can  be  converted  into  the  kinetic  energy  of  motion. 

This  gravitational  potential  energy  is  due  to  an  object’s  position  and  is  asso¬ 
ciated  with  the  gravitational  force.  It  depends  on  the  height  of  the  object,  its  mass, 
and  the  strength  of  the  gravitational  field  it  is  in.  For  a  very  small  mass  m,  as  tiny 
as  a  point,  the  gravitational  potential  energy  U  when  separated  by  a  distance  r  in 
the  gravitational  field  of  another  point  mass  M  is: 


GMm 
U  = - 


r 
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Table  3.1  Earth’s  orbital  and  physical  properties 
Orbital  characteristics 

Po  =  orbital  period  of  Earth  about  Sun  =  365.25636  days  =  1.000  sidereal  year 
Vo  =  average  orbital  speed  of  Earth  about  Sun  =  2.9783  x  104  m  s”1  =  107,200  km  h-1 
aE  =  AU  =  astronomical  unit  =  mean  Earth-Sun  distance  =  1.495  9787  x  1011  m 
7i q  =  solar  parallax  =  arcsin  (ae/AU)  «  aej AU  w  8.794143  seconds  of  arc  (for  Earth’s 
equatorial  radius  ae  =  6.3781  x  106  m) 
e  =  eccentricity  =  0.01671 

aE  (1  +  e)  =  aphelion  =  1.52  098  232  x  10u  m  =  1.01671388  AU 

aE  (1  -  e)  =  perihelion  =  1.47  098  290  x  1011  m  =  0.98329134  AU 

Physical  characteristics 

Age  =  4.6  x  109  year 

Me  —  mass  =  5.9736  x  1024  kg 

Mq/Me  =  inverse  mass  =  332  946 

ae  =  equatorial  radius  =  6.3781  x  106  m 

ap  =  polar  radius  =  6.3568  x  106  m 

f  =  (ae  —  ap  )/ap  =  flattening  =  0.0033528  =  1/298.25642 

RE  =  mean  radius  =  {a2ap)1^  %  6.371  x  106  m 

pE  =  mean  mass  density  =  3 MEj  (4nRE)  =  5515  kg  m-3 

1  A  2  _ 9 

GMe  =  geocentric  gravitational  attraction  =  3.986  x  lO1^  m  s 
gE  =  equatorial  gravitational  acceleration  =  GME/aE  =  9.780  ms-2 

VescE  =  surface  escape  velocity  of  Earth  =  (2  GME/RE)1^2  ^  1.1186  x  104ms-1  ~  11.2kms-1 
B  =  magnetic  field  strength  of  Earth  (equator  to  poles)  =  0.3-0. 6  G  =  (3-6)  x  10-5  T 
(magnetic  field  poles  reverse  every  250,000  years) 

Pr  =  rotation  period  =  24  h  =  8.64  x  104  s 

dPr/dt  =  slow  down  of  rotation  =  0.002  s  century-1 

co  =  angular  velocity  of  rotation  =  7.292  x  10-5  radians  s-1 

Vr  —  equatorial  rotation  velocity  =  465.12  m  s-1  =  1,674.4  km  h-1 

A  =  albedo  (Bond)  =  0.306  or  albedo  (geometric)  =  0.367 

T  —  mean  surface  temperature  =  287.2  K 

Atmosphere 

P  =  mean  surface  pressure  at  sea  level  =  1  bar  =  1.01  x  105  Pa 
N2  =  nitrogen  molecule  =  78.08  %  by  volume 
02  =  oxygen  molecule  =  20.95  %  by  volume 
Ar  =  argon  =  0.92  % 

C02  =  carbon  dioxide  =  0.038  % 

H20  =  water  vapor  %  1  %  variable 


The  negative  sign  is  a  convention,  not  important  for  most  physical  purposes 
where  differences  in  energy  are  used. 

This  expression  can  be  used  to  determine  the  escape  velocity  from  the  gravity 
of  an  object  of  radius  R ;  just  equate  the  kinetic  energy  mV2 / 2  to  GMm/R  to  get  the 
velocity  V  of  escape,  or  Vesc ,  given  by: 


2  GM\  1/2 


which  is  independent  of  the  small  mass  m. 
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For  a  self-gravitating  sphere  of  uniform  mass  density,  rather  than  a  point  mass, 
the  gravitational  potential  energy  is  given  by  integrating  the  potential  energy  over 
all  parts  of  the  sphere,  resulting  in: 

(3. 

where  R  is  the  radius  of  the  sphere  and  the  mass,  M  is  given  by 


U  = 


-3  GM^ 
5  R 


M  =  ^nR3p,  (3.8) 

for  a  mass  density  p. 

The  gravitational  binding  energy  of  a  sphere  held  together  by  its  gravity  is 
3GM  / ( 5R ),  without  the  minus  sign;  it  is  the  amount  of  energy  required  to  pull  all 
of  the  material  apart  and  the  amount  of  energy  released,  mainly  by  heat,  during  its 
formation. 

Because  a  precise  value  of  the  gravitational  constant,  G,  is  only  known  to  three 
significant  figures,  the  orbits  of  the  planets  are  calculated  using  the  Gaussian 
constant  of  gravitation ,  denoted  by  the  symbol  k ,  first  proposed  by  the  German 
mathematician  Carl  Friedrich  Gauss  (1777-1855).  It  is  given  by  (Gauss  1809): 


471 2aE 

f|(m£  +  m0)’ 


where  aE  is  the  semi-major  axis  of  the  Earth’s  orbit  about  the  Sun,  the  orbital 
period  of  the  Earth,  PE,  is  one  year,  and  ME  and  M0  respectively  denote  the  mass 
of  the  Earth  and  the  Sun.  Here  aE  =  AU  is  the  astronomical  unit,  while  the  symbol 
ae  with  a  lowercase  subscript  e ,  is  the  equatorial  radius  of  the  Earth.  The 
aE  =  1.496  x  1011  m  and  M0  =  1.989  x  1030  kg.  By  using  this  constant  Gauss 
was  able  to  simplify  the  calculation  of  planetary  orbits;  he  had  previously  used  it  in 
his  1801  prediction  of  the  orbit  of  the  first  asteroid,  Ceres,  which  had  been  lost 
from  view. 

The  Canadian- American  astronomer  Simon  Newcomb  (1835-1909)  determined 
the  value  of  k  with  such  great  precision  (Newcomb  1895)  that  it  is  still  used  in 
computing  the  planetary  ephemerides  and  is  one  of  the  defining  astronomical 
constants.  This  value  is: 


k  =  0.0 1 720209895  (  AU)3/2Mq  1/2  (D) _  1 ,  (3.10) 

where  AU  denotes  the  astronomical  unit  and  the  mean  solar  day  D  =  86,400  s. 
Thus,  k  is  the  Newtonian  constant  of  gravitation  expressed  in  units  of  the 
astronomical  unit,  the  solar  mass,  and  the  day.  The  derived,  heliocentric,  or  Sun- 
centered,  gravitational  constant,  GM0,  is  given  by  GM0  =  (AU )3k2/D2  =  1.327 
244  x  1020  m3  s-2. 
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For  objects  near  the  Earth,  the  local  acceleration  of  gravity  g  can  be  considered 
to  be  approximately  constant  and  the  expression  for  the  gravitational  potential 
energy  relative  to  the  Earth’s  surface  becomes: 

U  =  mgh ,  (3.11) 

where  h  is  the  height  above  the  Earth’s  surface  and  g  is  the  surface  value  of  the 

_ o 

acceleration  of  gravity,  or  g  =  9.780  ms  at  the  Earth’s  equator.  The  local 
acceleration  of  gravity,  g,  determines  how  things  fall.  For  an  object  of  mass,  M, 
and  radius,  R,  we  have: 


GM 

8 


(3.12) 
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where  the  universal  constant  of  gravitation  G  =  6.674  x  10  m  kg  s  .  For 

_ o 

the  Earth,  the  local  acceleration  of  gravity  at  the  equator  is  gE  =  9.780  m  s  , 

_ o 

increasing  to  about  9.832  ms  ^  at  the  poles.  The  detailed  mathematical  expres¬ 
sions  for  the  variation  of  g  with  altitude  and  latitude  on  the  Earth  are  given  in 
Focus  3.2. 


Focus  3.2  The  Earth’s  gravity 

The  gravitational  acceleration  of  the  Earth,  g,  depends  on  the  distance 
from  the  planet’s  center.  The  value  gH  at  altitude  H  above  the  Earth’s 
mean  radius,  RE ,  is  given  by: 


gH 


gO^E 

We  +  W  ’ 


(3.13) 


_ O  /T 

where  the  standard  gravity  g0  =  9.8331  ms  at  RE  =  6.371  x  10°  m. 

As  we  previously  discussed  in  Chap.  1,  Focus  1.2,  the  surface  of  the  Earth 
is  not  perfectly  round,  being  extended  at  the  equator  and  squashed  at  the 
poles.  The  surface  radius  r  at  latitude  <fi  of  the  Earth  geoid  is: 


r  =  ae(\  -fsm24>),  (3-14) 

O 

where  the  equatorial  radius  ae  =  6.378  140  x  10  m,  and  the  flattening 
factor  /  is  given  by 


/ 


Clp 


U2  +  l-m  =  0.0033528  =  1/298.25642. 


(3.15) 


/T 

The  polar  radius  aP  =  6.356755  x  10  m;  the  dynamical  form  factor  J2 
for  the  Earth  is  given  by 


J2  =  0.0010826359. 


(3.16) 
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The  effective  gravity  of  the  Earth  is  reduced  by  its  rotation,  and  this 
reduction  is  greatest  at  the  equator.  The  ratio  m  of  centrifugal  acceleration  at 
the  equator  to  the  gravitational  acceleration  at  the  equator  is  given  by: 


m  = 


2  3 
CO  a 

e  =  0.00346 


GM  P 


(3.17) 


where  the  angular  velocity  of  the  Earth’s  rotation  co  =  2n  radians/ 
86,400  s  =  7.292  x  10-5  rad  s-1.  Therefore,  the  effective  gravity  of  the 
Earth  is  reduced  by  rotation,  but  at  most  by  about  3  %  and  near  the  equator. 

At  sea  level,  then  we  can  estimate  the  surface  gravitational  acceleration, 
g,  at  latitude,  0,  from  the  formula  derived  by  the  French  astronomer  and 
mathematician  Alexis  Claude  de  Clairault  (1713-1765).  Known  as  Claira¬ 
ult’ s  theorem ,  it  is  (Clairault  1743): 


g  =  ge 


1  +  5m  _/  ,sinz^ 


(3.18) 


where  the  surface  equatorial  acceleration  of  gravity  is  given  by: 


ge  = 


GMe  (  3  *  9 

1  +  -Ji  —  m  )  =  9.780327  ms“2 


a : 


(3.19) 


Expressed  numerically,  Clairault’ s  theorem  becomes: 

g  =  9.780327(1  +  O.OO53O24sin20  -  O.OOOOO58sin220)  ms-2,  (3.20) 

at  latitude  cj). 


3.3  Mass  of  the  Sun 

The  concept  of  universal  gravitation,  and  Newton’s  expression  for  the  gravitational 
force,  can  be  used  to  derive  Kepler’s  third  law  in  the  form  (see  Chap.  4): 

9  An2  n 

P2p  =  — 7 - -  a3P 

p  G(MP  +  Mq)  p 

where  the  constant  n  =  3.14159,  the  universal  gravitational  constant  G  «  6.674  x 
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10  m  kg  s  ,  the  aP  is  the  semi-major  axis  of  the  planet’s  orbital  ellipse  in 
meters,  PP  is  the  orbital  period  in  seconds,  and  MP  and  M©  respectively  denote  the 
mass  of  the  planet  and  the  mass  of  the  Sun  in  kilograms. 

Within  the  solar  system,  the  dominant  mass  is  that  of  the  Sun,  which  far 
surpasses  the  mass  of  any  other  object  there.  That  is  why  we  call  it  a  solar  system, 


3.3  Mass  of  the  Sun 


81 


governed  by  the  central  Sun.  The  sum  (MP  +  M0)  is  therefore,  to  the  first 
approximation,  a  constant  equal  to  the  Sun’s  mass,  M©,  regardless  of  the  planet 
under  consideration,  and  Kepler’s  third  law  becomes: 


p2  _ 

r  p  ~ 


4n" 


GMq 


Cl] 


Since  any  planet  mass,  Mp,  is  much  smaller  than  the  Sun’s  mass,  M, 


©! 


Mq  = 


4n2ap 
GPp  ’ 


(3.22) 
we  have: 

(3.23) 


where  and  n  =  3.141592. 

We  can  use  the  Earth’s  orbital  motion  at  a  mean  distance  of  one  astronomical 
unit,  or  1  AU  =  1.496  x  1011  m  and  its  orbital  period  of  one  year,  or 

rj 

1  year  =  3.1556926  x  10  s  to  infer  the  mass  of  the  Sun,  M©  from: 


G(lyr)2 


1.989  x  1030  kg. 


(3.24) 


It  is  the  benchmark  unit  for  specifying  the  mass  of  the  stars  and  galaxies. 

The  ratio  of  the  mass  of  the  Earth,  ME,  to  the  mass  of  the  Sun,  M©,  which  is 
independent  of  G,  is  given  by 


Me  =  1 

M©  332,946 ' 


(3.25) 


The  Sun  is  about  333,000  times  more  massive  than  the  Earth,  and  contains  more 
than  99.9  %  of  the  mass  of  the  entire  solar  system,  so  our  assumption  that  our 
planet’s  motion  is  controlled  by  the  massive  Sun  is  amply  justified. 

The  first  person  to  estimate  the  mass  of  the  Sun  was  Newton,  in  the  Principia , 
where  he  calculated  that  the  ratio  of  the  mass  of  the  Earth  to  the  mass  of  the  Sun  was 
1/28,700.  After  an  improved  value  for  the  distance  to  the  Sun  was  available,  he 
revised  his  result  to  obtain  a  ratio  of  1  / 169,282  in  the  third  edition  of  the  Principia. 
The  modern  value  is  of  1  /332,946  is  a  result  of  improved  determinations  of  the  AU. 


3.4  Tidal  Effects 
3.4.1  The  Ocean  Tides 

While  walking  along  the  beach  we  might  notice  that  the  waves  are  rising  farther 
and  farther  up  the  shore,  steadily  advancing  and  enlarging  the  bounds  of  the  sea. 
The  tide  is  flooding  the  beach.  But  several  hours  later  it  retreats  and  goes  down 
again.  We  say  that  the  tide  is  rising  and  falling,  while  the  sea  runs  in  and  out,  and 
Newton  showed  that  the  Moon’s  attraction  is  the  main  cause  of  the  ocean  tides. 
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Earth’s  Moon  attracts 

rotation  tidal  bulge 


Earth 

slows  down 


Fig.  3.4  Cause  of  the  Earth’s  ocean  tides  The  Moon’s  gravitational  attraction  causes  two  tidal 
bulges  in  the  Earth’s  ocean  water,  one  on  the  closest  side  to  the  Moon  and  one  on  the  farthest 
side.  The  Earth’s  rotation  twists  the  closest  bulge  ahead  of  the  Earth— Moon  line  (« dashed  line), 
which  produces  a  lag  between  the  time  the  Moon  is  directly  overhead  and  the  time  of  highest  tide. 
The  Moon  pulls  on  the  nearest  tidal  bulge,  slowing  down  the  Earth’s  rotation.  At  the  same  time, 
the  tidal  bulge  nearest  the  Moon  produces  a  force  that  tends  to  pull  the  Moon  ahead  in  its  orbit, 
causing  the  Moon  to  spiral  slowly  outward 


Table  3.2  Orbital  and  physical  properties  of  the  Moon 
Orbital  characteristics 

Dm  —  mean  distance  of  Moon  =  3.844  x  108  m  =  384,400  km 
VM  =  mean  orbital  speed  of  Moon  =  1.022  x  103  m  s-1  =  1.022  km  s-1 
VeScE  =  escape  velocity  of  Earth  at  Moon’s  distance  =  2 GME/Dmoon)1^2  =  1.44  x  103  m  s  1 
dDM/dt  =  rate  of  increase  of  Moon’s  distance  =  0.0382  ±  0.0007  m  year-1 
PM  =  orbital  period  of  Moon  =  27.3216  days  =  sidereal  month  =  fixed  star  to  fixed  star  (time 
from  new  moon  to  new  moon  is  29.530589  days  =  synodic  month) 
e  =  eccentricity  =  0.0549 
Dm  (1  +  e)  =  apogee  =  405,410  km 
Dm  (1  —  e)  =  perigee  =  362,570  km 
Physical  characteristics 
Age  =  4.6  x  109  year 
Mm  =  mass  =  7.348  x  1 022  kg 
Mq/Mm  =  inverse  mass  =  27,068,708.7 
Me/Mm  =  Earth-Moon  mass  ratio  =  81.30056 
/i  =  Mm/Me  —  Moon-Earth  mass  ratio  =  0.0123 
Rm  —  mean  radius  =  1.737  x  106  m 

pM  =  mean  mass  density  =  3 Mm/ (AtiR]^)  =  3346.4  kg  m-3 

_ o 

gM  =  equatorial  surface  gravity  =  1.622  m  s 

VescM  =  escape  velocity  from  surface  =  (2 GMm/Rm)1^2  ^ j  2.38  x  103  m  s_1  =  2.38  km  s_1 
Pr  —  sidereal  rotation  period  =  27.3216  days  (synchronous) 

A  =  albedo  =  0.136 

T  =  mean  equatorial  surface  temperature  =  220  K  (Temperature  range  at  lunar  equator 
100-390  K) 


3.4 
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Fig.  3.5  Earth’s  spring  and  neap  ocean  tides  The  height  of  the  tides  and  the  phase  of  the  Moon 
depend  on  the  relative  positions  of  the  Earth,  Moon,  and  Sun.  When  the  tide-raising  forces  of  the 
Sun  and  the  Moon  are  in  the  same  direction,  they  reinforce  one  another,  making  the  highest  high 
tides  and  the  lowest  low  tides.  These  spring  tides  (top)  occur  at  either  new  or  full  Moon.  The 
range  of  tides  is  least  when  the  Moon  is  at  first  or  third  quarter  and  the  tide-raising  forces  of  the 
Sun  and  the  Moon  are  at  right  angles  to  one  another.  The  tidal  forces  are  then  in  opposition, 
producing  the  lowest  high  tides  and  the  highest  low  tides,  or  the  neap  tides  (bottom).  In  this 
diagram,  the  height  of  the  tides  is  greatly  exaggerated  in  comparison  to  the  size  of  the  Earth 


Because  the  Moon’s  gravitational  force  decreases  with  increasing  distance,  the 
Moon  pulls  hardest  on  the  ocean  facing  it  and  least  on  the  opposite  ocean,  whereas 
the  Earth  between  is  pulled  with  an  intermediate  force.  In  this  way,  the  Moon’s 
gravity  draws  out  the  ocean  into  the  shape  of  an  egg  and  creates  two  high  tides.  As 
the  Earth’s  rotation  carries  the  continents  past  the  two  tidal  humps,  we  experience 
the  rise  and  fall  of  water,  the  ebb  and  flow  of  the  tides,  twice  every  day  (Fig.  3.4). 

To  understand  the  Moon’s  tidal  producing  force,  we  need  to  know  its  mass  and 
distance,  and  they  are  given  with  other  physical  information  about  the  Moon  in 
Table  3.2. 

The  Moon  creates  most  of  the  ocean  waves,  but  the  Sun  also  contributes  to  the 
size  and  rhythm  of  the  waves.  Although  more  massive  than  the  Moon,  our  Sun  also 
is  much  farther  away;  as  a  result  the  tide-producing  force  of  the  Moon  is  about 


84 


3  Gravity 


2.2  times  that  of  the  Sun.  Near  both  full  and  new  Moons,  the  tide-raising  forces  of 
the  Sun  and  the  Moon  are  in  the  same  direction,  producing  the  spring  tide 
(Fig.  3.5).  They  reinforce  one  another’s  tides  and  produce  high  tides  that  can  be  a 
few  times  higher  than  normal.  Two  weeks  later,  the  two  tidal  forces  are  in 
opposition  and  interfere  with  one  another,  and  the  range  of  these  neap  tides  is  then 
lower  than  any  others. 


Example:  Moon  tides  and  Sun  tides 

We  can  estimate  the  tide-producing  capability  of  the  Moon  from  the  dif¬ 
ference  A Fm  of  its  gravitational  force,  FM,  on  the  near  and  far  sides  of  the 
Earth  from: 


GMmMr 
{Dm  —  Re)2 


GMmMr 
{Dm  +  Re)2 


4  GMmMeRe 
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assuming  DM  Re,  where  the  Newtonian  gravitational  constant  G  =  6.674 
x  10“ 11  m3  kg-1  s-2,  the  mass  of  the  Moon  Mm  =  7.348  x  1022  kg,  the 
mass  of  the  Earth  ME  =  5.974  x  10"  kg,  the  mean  radius  of  the  Earth 

/r 

Re  =  6.371  x  1 0  m,  and  the  average  distance  of  the  Moon  from  the  center  of 
the  Earth  is  DM  =  3.844  x  108  m. 

The  ratio  of  the  Moon’s  tide-producing  force  A FM  to  the  Sun’s  tide- 
producing  force  AFq  is: 


AFm=Mm  /Dq\3=2  lg 

A F 0  Mq  \  Dm  ) 


(3.27) 
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where  the  mass  of  the  moon  is  Mm  =  7.348  x  10"“  kg,  the  Sun’s  mass 
Mq  =  1.989  x  10  kg,  the  mean  distance  of  the  moon  is 

o 

Dm  =  3.844  x  10  m,  and  the  mean  Sun-Earth  distance  is  Dq  =  1 
AU  =  1.496  x  1011  m.  In  other  words,  the  Moon  tides  are  about  twice  the 

Sun  tides  when  they  are  in  the  same  direction  in  the  sky. 

The  Moon  is  now  moving  away  from  the  Earth  at  a  rate  of 

0.038  m  year-1,  so  its  tide-producing  force  is  gradually  weakening,  while 
the  Sun  remains  at  the  same  mean  distance  from  the  Earth  with  an 
unchanging  tidal  effect.  The  Moon  tides  will  be  equal  to  the  Sun  tides  when 

o 

the  Moon  has  moved  out  to  a  distance  of  DM  &  4.98  x  10  m,  or  an 

o 

additional  1.14  x  10  m  beyond  its  current  distance.  At  the  current  rate  of 
recession,  that  will  happen  about  3  billion  years  from  now. 
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3.4.2  Tidal  Locking  into  Synchronous  Rotation 

A  planet’s  gravitational  force  pulls  any  natural  satellite,  or  moon,  into  a  slightly 
elongated  shape  along  an  axis  pointing  toward  the  planet.  That  is,  a  planet’s 
gravitation  produces  two  tidal  bulges  in  the  solid  body  of  the  satellite;  one  on  the 
closest  side  to  the  planet  and  one  on  the  satellite’s  farthest  side.  If  the  satellite’s 
rotation  twists  the  closest  bulge  ahead  of  the  planet- satellite  line,  the  planet  pulls 
back  on  it.  As  a  result,  one  hemisphere  of  the  satellite  always  faces  the  planet,  and 
the  satellite  takes  as  long  to  rotate  as  it  does  to  orbit  the  planet.  Then  we  say  that 
the  satellite  has  been  tidally  locked  into  synchronous  rotation  with  the  planet. 

The  Moon  is  in  synchronous  rotation  about  the  Earth,  so  the  Moon’s  rotation 
period  is  the  same  as  the  time  it  takes  for  the  Moon  to  orbit  the  Earth,  which  is 
27.32  Earth  days.  As  a  result,  from  Earth  we  always  see  the  Moon’s  near  side  and 
never  its  far  side.  Only  when  a  spacecraft  passes  beyond  the  Moon  and  looks  back 
at  it  can  we  see  the  far  side  of  the  Moon.  Most  of  the  major  moons,  or  large  natural 
satellites,  in  the  solar  system  have  synchronous  rotation  with  their  planet. 

If  the  mass  of  two  orbiting  bodies  is  comparable  and  their  physical  separation  is 
relatively  small,  they  both  may  be  tidally  locked  to  one  another.  This  is  the  case  for 
Pluto  and  its  nearby  large  moon,  Charon.  Mutual  tidal  locking  also  occurs  for  close 
binary  stars. 


3.4.3  The  Days  are  Getting  Longer 

As  the  Earth  rotates,  the  bulge  raised  on  its  surface  by  the  Moon’s  gravity  is 
always  a  little  ahead  of  the  Moon  rather  than  directly  under  it.  The  Moon  pulls 
back  on  the  bulge  and,  in  the  process,  slows  down  the  planet. 

When  the  ocean  tides  flood  and  ebb,  they  create  eddies  in  the  water,  producing 
friction  on  the  ocean  floor,  which  heats  the  water  ever  so  slightly  and  dissipates 
energy  at  the  expense  of  the  Earth’s  rotation.  The  tides  therefore  act  as  brakes  on 
the  spinning  Earth,  slowing  it  by  friction.  As  a  result  of  this  tidal  friction ,  the 
rotation  of  the  Earth  is  slowing  down  and  the  day  is  becoming  longer  at  a  rate  of 
2  ms,  or  0.002  s,  per  century  (Focus  3.3).  In  other  words,  the  days  are  getting 
longer  at  the  rate  of  1  s  every  50,000  years,  and  tomorrow  will  be  60  billionths  of 
a  second  longer  than  today. 

Focus  3.3  Tidal  friction  slows  the  rotation  of  the  Earth 

In  most  of  the  ocean,  the  tidal  currents  are  confined  to  the  top  of  the  deep 
sea,  never  reaching  its  bottom.  Most  of  the  tidal  energy  is  therefore  dissi¬ 
pated  in  shallow  seas  near  land,  where  the  turbulent  tidal  water  reaches  the 
ocean  bottom,  at  depths  of  100  m  or  less. 
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When  the  tide  moves  toward  a  beach  at  velocity,  V,  the  frictional  energy, 
AE,  dissipated  by  tidal  currents  on  the  sea  bottom  per  unit  time,  At ,  per  unit 
area,  A  A,  is 


AE 

AtAA 


ypV 3  «  2  Js  1  m  2, 


(3.28) 


where  the  density  of  sea  water  is  p  ^  1,000  kg  m”3,  a  typical  velocity  V  « 
1ms”1,  and  the  stress  on  the  sea  bottom  is  ypV2  with  an  empirical  drag 
coefficient  y  ~  0.002  for  wind  stress  on  the  ground  and  a  river’s  stress  on  its 
bed  as  well  as  tidal  currents  in  the  bottom  of  the  sea. 

In  1919,  Sir  G.  I.  Taylor  (1886-1975),  a  British  expert  on  turbulence  in  air 
and  water,  used  this  equation  to  obtain  AE/ At  «  101 1 J  s”1  for  the  Irish  Sea 
alone,  and  in  the  following  year  Sir  Harold  Jeffreys  (1891-1989)  estimated 
that  the  total  rate  of  energy  loss  by  tidal  friction  in  the  shallow  seas  sur¬ 
rounding  Europe,  Asia,  and  North  and  South  America  is 
AE/ At  «  1013  J  s'1  (Taylor  1919;  Jeffreys  1920).  This  is  comparable  to  the 
estimate  obtained  by  considering  the  flux  of  energy  convected  into  the 
shallow  seas  by  tidal  currents.  Jeffreys  (1975),  Munk  and  MacDonald  (1960, 
1975),  and  Lambeck  (1978,  1980)  have  discussed  both  the  flux  and  bottom 
friction  methods. 

The  lost  energy  comes  from  the  Earth’s  rotational  energy,  which  is  equal  to 
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where  the  mass  of  the  Earth  ME  =  5.9736  x  1024  kg,  Vrot  =  2nRE/PE,  the 

z' 

constant  n  «  3.14159,  the  mean  radius  of  the  Earth  RE  =  6.371  x  10  m, 
and  the  rotation  period  of  the  Earth  PE—  24  h  =  8.640  x  104  s.  For  a 
period  change  AP  in  time  interval  At ,  the  loss  in  rotational  energy  is: 
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Setting  this  equal  to  the  AE  and  collecting  terms,  we  obtain: 


APi 


AE 


6.7  x  10  13  ss  1  «  0.002  s  per  century,  (3.31) 


At  4n2MERE  At 
for  AE/ At  =  1013  J  s'1  and  one  century  100  years  =  3.156  x  109  s. 


The  long  term  increase  in  the  length  of  the  day  of  roughly  2  ms  per  century  has 
been  documented  over  the  past  2,700  years  from  historical  records  of  occultations 
of  stars  by  the  Moon  and  solar  and  lunar  eclipses  (Stephenson  and  Morrison  1984). 
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Paleontologists  have  made  indirect  historical  measurements  of  the  Earth’s  rotation 
through  studies  of  fossil  corals.  The  growth  patterns  of  these  corals  consist  of 
annual  bands  and  fine  daily  ridges,  produced  by  the  effects  of  seasonal  and  daily 
changes  of  water  temperature  on  the  growth  rate.  The  days  were  shorter  in  the  past, 
but  the  year  was  the  same,  so  the  number  of  days  per  year  increases  as  we  go  back 
in  time.  Ancient  corals  confirm  this,  and  they  show  a  greater  number  of  daily 
ridges  per  annual  band  than  modern  corals.  Careful  counting  reveals  that  the  day 
was  only  22  h  long  when  we  look  back  400  million  years.  Studies  of  daily  grown 
increments  have  been  extended  to  fossilized  algae  called  stromatolites,  which 
indicate  that  the  day  may  have  been  only  10  h  long  about  2  billion  years  ago 
(Wells  1963;  Mazzullo  1971). 


3.4.4  The  Moon  is  Moving  Away  from  the  Earth 

The  Moon  pulls  the  Earth’s  oceans,  and  the  oceans  pull  back,  in  accord  with 
Newton’s  third  law  that  every  action  has  an  equal  and  opposite  reaction.  The  net 
effect  is  to  swing  the  Moon  outward  into  a  more  distant  orbit.  This  is  because  the 
tidal  bulge  on  the  side  facing  the  Moon  is  displaced  ahead  of  the  Moon  and  this 
bulge  pulls  the  Moon  forward. 

As  the  Earth  slows  down,  the  angular  momentum  it  loses  is  transferred  to  the 
Moon,  which  speeds  up  in  its  orbit  around  us.  It  is  not  hard  to  see  that  this  will  swing 
the  Moon  away  from  the  Earth  if  we  look  at  the  key  equations  (Focus  3.4).  When  we 
do  the  arithmetic,  we  find  that  the  change  of  0.002  s  per  century  in  the  length  of  a  day 
implies  an  outward  motion  of  the  Moon,  amounting  to  about  0.04  m  yr-1.  Small  as  it 
is,  this  value  is  just  measurable  with  the  laser  light  sent  to  comer  reflectors,  called 
corner  cubes,  placed  on  the  Moon  by  the  Apollo  astronauts.  Pulses  of  light  are  sent 
from  the  Earth  to  the  tiny  reflecting  mirrors  on  the  Moon,  and  the  time  for  the  light  to 
travel  to  the  Moon  and  return  to  Earth  is  measured.  The  distance  to  the  Moon  is  the 
product  of  this  round-trip  light  travel  time  and  the  speed  of  light.  The  lunar  laser 
ranging  data  indicate  that  the  Moon  is  moving  away  from  the  Earth  at  a  rate  of 
0.0382  zb  0.0007  m  year-1  (Dickey  et  al.  1994). 

Focus  3.4  Conservation  of  angular  momentum  in  the  Earth-Moon 
system 

According  to  one  of  the  unbreakable  conservation  laws,  the  angular 
momentum,  or  the  product  of  mass,  M,  velocity,  V,  and  radius,  R ,  is 
unchanged  in  a  closed  system,  which  is  not  subject  to  an  outside  force.  Thus: 

Conservation  of  Angular  Momentum  =  M  x  V  x  R  =  constant. 

This  means  that  the  angular  momentum  that  the  Earth  loses  in  slowing  down 
will  be  transferred  to  the  Moon.  For  the  Earth,  the  angular  momentum  is 
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rotational,  with  V  =  2n  Re/Pe »  where  PE  is  the  Earth’s  rotation  period  of 
one  day  and  the  subscript  E  denotes  the  Earth.  So,  we  have: 


Earth’s  Rotational  Angular  Momentum  =  2nMERE/PE. 

Since  the  length  of  the  Earth’s  day  is  increasing  by  the  amount  APE  as 
time  goes  on,  the  Earth’s  rotational  angular  momentum  is  decreasing  by  the 
amount: 


_  _  _  _  r 

Decrease  in  Rotational  Angular  Momentum  =  2nMER\ 
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The  loss  has  to  be  made  up  by  an  equivalent  gain  somewhere  else  in  order 
to  conserve  angular  momentum.  This  is  done  by  an  increase  in  the  Moon’s 
orbital  angular  momentum,  which  is  given  by: 

Moon’s  Orbital  Angular  Momentum  =  Mm  x  Vm  x  Dm ,  (3.33) 

where  Mm  is  the  mass  of  the  Moon,  DM  is  the  distance  between  the  Earth  and 
the  Moon,  and  the  orbital  velocity  of  the  Moon  can  be  estimated 
from  Kepler’s  third  law  with  the  mass  of  the  Earth  and  the  Moon’s  dis¬ 
tance  from  it,  or  by: 


VM  =  2nDMlPM  «  ( GMe/Dm)11 2,  (3.34) 

_  11  O  _  1  _ O 

where  G  =  6.674xl0  mkg  s  is  the  universal  gravitational  con¬ 
stant.  Substituting  this  velocity  expression  into  the  angular  momentum 
relation,  we  obtain: 


Moon  ’  s  Orbital  Angular  Momentum  =  MmDm  ( GM E  / DM ) 1  .  (3.35) 


Since  the  mass  of  the  Moon  and  the  mass  of  the  Earth  do  not  change,  the 
Moon’s  distance  has  to  increase  by  an  amount  ADM  to  provide  an  increase  in 
the  angular  momentum. 


Increase  in  Orbital  Angular  Momentum  =  Mm  A DM 


GM  E 

Dm 


(3.36) 


Setting  the  loss  in  rotational  angular  momentum  equal  to  the  gain  in 
orbital  angular  momentum  and  collecting  terms  we  obtain 
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«  1.8  x  10  9m  s  1  «  0.057  m  yr  1 , 


(3.37) 
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where  ME  =  5.974  x  1024  kg,  RE  =  6.371  x  106  m,  APE  «  6.7  x 
1(T13  s  s_1,  Mm  =  7.348  x  1022  kg,  PE  =  24  h  =  8.640  x  104  s, 
G  =  6.674  x  1CT11  m3  kg-1  s~2,DM  =  3.844  x  108  mand  1  yr  =  3.156  x 

7  i 

10'  s.  This  is  the  approximate  amount,  of  0.04  m  yr  ,  measured  by  sending 
laser  pulses  from  the  Earth  to  the  corner  reflectors  left  on  the  Moon  by  astronauts 
(Dickey  etal.  1994). 


Example:  How  close  was  the  Moon  to  the  Earth  in  their  youth? 

o 

The  mean  distance  of  the  Moon  is  now  Dmoon  =  3.844  x  10  m,  and  laser 
signals  to  the  Moon’s  corner  reflector  mirrors  indicate  that  the  Moon  is 
moving  away  from  the  Earth  at  the  rate  of  dDmoon/dt  =  0.0382  myr-1.  The 
age  of  the  Earth  is  4.6  billion  year.  Assuming  that  the  Moon  was  formed 
when  the  Earth  was  in  its  youth,  and  that  the  Moon  has  always  been  moving 
away  from  the  Earth  at  the  presently  observed  rate,  then  the  Moon  has 
moved  over  a  distance  of  ADmoon  =  0.0382  x  4.6  x  10  =  1.76  x  10  m, 
or  about  half  its  present  distance.  If  the  Moon  moved  away  from  the  Earth  at 
a  faster  rate  when  it  was  young,  then  it  could  have  been  formed  by  a  collision 
with  the  newly  formed  Earth. 


Will  the  Moon’s  outward  motion  carry  it  away  from  the  Earth  altogether?  Only 
the  intrusion  of  a  massive  third  body  could  achieve  that.  What  will  ultimately 
happen  is  the  following.  The  combination  of  the  slowing  Earth  and  the  receding 
Moon  means  that  the  Earth’s  day  will  eventually  catch  up  with  the  length  of  the 
month.  When  the  day  and  the  month  are  equal,  the  Moon-induced  tides  will  cease 
moving;  from  then  on  the  oceans  will  rise  and  fall  much  more  gently  under  the 
influence  of  the  Sun.  The  Moon  will  hang  motionless  in  the  sky,  and  will  be  visible 
from  only  one  hemisphere.  At  that  stage  the  recession  of  the  Moon  will  stop. 

Then,  billions  of  years  from  now,  the  Sun’s  tidal  action  will  take  over;  slowing 
the  Earth’s  rotation  even  further,  until  the  day  becomes  longer  than  the  month.  At 
this  point,  angular  momentum  will  be  drawn  from  the  Moon,  and  it  will  begin 
approaching  the  Earth,  heading  on  a  course  of  self-destruction  until  it  is  finally  torn 
apart  by  the  tidal  action  of  the  Earth.  Perhaps  it  will  form  a  ring  around  our  planet. 
In  any  case,  it  will  probably  end  its  years  where  it  apparently  began  -  close  to  the 
Earth.  By  this  time,  however,  the  brighter  Sun  will  have  boiled  the  oceans  away, 
and  the  Earth  will  have  become  a  dry  and  barren  place. 
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3.4.5  A  Planet’s  Differential  Gravitational  Attraction 
Accounts  for  Planetary  Rings 

One  might  expect  the  numerous  particles  of  a  planetary  ring  to  have  accumulated 
long  ago  into  larger  satellites.  But  the  interesting  feature  of  these  rings  -  and  a  clue 
to  their  origin  -  is  that  they  do  not  coexist  with  large  moons.  Planetary  rings  are 
also  usually  closer  to  the  planets  than  their  large  satellites. 

The  rings  normally  are  confined  to  an  inner  zone  where  the  planet’s  tidal  forces 
would  stretch  a  large  moon  until  it  fractured  and  split  apart,  while  also  preventing 
small  bodies  from  coalescing  to  form  a  larger  moon  (Fig.  3.6).  The  outer  radius  of 
this  zone  in  which  rings  are  found  is  called  the  Roche  limit  after  the  French 

s  _ 

mathematician  Edouard  A.  Roche  (1820-1883),  who  described  it  (Roche  1849, 
1850,  1851). 


Fig.  3.6  Roche  limit  A  large  satellite  (top)  that  moves  well  within  a  planet’s  Roche  limit 

(dashed  curve)  will  be  tom  apart  by  the  tidal  force  of  the  planet’s  gravity.  This  was  first 

/ 

investigated  in  1849  by  the  French  mathematician  Edouard  A.  Roche  (1820-1883).  The  side  of 
the  satellite  closer  to  the  planet  feels  a  stronger  gravitational  pull  than  the  side  farther  away,  and 
this  difference  works  against  the  self-gravitation  that  holds  the  body  together.  A  small  solid 
satellite  (bottom)  can  resist  tidal  disruption  because  it  has  significant  internal  cohesion  in  addition 
to  self-gravitation 
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For  a  rigid  satellite  with  the  same  mass  density  as  its  planet,  the  Roche  limiting 
distance  from  the  center  of  the  planet  is  1.26  times  the  planet’s  radius  (Focus  3.5). 
The  Roche  limit  for  a  solid  body  is  1.38  times  that  radius  (Aggarwal  and  Oberbeck 
1974),  and  Roche’ s  initial  calculation,  for  fluid  objects,  was  2.446  times  the  planetary 
radius.  Anywhere  inside  this  distance,  a  large  moon  can  no  longer  remain  intact,  but 
instead  gets  torn  apart  by  planetary  tides.  Nevertheless,  because  of  their  material 
strength  and  cohesion,  small  moons  less  than  100  km  across  can  exist  inside  the 
Roche  limit  without  being  tidally  disrupted,  just  as  the  ring  particles  can. 

The  sharp  edges  of  planetary  rings  can  be  formed  by  small  satellites,  which  can 
pass  within  the  Roche  limit  with  enough  internal  cohesion  to  withstand  the  planet’s 
differential  gravitational  forces.  These  smaller  bodies  can  also  confine  ring  par¬ 
ticles  within  narrow  boundaries.  Goldreich  and  Tremaine  (1982)  have  reviewed 
the  dynamics  of  planetary  rings. 


Focus  3.5  The  Roche  limit 


To  visualize  the  significance  of  the  Roche  limit,  consider  two  particles  of 
equal  mass,  m,  separated  by  a  distance,  R,  and  located  at  a  distance,  D ,  from 
a  planet  of  mass,  MP.  The  gravitational  pull  of  the  planet  on  the  particle 
closest  to  it  will  be  greater  than  the  pull  on  the  more  distant  particle.  If  the 
difference  in  pull  on  the  near  and  far  particles,  the  tidal  force,  exceeds  the 
mutual  gravitational  attraction  between  the  two  particles,  they  cannot  remain 
close  to  each  other  and  will  disperse.  The  outcome  of  the  tug-of-war  between 
the  tidal  force  and  the  mutual  attraction  is  primarily  decided  by  the  particles’ 
distance  from  the  planet.  At  distances  less  than  the  Roche  limit,  DRoche, 
particles  are  pulled  apart,  and  this  prevents  the  accumulation  of  larger 
moons.  The  tidal  force  will  also  tear  apart  any  large  moon-like  object  that 
ventures  within  the  Roche  limit. 

The  gravitational  force,  FP ,  of  a  planet  of  mass,  MP,  on  a  smaller  mass,  m, 
whose  center  is  located  at  a  distance,  D,  from  the  center  of  the  planet  is: 


GMPm 

Fp  = 


(3.38) 


where  G  is  the  universal  constant  of  gravitation.  The  planet  will  pull  harder 
on  the  side  of  the  object  that  is  closer  to  it  and  less  hard  on  the  side  that  is 
further  away.  The  difference,  AF,  between  the  force  felt  by  one  side  and  the 
center  of  the  mass,  m,  is 


A  F  = 
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(P  ~  Rm)  (P  +  R-m) 
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(3.39) 


where  the  factor  of  V2  arises  because  the  center  of  mass  is  located  midway 
between  the  two  forces,  and  Rm  is  the  radius  of  the  smaller  object.  If  it 
approaches  the  planet,  D  becomes  smaller  and  this  tidal  disruptive  force  will 
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increase,  eventually  pulling  the  object  apart  at  a  critical  distance  DRoche  from 
the  center  of  the  planet. 

The  gravitational  binding  force,  FB ,  which  attracts  the  opposite  sides  of 

r\ 

the  object  and  holds  it  together,  is  Gm/Rfn  per  unit  mass,  or  for  the  total  mass 


Gm2 


(3.40) 


The  Roche  limit  is  reached  when  the  tidal  disruptive  force,  AF ,  equals  the 
binding  force,  FB ,  and  when  we  set  these  two  expression  equal  and  collect 
terms  we  obtain: 


(3.41) 


This  result  is  expressed  in  terms  of  Rm ,  the  radius  of  the  small  object,  but  by 
using  the  mass  densities  pP  and  pm  for  the  planet  and  small  mass  respec¬ 
tively,  with  Mp  =  47 zppRp/3  and  m  =  4npmR3m/3,  we  obtain  the  Roche  limit 
in  terms  of  the  planet  radius,  RP : 


D 


Roche 


(3.42) 


which  for  a  planet  and  smaller  object  of  the  same  mass  density  becomes 
D Roche  =  1-26  RP  (Jeans  1917). 

The  calculation  by  Roche  used  liquid  objects  whose  shapes  can  distort 
continuously,  and  his  result  is  (Roche  1849,  1850,  1851): 


D 


Roche 


=  2.446  ( — ) 

\PmJ 


1/3 


RP  ~  2.446 Rp. 


(3.43) 


For  a  satellite  with  no  internal  strength  and  whose  density  is  the  same  as  the 
planet,  the  Roche  limit  is  2.446  times  the  planetary  radius,  or  about  175,000  km 
for  Jupiter,  147,000  km  for  Saturn,  62,000  km  for  Uranus,  and  59,000  km  for 
Neptune.  Jupiter’s  insubstantial  dusty  ring,  the  magnificent  ice  particles  of 
Saturn’s  rings,  and  the  dark  boulders  in  the  narrow  rings  encircling  Uranus  and 
Neptune  all  lie  within  the  Roche  limit  for  the  relevant  planet.  The  Earth’s  Roche 
limit  is  15,584  km,  and  if  our  Moon  ever  ventured  within  this  distance  from  the 
Earth’s  center,  it  would  be  pulled  apart  by  tidal  forces  and  our  planet  would  have 
rings.  Nevertheless,  the  Moon  is  much  farther  away  from  the  Earth,  at  a  mean 
distance  of  approximately  384,400  km. 
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Perihelion 

advances 


Planet 


Fig.  3.7  Precession  of  Mercury’s  perihelion  Instead  of  always  tracing  out  the  same  ellipse,  the 
orbit  of  Mercury  pivots  around  the  focus  occupied  by  the  Sun.  The  point  of  closest  approach  to 
the  Sun,  the  perihelion,  is  slowly  rotating  ahead  of  the  point  predicted  by  Newton’s  theory  of 
gravitation.  This  at  first  was  explained  by  the  gravitational  tug  of  an  unknown  planet  called 
Vulcan  that  was  supposed  to  revolve  about  the  Sun  inside  Mercury’s  orbit,  but  we  now  know  that 
Vulcan  does  not  exist.  Albert  Einstein  (1879-1955)  explained  Mercury’s  anomalous  motion  in 
1915  by  inventing  a  new  theory  of  gravity  in  which  the  Sun’s  curvature  of  nearby  space  makes 
the  planet  move  in  a  slowly  revolving  ellipse 


3.5  What  Causes  Gravity? 

We  cannot  see  the  force  of  gravity  and  Newton  did  not  know  how  it  was  exerted. 
Albert  Einstein  (1879-1955)  subsequently  explained  it  by  supposing  that  a  mas¬ 
sive  body  like  a  star  bends  nearby  space.  This  bending  of  space  is  the  cause  of  the 
star’s  gravity.  However,  such  effects  are  noticeable  only  in  extreme  conditions 
near  a  very  massive  cosmic  object  like  a  star,  and  the  differences  between 
Newton’s  and  Einstein’s  theories  of  gravity  are  indistinguishable  in  everyday  life. 

One  result  of  the  Sun’s  curvature  of  nearby  space  is  that  planetary  orbits  are  not 
exactly  elliptical.  This  solved  a  perplexing  problem  with  the  motion  of  Mercury, 
the  nearest  planet  to  the  Sun.  Instead  of  returning  to  its  starting  point  to  form  a 
closed  ellipse  in  one  orbital  period,  Mercury  moves  slightly  ahead  in  a  winding 
path  that  can  be  described  as  a  rotating  ellipse  (Fig.  3.7).  As  a  result,  the  point  of 
Mercury’s  closest  approach  to  the  Sun,  the  perihelion,  advances  by  a  small 
amount  -  only  43  s  of  arc  per  century,  beyond  the  location  predicted  using 
Newton’s  theory. 

This  anomalous  twist  in  Mercury’s  motion  was  discovered  in  1854  and  rec¬ 
ognized  as  an  unexplained  problem  in  1859  by  the  French  astronomer  Urbain  Jean 
Joseph  LeVerrier  (1811-1877).  It  was  not  explained  for  more  than  a  half-century, 
when  Einstein  (1915)  proposed  that  the  planet  is  directed  along  a  path  in  curved 
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Fig.  3.8  Space  curvature  A 

massive  object  creates  a 
curved  indentation  on  the 
“flat”  space  described  by 
Euclidean  geometry,  which 
applies  in  our  everyday  life 
on  the  Earth,  where  we  do  not 
directly  encounter 
astronomical  amounts  of 
matter.  Notice  that  the 
amount  of  space  curvature  is 
greatest  in  the  regions  near  a 
cosmic  object  like  a  star, 
whereas  farther  away,  the 
effect  is  lessened 


space  (Fig.  3.8),  making  the  planet  overshoot  its  expected  location.  Observations 
of  Mercury  trace  out  the  invisible  curvature.  Roseveare  (1982)  has  discussed 
Mercury’s  perihelion  from  Le  Verrier  to  Einstein;  also  see  Nobili  and  Will  (1986). 

In  the  very  paper  that  explained  Mercury’s  anomalous  motion,  Einstein  showed 
that  the  curvature  of  space  near  the  Sun  also  deflects  the  path  of  light  from  other 
stars  (Fig.  3.9).  The  otherwise  straight  trajectory  of  starlight  is  bent  by  the  Sun’s 
gravity.  The  effect  can  be  measured  during  a  solar  eclipse  when  stars  pass  behind 
the  darkened  Sun. 

Newton  previously  had  speculated  that  massive  bodies  might  bend  nearby  light 
rays  under  the  assumption  that  light  has  mass,  and  the  German  astronomer  Johann 
George  von  Soldner  (1726-1833)  estimated  the  amount  of  light  bending  produced 
by  the  Sun  using  Newtonian  gravity  (Soldner  1801).  In  1911,  Einstein  confirmed 
Soldner’ s  result;  however,  when  he  took  space  curvature  into  account,  the 
expected  deflection  was  doubled  (Einstein  1915). 

The  successful  measurement  of  this  deflection  of  starlight  during  the  total  solar 
eclipse  on  29  May  1919  (Dyson  et  al.  1920),  made  Einstein  famous,  practically 
overnight.  The  initial  measurements  were  not  exact,  and  amounted  to  just  a  factor 
of  two;  nevertheless,  the  Sun’s  curvature  of  nearby  space  has  now  been  measured 
with  increasingly  greater  precision  for  nearly  a  century,  confirming  Einstein’s 
prediction  to  two  parts  in  a  hundred  thousand,  or  to  the  fifth  decimal  place.  His 
General  Theory  of  Relativity  (Einstein  1916),  which  replaces  the  Sun’s  gravity 
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Fig.  3.9  Sun  bending  starlight  As  the  Earth  orbits  the  Sun,  an  observer’s  line  of  sight  to  a  star 
or  other  cosmic  object  can  pass  near  to  the  Sun  or  far  from  it.  The  massive  Sun  curves  nearby 
space,  bending  the  trajectory  of  starlight  passing  near  to  it,  and  this  produces  an  apparent  change 
in  a  star’s  position.  The  amount  of  bending  and  change  in  stellar  position  that  were  predicted  by 
Einstein’s  General  Theory  of  Relativity  was  first  confirmed  in  1929  during  a  total  eclipse  of  the 
Sun 


with  geometry,  has  been  verified  by  so  many  solar  experiments  that  it  now  is 
widely  accepted. 

Natario  (2011)  provides  a  good  description  of  the  General  Theory  of  Relativity 
without  calculus,  whereas  Will  (1993)  has  reviewed  observational  verification  of 
the  theory. 

The  solar  curvature  of  space  has  been  measured  with  increasingly  greater 
precision  for  nearly  a  century,  confirming  Einstein’s  prediction.  According  to 
Einstein’s  theory,  a  light  ray  passing  a  minimum  distance  R0  from  the  center  of  a 
star  of  mass  M  will  be  deflected  by  the  angle 


0  =  2(l+y) 


GM 

RiT 


radians, 


(3.44) 
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where  G  =  6.674  x  10  m  kg  s  ~  is  the  universal  gravitational  constant,  the 
speed  of  light  is  c  =  2.9979  x  108  m  s”1,  and  y  =  1.000000000,  or  exactly  one, 
in  Einstein’s  theory  of  gravitation.  Newton’s  theory  of  gravitation  implies  y  =  0. 

on 

For  the  Sun,  with  the  mass  M0  =  1.989  x  10‘  kg,  the  light  bending  is: 


1  +  y 


seconds  of  arc, 


6=  1.75 


2 


(3.45) 
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where  the  Sun’s  radius  is  RQ  =  6.955  x  10  m  and  we  have  used  1 
radian  =  2.06265  x  105  ",  and  "  denotes  second  of  arc. 

In  one  test,  radio  astronomers  have  combined  observations  of  distant  quasars 
from  telescopes  located  on  opposite  sides  of  the  world,  turning  the  Earth  into  a 
gigantic  interferometer  that  accurately  measures  the  positions  of  radio  sources  as 
our  line  of  sight  to  them  nears  the  Sun  (Fomalont  and  Sramek  1976).  The  change 
in  position  determines  the  amount  of  space  curvature,  as  with  the  solar  eclipse 
results  for  stars,  but  with  much  greater  accuracy  and  without  a  total  solar  eclipse, 
since  the  Sun  is  a  relatively  weak  interferometric  radio  source.  Such  Very  Long 
Baseline  Interferometry,  abbreviated  VLBI,  has  confirmed  the  predicted  deflection 
of  radio  waves  by  the  Sun  to  the  0.0003,  or  0.03  %,  level  (Robertson  et  al.  1991; 
Shapiro  et  al.  2004;  Fomalont  et  al.  2009). 

Another  test  of  Einstein’s  theory  of  gravitation  measures  the  time  required  for  a 
radar  signal  to  make  a  round  trip  between  the  Earth  and  a  planet  (Shapiro  et  al. 
1971),  or  the  time  for  a  radio  signal  to  travel  from  a  spacecraft  home  (Shapiro  et  al. 
1971;  Reasenberg  et  al.  1979).  When  the  line  of  sight  passes  near  the  Sun,  the 
radio  waves  travel  along  a  curved  path  and  take  slightly  longer  to  return  to  Earth. 
The  measurements  require  extremely  precise  clocks,  for  the  extra  time  delay 
caused  by  the  Sun’s  curvature  of  nearby  space  amounts  to  only  one  ten- thousandth 
of  a  second.  Radio  links  with  the  Cassini  spacecraft,  for  example,  indicate  that 
y  =  1.00000  ±  2.1  x  10  5,  or  precisely  unity  with  an  accuracy  of  2  parts  in 
100,000  (Bertotti  et  al.  2003). 

A  modern,  extended  test  of  Einstein’s  theory  involves  the  measurement  of  the 
periastron  advance  and  light  bending  of  a  binary  pulsar  that  has  much  stronger 
gravitational  fields  than  found  in  our  solar  system,  with  additional  indication  of 
gravitational  radiation  from  observations  of  its  orbital  decay  (Focus  3.6). 

Focus  3.6  Testing  relativity  with  the  binary  pulsar 

The  American  radio  astronomers  Russell  A.  Hulse  (1950-  )  and  Joseph  H. 
Taylor,  Jr.  (1941-  )  discovered  the  now  famous  binary  pulsar  PSR 
B1913  +  16  in  1974,  during  a  deliberate  search  for  new  pulsars  using  the 
latest  computer  technology  with  the  305  m  radio  antenna  at  Arecibo,  Puerto 
Rico  (Hulse  and  Taylor  1975;  Hulse  1994).  Their  mini-computer  was  pro¬ 
gramed  to  scan  a  range  of  possible  pulsar  periodicities,  pulse  durations,  and 
frequency  dispersions,  registering  a  signal  whenever  a  pulsar  passed  through 
the  telescope  beam.  After  14  months  at  Arecibo,  and  the  discovery  of  40 
pulsars,  Hulse,  a  graduate  student  at  the  University  of  Massachusetts  at 
Amherst,  found  the  enigmatic  PSR  B 1913  +  16,  a  pulsar  in  a  binary  system. 
PSR  denotes  pulsar,  B  designates  a  binary  companion,  and  1913  +  16 
describes  the  position  of  the  pulsar  in  the  sky. 

The  pulsar  rotates  on  its  axis  17  times  a  second,  so  the  pulse  repetition 
period  is  about  0.059  s.  Moreover,  the  period  changes,  by  about  0.00003  s, 
and  the  period  change  is  itself  cyclical,  increasing  and  decreasing,  rising  and 
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falling  every  7.75  h.  This  meant  that  the  pulsar  was  in  orbital  motion  at  this 
longer  period,  with  the  pulses  being  compressed  together  when  the  pulsar 
approached  the  Earth  and  pulled  apart  when  moving  away.  A  pulsar  is  also  a 
neutron  star,  which  has  a  mass  of  about  the  mass  of  the  Sun  collapsed  to  a 
radius  of  about  10  km,  and  as  it  turned  out,  the  radio  pulsar  PSR 
B1913  +  16  was  in  rapid  orbit  with  another  neutron  star  that  did  not  emit 
detectable  radio  pulses,  perhaps  because  its  radiation  beam  is  not  aimed  at 
the  Earth. 

Hulse  completed  his  degree,  and  left  the  field  of  radio  astronomy  just  a 
few  years  later.  So,  precise  timing  of  the  radio  pulses  from  PSR  B1913  +  16 
were  continued  by  Hulse’ s  advisor,  Joe  Taylor,  and  his  other  graduate  stu¬ 
dents,  permitting  a  determination  of  the  orbital  parameters  of  the  system,  as 
well  as  measurements  of  the  mass  of  the  pulsar  and  its  silent  companion.  The 
minimum  separation  of  the  two  neutron  stars  at  periastron  is  about  1.1  RQ 
and  the  maximum  separation  at  apastron  is  4.8  RQ  where  the  Sun’s  radius 

o 

Rq  =  6.955  x  10  m.  The  periastron  shift  was  enormous,  at  4.2266  degrees 
per  year,  compared  with  Mercury’s  43  s  of  arc  per  century,  and  this  per¬ 
mitted  the  astronomers  to  infer  a  mass  of  M  =  2.828  M0  solar  masses  for 

OA 

the  binary  system,  where  the  Sun’s  mass  M0  =  1.989  x  10  kg.  The 
individual  masses  could  be  determined  from  another  relativistic  effect,  and 
they  weighed  in  at  1.441  M0  and  1.387  M0,  as  would  be  expected  for  two 
neutron  stars  (Taylor  and  Weisberg  1989). 

More  importantly,  after  four  years  of  measurements  and  the  analysis  of 
about  5  million  pulses,  Taylor  and  his  colleagues  found  that  the  orbital 
period  was  slowly  becoming  shorter,  implying  a  slow  shrinking  of  the 
average  orbit  size.  The  rate  of  decrease  of  the  orbital  period  was  76.5  mil¬ 
lionths  of  a  second  per  year  or  7.65  x  10-5  s  year-1,  indicating  that  the  two 
stars  are  drawing  closer  and  closer  to  each  other,  approaching  at  about  2.5  m 
per  year.  This  rate  of  orbital  decay  is  just  the  change  expected  if  their  orbital 
energy  is  being  radiated  away  in  the  form  of  gravitational  waves,  which  had 
never  been  seen  before  (Taylor  1992,  1994;  Taylor  and  Weisberg  1982; 
Weisberg  and  Taylor  2005). 

Einstein  (1916)  predicted  such  gravitational  radiation ,  showing  that  any 
accelerating  mass  would  emit  it  -  as  ripples  in  the  curvature  of  space-time. 
Gravity  waves  travel  at  the  speed  of  light,  as  electromagnetic  radiation  does. 
But  while  electromagnetic  waves  move  through  space,  gravity  waves  are  an 
undulation  of  space  itself. 

Gravity  waves  are  produced  whenever  a  mass  moves,  but  they  are 
exceedingly  faint  when  generated  and  become  diluted  as  they  propagate  into 
the  increasing  volume  of  space.  They  are  so  weak,  and  their  interaction  with 
matter  so  feeble,  that  Einstein  himself  questioned  whether  they  would  ever 
be  detected.  The  gravitational  radiation  loss  of  the  orbital  energy  of  PSR 
B1913  +  16  nevertheless  exactly  matches  the  amount  predicted  by 
Einstein’s  theory,  providing  clear  and  strong  evidence  for  the  existence  of 
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gravitational  radiation  -  for  which  Hulse  and  Taylor  received  the  1993 
Nobel  Prize  in  Physics. 

Neither  Einstein  nor  anyone  else  ever  predicted  that  two  neutron  stars 
would  be  found  that  emit  gravitational  waves  detected  by  timing  the  pulsar 
emission  of  one  of  them,  and  Taylor,  Hulse,  and  their  colleagues  did  not  set 
out  to  find  a  binary  neutron  star,  let  alone  detect  gravitational  waves.  It  was 
another  one  of  those  serendipitous  discoveries  that  make  astronomy  so 
wonderfully  unexpected  and  surprising. 


A  double  radio  pulsar  has  also  been  used  to  test  special  and  general  relativity. 
Designated  PSR  B0737-3039,  the  system  also  has  strong  gravitational  fields,  a 
rapid  perihelion  precession  of  16.90  degrees  per  year,  and  an  orbital  decay 
attributed  to  gravitational  radiation  (Lyne  2004;  Kramer  et  al.  2006;  Breton  et  al. 
2008).  Unlike  the  binary  pulsar  PSR  B1913  +  16,  this  new  system  contains  two 
pulsars,  attributed  to  two  rotating  neutron  stars  that  emit  radio  pulses.  They  orbit 
each  other  at  a  speed  of  300  km  s-1  and  complete  one  orbit  every  2.4  h. 

Joss  and  Rappaport  (1984)  have  reviewed  neutron  stars  in  interacting  binary 
systems.  Backer  and  Hellings  (1986)  and  Taylor  (1994)  have  provided  reviews  of 
pulsar  timing  and  general  relativity.  Kramer  and  Stairs  (2008)  have  summarized 
knowledge  of  the  double  pulsar,  and  Hughes  (2009)  has  discussed  gravitational 
waves  from  merging  compact  binaries. 


Chapter  4 

Cosmic  Motion 


4.1  Motion  Opposes  Gravity 

4.1.1  Everything  Moves 

All  that  exists,  from  atoms  to  planets  and  stars  to  galaxies,  is  always  moving.  This 
motion  keeps  cosmic  objects  suspended  in  space. 

Galileo  Galilei  (1564-1642)  imagined  an  ideal  world  in  which  there  are  no 
external  forces  acting  on  an  object,  and  supposed  that  such  an  object  will  keep  on 
moving  at  constant  speed  (Galilei  1632).  Isaac  Newton  (1643-1727)  extended  the 
idea  in  his  first  law  of  motion,  which  states  that  every  object  continues  in  its  state 
of  rest,  or  of  uniform  velocity  in  a  straight  line,  as  long  as  no  net  force  acts  on  it 
(Newton  1687).  In  other  words,  a  moving  object  continues  in  motion  with  the  same 
speed  and  in  the  same  direction  unless  an  external  force  is  applied  to  it. 

The  most  significant  outside  force  in  the  universe  is  that  of  gravity,  and  it  is 
motion  that  opposes  gravitational  attraction.  Motion  and  gravity  together  shape  the 
universe,  giving  it  form  and  structure.  So  everything  moves,  and  the  way  cosmic 
objects  move  is  governed  by  the  rules  of  motion  and  gravitation. 


4.1.2  Escape  Speed 

The  energy  of  motion  is  known  as  kinetic  energy,  and  for  a  mass  m  moving  at 

r\ 

speed  V,  the  kinetic  energy  is  mV  / 2,  so  the  faster  something  moves  the  more 
kinetic  energy  it  has.  If  an  object  moves  fast  enough,  and  its  kinetic  energy 
becomes  large  enough,  it  can  overcome  the  gravitational  forces  acting  upon  it  and 
move  out  of  their  sphere  of  influence. 

The  minimum  speed  required  to  counteract  and  overcome  the  gravitational 
force  on  an  object  is  known  as  the  escape  speed,  since  the  object  can  then  escape 
into  surrounding  space.  The  escape  speed,  denoted  Vesc ,  needed  for  a  small  body  of 
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mass,  m,  to  break  away  from  the  gravitational  pull  of  a  larger  mass,  M,  is  obtained 
by  equating  the  kinetic  energy  of  the  small  mass  to  the  gravitational  potential 
energy  holding  it  in  (Sect.  3.2).  That  is: 


Kinetic  energy  =  -mVt 

2' 


esc 


GMm 

D 


Gravitational  potential  energy, 


or 


2GM 
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2 

\2  GM 

D 

—  1 

V  D  ’ 

where  D  is  the  distance  between  the  centers  of  the  larger  and  smaller  mass,  and  the 
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Newtonian  constant  of  gravitation  G  =  6.674  x  10"11  nr3  kg"1  s  . 

Although  the  escape  speed  is  often  called  the  escape  velocity,  the  escape  speed 
does  not  depend  on  the  direction  of  motion,  whereas  strictly  speaking  a  velocity 
includes  both  the  speed  and  direction.  No  matter  what  the  direction  of  travel  is,  an 
object  moving  at  the  escape  speed  can  break  away  from  another  object’s  gravi¬ 
tational  force,  provided,  of  course,  that  it  isn’t  directed  into  the  surface  of  the 
larger  mass. 

The  escape  speed  is  independent  of  the  small  mass  m ,  and  it  is  dependent  only 
on  the  distance  D  of  the  small  mass  and  the  value  of  the  big  mass  M.  At  larger 
distances  the  escape  speed  becomes  smaller  because  the  strength  of  the  gravita¬ 
tional  force  exerted  by  the  big  mass  is  less. 

Any  object,  from  an  atom  to  a  rocket,  must  move  faster  than  the  escape  speed  at 
a  planet’s  surface  if  it  is  to  move  off  into  surrounding  space.  The  reason  why  there 
is  no  hydrogen  in  the  Earth’s  atmosphere,  for  example,  is  that  at  large  altitudes,  up 
in  the  ionosphere,  the  temperature  is  so  high  that  the  light-weight  hydrogen  atoms 
move  at  speeds  greater  than  our  planet’s  escape  speed,  and  evaporate  off  into 
space.  To  obtain  the  surface  escape  speed  of  an  object  at  its  radius  R ,  just  let 
D  =  R  in  the  expression  for  escape  speed. 

Example:  Escape  speed  of  the  Earth,  Moon,  and  Sun 

What  is  the  minimum  speed  needed  for  a  rocket  to  escape  from  the  gravi¬ 
tational  pull  of  the  Earth,  VescE ,  and  from  the  Moon,  VescMl  We  can  use  the 
expression  Vesc  =  (2 GM/R)1^2,  where  the  Newtonian  gravitational  constant 
G  =  6.614  x  10"11  m3  kg"1  s“2,  the  M  is  the  mass  of  the  object,  R  is  its 
radius,  and  the  Vi  superscript  denotes  the  square  root.  For  the  Earth,  the  mass 
Me  =  5.9736  x  1024  kg  and  the  mean  radius  RE  =  6.371  x  106  m,  to  give 
VescE  ~  1-12  x  104  m  s  \  For  the  Moon,  we  have  a  mass  of 
Mmoon  =  7.348  x  1022  kg  and  a  mean  radius  of  Rmoon  =  1.737  x  106  m,  to 

o  _ 1 

give  VesCM  =2.38x10  ms  .  That  explains  why  a  lunar  lander  requires 
much  less  rocket  propulsion  to  leave  the  Moon  to  rejoin  its  orbiting  com¬ 
mand  module  than  either  spacecraft  needs  to  leave  the  Earth.  If  the  command 
module  was  orbiting  the  Moon  in  synchronous  orbit,  to  remain  always  above 
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the  same  point  on  the  Moon,  it  would  have  to  have  an  orbital  period  equal  to 

/r 

the  Moon’s  rotation  period  of  27.3  Earth  days  =  2. 36  x  10  s,  and  the 
orbital  distance,  D,  of  the  command  module  would  be  D  =  [ GMmoon 

8.8  x  10  m,  which  is  a  substantial  portion  of  the  mean 

o 

distance  between  the  Earth  and  the  Moon,  of  3.844  x  10  m,  resulting  in 
poor  visibility  of  the  lunar  surface  from  the  command  module.  It  has  to  move 
much  faster  around  the  Moon  and  closer  to  it.  It  is  the  Sun  that  dominates  the 
mass  of  the  solar  system,  with  a  mass  of  M0  =  1.989  x  10  kg.  The  escape 
velocity  at  the  visible  disk  of  the  Sun  is  VescQ  =  (2 GMQ/RQ)m  «  6.18  x 
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10  m  s  ,  where  the  Sun’s  radius  RQ  =  6.955  x  10  m.  The  most  distant 
comets  reside  in  a  remote  reservoir,  known  as  the  Oort  cloud,  located  at  a 
distance  of  D  &  100,000  AU  =  105  AU.  The  orbital  period  of  such  a  comet 
will  be  (105)3/2  «  32  million  years,  and  its  orbital  speed  will  be  about 
V  =  2nD/P  «  94  m  s_1,  a  very  slow  and  leisurely  motion  by  cosmic 
standards;  note  1  AU  =  1.496  x  1011  m  and  1  year  =  3.156  x  107  s.  This 
makes  sense,  for  the  orbital  speed  will  fall  as  the  inverse  square  root  of  the 
distance.  The  Earth  orbits  the  Sun  at  an  average  orbital  speed  of  about 
30  km  s"1  =  30,000  ms-1. 


Using  the  mass  and  radius  of  the  Earth,  the  Sun,  and  the  Earth’s  Moon,  we 
obtain  respective  escape  speeds  of  11.2,  618  and  2.38  km  s_1.  If  you  want  to  send 
a  rocket  off  into  interplanetary  space,  it  has  to  move  faster  than  the  escape  speed  at 
the  Earth’s  surface,  about  11.2  km  s-1  =  1.12  x  104  m  s-1.  Owing  to  its  larger 
mass,  the  escape  velocity  of  the  Sun  is  about  54  times  larger  than  that  of  the  Earth 
in  spite  of  the  Sun’s  larger  radius.  At  the  visible  disk  of  the  Sun,  we  have 
V esc®  —  6. 1 17  x  105  m  s  '.  The  escape  speed  from  the  surface  of  the  Moon  is 
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just  2.38  x  10  ms  ,  which  explains  why  the  relatively  small  Lunar  Module 
spacecraft  could  land  on  the  Moon  and  blast  off  it  with  relatively  low  rocket 
propulsion,  returning  to  its  larger,  mother  spacecraft,  the  Lunar  Command  Module , 
that  was  orbiting  the  Moon  and  was  launched  from  the  Earth  with  considerably 
greater  rocket  thrust.  The  Moon’s  low  escape  speed  also  helps  explain  why  it  has 
no  atmosphere  to  speak  of. 

The  mass,  radius,  and  escape  speeds  of  representative  planets  and  stars  are 
given  in  Table  4.1. 


4.2  Orbital  Motion 

A  planet  would  continue  going  the  way  it  started,  moving  along  a  straight  line,  if  it 
were  not  for  the  Sun’s  gravitational  force  that  deflects  the  planet  into  a  curved 
solar  orbit.  Therefore,  it  is  the  Sun’s  gravitational  attraction  that  keeps  the  planets 
forever  moving  along  their  orbital  paths.  But  why  doesn’t  the  enormous  solar 
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Table  4.1  Mass,  radius  and  escape  speed  of  some  cosmic  objects 

Object  Mass  (kg)  Radius  (m)  Escape  speed  (km  s-1) 


Ceres,  largest  asteroid 

1.17 

X 

1021 

3.8 

X 

105 

0.64 

Earth’s  moon 

7.348 

X 

1022 

1.737 

X 

106 

2.38 

Earth 

5.9736 

X 

1024 

6.371 

X 

106 

11.2 

Jupiter 

1.90 

X 

1027 

7.15 

X 

107 

59.5 

Sun 

1.989 

X 

io30 

6.955 

X 

108 

618 

Sirius  B,  white  dwarf  star 

2 

X 

io30 

1 

X 

107 

5,200 

Neutron  star 

2 

X 

IO30 

1 

X 

104 

2  x  10 

gravity  pull  all  of  the  planets  into  the  Sun?  Motion  holds  the  planets  in  their  orbits, 
opposing  the  relentless  pull  of  the  Sun’s  gravity  and  keeping  the  planets  from 
falling  into  the  Sun. 

Each  planet  is  moving  in  a  direction  perpendicular  to  an  imaginary  line  con¬ 
necting  it  to  the  Sun,  at  exactly  the  speed  required  to  overcome  the  Sun’s  gravi¬ 
tational  pull,  maintaining  an  equilibrium  between  motion  and  gravitation  that 
keeps  the  planets  in  perpetual  motion. 

For  the  planetary  orbits,  or  any  other  orbit  of  small  eccentricity,  the  length  of 
the  orbit  is  close  to  a  circular  one.  The  mean  orbital  speed,  VOP ,  of  a  planet  in 
circular  motion  about  the  Sun  at  a  distance  DP ,  is: 


Vop 


2nDP 


•> 


where  n  =  3.14159,  the  circumference  of  a  circle  with  radius  DP  is  2nDP  and  PP  is 
the  orbital  period.  The  mean  orbital  velocity  of  the  Earth  around  the  Sun  is,  for 
example,  is  29.8  km  s-1  =  2.98  x  104  m  s_1,  where  the  mean  Earth-Sun  distance 
is  1  AU  =  1.496  x  1011  m  and  the  orbital  period  is  1  year  =  3.1557  x 
107  s. 

When  there  are  two  objects  orbiting  a  common  center  of  mass,  and  one  of  them 
has  a  very  small  mass  when  compared  to  the  mass  of  the  other  one,  as  is  the  case  of 
planets  orbiting  the  massive  Sun,  the  orbital  speed  depends  only  on  the  dominant 
mass  and  the  distance  of  the  orbiting  object  from  it.  For  the  planets,  the  orbital 

on 

speed  depends  only  on  the  Sun’s  mass  M0  =  1.989  x  10  kg  and  the  distance  of 
the  planet,  or: 


Vop 


GMe  I  1/2=  Vesc© 

Dp  _  y/2 


which  follows  from  Kepler’s  third  law  assuming  a  circular  orbit  or  one  of  small 
eccentricity.  It  tells  us  that  the  more  distant  planets  move  at  a  slower  speed.  The 
orbital  speed  is  independent  of  the  planet’s  mass,  which  is  why  the  planetary 
realm,  known  as  the  solar  system,  is  dominated  by  the  Sun. 

This  equation  also  indicates  that  the  escape  speed  is  -J2  times  larger  than  the 
orbital  speed  of  a  body.  The  -J2  factor  is  a  very  small  number,  just  1.414,  so  the 
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orbital  speed  is  very  close  to  the  escape  speed  at  the  relevant  distance.  Of  course,  a 
planet  couldn’t  be  moving  just  as  fast  as  the  escape  speed,  or  any  faster  than  that, 
for  it  would  then  escape  from  the  solar  system,  moving  off  into  interstellar  space; 
and  if  the  planet  moved  any  slower  than  its  orbital  speed,  it  would  be  pulled  into 
the  Sun  and  consumed  by  it. 

We  can  square  both  sides  of  the  previous  two  equations  and  collect  terms  to 
obtain 


471" 


GMq 


D\ 


which  is  the  Newtonian  expression  for  Kepler’s  third  law  (Sect.  3.3). 

Example:  How  fast  does  the  Moon  move  around  the  Earth? 

The  Moon  orbits  our  planet  at  a  mean  distance  from  the  Earth  of 
Dm  =  3.844  x  108  m  with  an  orbital  period  PM  of  27.32  days,  where 
1  day  =  86,400  s.  This  is  the  Moon’s  sidereal  orbital  period,  from  fixed  star 
to  fixed  star.  For  a  circular  orbit,  the  Moon’s  mean  orbital  speed  about  the 

Q  1 

Earth  would  be  V0m  =  27 iDm/Pm  =  1-02  x  10  m  s“  .  We  can  compare 
this  orbital  speed  to  the  escape  speed,  VesCE,  from  the  Earth’s  gravity  at  the 
Moon’s  mean  distance,  VesCE  =  ( 2GMe/Dm)1/ 2  =  1.44  x  103  m  s_1,  where 

O  A 

the  mass  of  the  Earth  is  ME  =  5.9736  x  ICE  kg  and  the  Newtonian  grav- 
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itational  constant  is  G  =  6.674  x  10  m  kg  s  .  The  mean  orbital 
speed  of  the  Moon  VQ m  is  just  equal  to  V^c^/V^  at  Moon’s  distance, 
which  shows  that  the  Moon  is  bound  to  the  Earth  by  its  gravitational  pull, 
diminished  by  the  distance  to  the  Moon,  and  that  the  Moon  is  perpetually 
falling  toward  the  Earth  while  moving  around  it. 


Calculations  of  the  speed  of  an  orbiting  object  also  apply  to  communications, 
military,  and  weather  satellites,  which  might  be  launched  into  geosynchronous 
orbits  with  an  orbital  period  equal  to  the  Earth’s  rotation  period. 


Example:  Geosynchronous  orbits 

In  a  geosynchronous  orbit,  a  satellite’s  orbital  period  equals  the  Earth’s 
rotation  period,  so  the  satellite  stays  in  the  same  location  above  the  planet’s 
surface.  The  distance,  DGS ,  of  this  kind  of  satellite  above  the  center  of  the 
Earth  can  be  obtained  from  a  rearrangement  of  Kepler’s  third  law: 


GMeP 


21  1/3 
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where  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  ,  the  mass 
of  the  Earth  ME  =  5.9736  x  10"  kg,  the  rotation  period  of  the  Earth  is 
Pr  =  24  h  =  86,400  s,  and  the  constant  n  =  3.14159.  Substitution  into  this 
formula  gives  DGS  «  4.22  x  10  m.  The  more  exact  value  of 

,-j 

Dgs  =  4.2164  x  10  m  is  obtained  using  the  Earth’s  sidereal  rotation  per¬ 
iod  of  Pr  =  86,164  s.  For  a  geosynchronous  satellite  orbiting  the  Earth’s 
equator,  the  altitude  H  of  the  satellite  above  sea  level  will  be  H  =  DGS  — 

'-j 

ae  =  3.5786  x  10  m,  where  the  equatorial  radius  of  the  Earth  is 
ae  =  6.3781  x  106  m. 


When  two  orbiting  objects  have  comparable  mass,  as  is  the  case  for  some 
binary  stars,  then  the  mean  orbital  velocity,  VGi  of  an  object  of  mass  Mx  orbiting 
another  mass  M2  at  a  distance  a  is  given  by: 


GMj 

{Mi  +  M2)a 


Here  a  is  the  separation  of  the  two  objects.  If  rx  and  r2  denote  their  respective 
distances  from  a  common  center  of  mass,  and  we  assume  circular  orbits,  then 
rxMx  —  r2M2  with  a  =  rx  +  r2  =  rx  (Mx  +  M2)/M2. 

Astronomers  record  the  spectral  lines  of  a  star,  and  look  for  periodic  variations 
in  the  observed  line-of-sight  velocity,  Vobs\  =  Vo i  sin  i,  where  i  is  the  inclination 
angle  between  the  perpendicular  to  the  orbital  plane  and  the  line  of  sight.  The 
observed  period  of  variations  in  the  detected  radial  velocity  along  the  line  of  sight 
is  the  orbital  period,  P,  given  by  Kepler’s  third  law: 


2  4n2a3 

~  G(MX  +M2)’ 


We  can  use  these  equations  to  obtain  an  expression  for  the  mass,  M2 : 


pv  3 

M\  sin3  i  =  — ^-{ Mi+M2  f .  (4.9) 

2tlG 

When  the  mass  of  object  1  greatly  exceed  the  mass  of  object  2,  as  is  the  case 
when  looking  for  previously  unseen  exoplanets  orbiting  a  nearby  star,  or  when 
Mx  >  M2, 


/  p  \  E3 

Mi  sin  j  V0bsyM2J\ 


(4.10) 


and  the  mass  of  the  star,  Mx,  can  be  inferred  from  other  observations  (Sect.  10.1). 
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4.3  The  Moving  Stars 
4.3.1  Are  the  Stars  Moving? 

Each  night  the  stars  rise,  move  slowly  across  the  dark  sky,  and  then  disappear  from 
view;  but  this  slow  apparent  movement  of  the  stars  is  not  due  to  the  motions  of  the 
stars  themselves.  It  is  caused  by  the  rotating  Earth,  which  spins  beneath  the 
celestial  sphere.  Despite  eons  of  stellar  observations  in  antiquity,  there  was  no 
evidence  that  any  of  these  stars  were  moving. 

Yet,  if  the  stars  were  motionless,  their  mutual  gravitation  eventually  would  pull 
them  together  into  a  single  mass.  Without  motion,  there  would  be  nothing  to  keep 
the  stars  apart,  and  they  could  not  be  suspended  in  space.  So  there  is  no  star  that  is 
completely  at  rest,  and  the  stars  must  be  moving  ever  so  slightly  from  their 
apparent  places  in  the  night  sky. 

Moreover,  the  speeds  of  the  moving  stars  are  not  modest.  Observations  indicate 
that  the  stars  are  moving  at  speeds  of  about  10  km  s-1  relative  to  their  stellar 
neighbors.  The  Sun,  for  example,  is  currently  traveling  at  a  speed  of  about 
20  km  s-1,  or  20,000  ms-1,  relative  to  other  nearby  stars.  This  is  about  1,000 
times  faster  than  a  car  moves  on  a  highway. 

Stars  also  move  together  at  larger  speeds  in  directed  motions.  Both  the  Sun  and 
nearby  stars,  for  example,  are  whirling  about  the  remote  center  of  the  Milky  Way 
at  a  speed  of  220  km  s_1.  If  these  stars  traveled  at  faster  speeds,  they  would  move 
off  into  space,  even  out  of  the  Milky  Way;  if  they  were  moving  at  slower  speeds, 
they  would  be  pulled  by  gravitation  into  the  center  of  the  Milky  Way.  But  because 
they  are  so  far  away,  the  stars  seem  to  be  moving  slowly  through  space,  only 
gradually  changing  their  apparent  separation  and  grouping. 


4.3.2  Components  of  Stellar  Velocity 

Stars  seem  to  be  moving  here,  there  and  everywhere,  so  it  is  not  easy  to  figure  out 
where  they  are  going.  However,  a  star’s  motion  manifests  in  two  ways,  depending 
on  the  method  used  to  observe  it,  and  these  two  components  of  velocity  can  be 
combined  to  give  the  direction  of  motion  (Fig.  4.1).  The  “sideways”  velocity 
component  is  directed  perpendicular  or  transverse  to  the  line  of  sight,  with  a  speed 
designated  by  Vj_,  where  the  subscript  _L  denotes  perpendicular.  The  other  com¬ 
ponent,  a  radial  velocity  with  a  speed  denoted  by  Vr,  is  the  velocity  moving  toward 
or  away  from  us  along  the  line  of  sight  to  a  star.  When  these  two  velocity  com¬ 
ponents  are  known,  we  can  determine  the  speed  and  direction  of  a  star  in  three 
dimensions.  The  Pythagorean  theorem  gives  the  magnitude  of  the  star’s  space 
velocity,  its  true  speed  in  space,  Vs,  given  by  Vf  =  V2r  +  Vj_. 

A  star’s  motion  across  the  line  of  sight  produces  an  angular  change  in  position, 
called  proper  motion,  which  depends  on  both  the  star’s  distance  and  the 


106 


4  Cosmic  Motion 


Fig.  4.1  A  star  moves  The  space  velocity,  Vs  of  a  star  relative  to  an  observer  can  be  resolved 
into  two  mutually  perpendicular  components:  (1)  the  radial  velocity,  Vr,  directed  along  the  line  of 
sight;  and  (2)  the  tangential  velocity,  Vj_,  which  is  perpendicular  or  transverse  to  the  line  of  sight. 
From  the  Pythagorean  theorem  V$  =  Vr  +  V±.  Over  a  given  interval  of  time,  shown  here  as  one 
year,  the  star  will  move  through  a  proper  motion  angle  /i,  which  depends  on  V±  and  the  star’s 
distance,  D,  from  the  observer.  In  this  figure,  the  proper  motion  fi  =  V±/D  is  exaggerated  greatly 
by  more  than  10,000  for  even  the  closest  star.  At  a  distance  of  only  6  light-years,  Barnard’s  star 
has  the  largest  known  proper  motion  of  10.3  s  of  arc  per  year 


perpendicular  speed  V±.  The  radial  velocity  is  observed  through  the  Doppler 
effect,  which  measures  how  the  star’ s  spectral  lines  appear  to  shorten  or  lengthen 
in  wavelength  depending  on  the  relative  velocity  of  the  star  and  the  observer,  and 
whether  the  motion  is  toward  or  away  from  the  observer.  When  a  star  is  moving 
directly  away,  then  there  is  no  perpendicular  component  to  its  motion,  and  if  the 
star  is  moving  directly  across  the  line  of  sight,  then  the  radial  component  of  the 
star’s  motion  is  reduced  to  zero. 

It  is  difficult  to  judge  a  star’s  speed  if  it  is  headed  straight  toward  or  away  from 
us,  just  as  it  is  difficult  to  determine  how  fast  a  distant  car  is  moving  on  a  highway. 
However,  if  a  star  crosses  at  right  angles  to  our  line  of  sight,  we  could  see  a  change 
in  its  position.  To  detect  that  change,  astronomers  needed  to  look  at  the  nearest 
stars  where  the  angular  change  in  position  is  greatest. 

Given  enough  time,  the  displacement  of  a  nearby  star’s  celestial  position  can  be 
detected.  The  English  astronomer  Edmond  Halley  (1656-1742)  first  noticed  the 
change  when  he  compared  the  positions  of  extremely  bright  stars,  such  as  Sirius 
and  Arcturus,  with  those  measured  by  the  Greek  astronomer  Hipparchus  around 
150  BC  and  recorded  in  Ptolemy’s  reproduction  of  Hipparchus’  catalogue. 
Halley’s  comparison  indicated  that  at  least  three  stars  had  changed  position  and 
moved  (Halley  1717). 

So  it  took  more  than  1,800  years  before  anyone  noticed  that  a  star  could  move. 
Nowadays,  with  vastly  improved  technology  and  observations  from  spacecraft,  the 
motions  of  many  tens  of  thousands  of  stars  are  known  with  great  accuracy. 
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4.3.3  Proper  Motion 

The  stellar  motion  that  Halley  detected  is  an  angular  change  in  a  star’s  position 
over  time,  due  to  its  velocity  transverse  or  perpendicular  to  the  line  of  sight.  The 
angular  rate  of  change  is  known  as  proper  motion ,  which  is  intrinsic  to  the  star  and 
belongs  to  it,  in  contrast  to  any  improper  motion  that  might  be  caused  by  the 
Earth’s  movement  in  space. 

Proper  motion  is  not  a  velocity;  it  is  the  angular  rate  at  which  a  star  moves 
across  the  sky  over  years  or  centuries,  and  it  does  not  by  itself  determine  the  speed 
of  motion.  To  convert  a  star’s  proper  motion  into  a  velocity  or  speed,  we  must 
know  the  star’s  distance,  and  in  Halley’s  time  no  one  knew  the  distance  of  any  star 
other  than  the  Sun. 

For  a  star  located  at  distance  D ,  the  proper  motion  p  is: 

p  =  V±/D  rads-1,  (4.11) 

where  1  rad  =  2.06265  x  10  and  denotes  seconds  of  arc.  Proper  motion  is 
designated  by  the  Greek  letter  mu,  or  the  symbol  p.  The  speed  perpendicular  to  the 
line  of  sight,  Vj_,  is  known  as  the  transverse  velocity.  If  V±  is  given  in  units  of 
km  s'1  and  D  is  in  units  of  parsecs,  we  have: 

Annual  Proper  Motion  =  p  =  0.211  V±/D  "  yr“ 1 ,  (4.12) 

and 

V±  =  4.74/iDkms"1,  (4-13) 

^7 

where  1  yr  =  3.156  x  10  s,  and  the  p  in  this  case  is  called  the  annual  proper 
motion.  One  parsec  is  abbreviated  1  pc,  and  1  pc  =  3.08568  x  10  m  is  the 
typical  separation  between  adjacent  stars.  The  coefficient  0.211  comes  from 
2.06265  x  3.156/3.08568  in  the  various  conversion  factors,  and  4.74  = 
1.0/0.211. 

4.3.4  Radial  Velocity 

The  other  component  of  a  star’s  velocity,  the  radial  velocity  directed  along  the  line 
of  sight,  can  be  measured  using  the  Doppler  shift  of  a  spectral  feature  in  the  star’ s 
radiation.  Such  a  feature,  called  a  spectral  line,  has  a  definite,  well-known 

wavelength(Sect.  6.1). 

Just  as  a  source  of  sound  can  vary  in  pitch  or  wavelength,  depending  on  its 
motion,  the  wavelength  of  electromagnetic  radiation  shifts  when  the  emitting  source 
moves  with  respect  to  the  observer.  Such  a  shift  is  named  after  the  Austrian  scientist, 
mathematician,  and  schoolteacher  Christiaan  Doppler  (1803-1853)  who  discovered 
it  more  than  one  and  a  half  centuries  ago  (Doppler  1842;  Andrade  1959).  If  the 
motion  is  toward  the  observer,  the  shift  is  to  shorter  wavelengths;  when  the  motion  is 


108 


4  Cosmic  Motion 


Fig.  4.2  Doppler  effect  A  stationary  source  of  radiation  (top)  emits  regularly  spaced  light  waves 
that  get  stretched  out  or  scrunched  up  if  the  source  moves  (bottom).  Here  a  star  moving  away 
(bottom  right)  from  the  observer  (bottom  left)  is  shown.  The  stretching  of  light  waves  that  occurs 
when  the  source  moves  away  from  an  observer  along  the  line  of  sight  is  called  a  redshift,  because 
red  light  waves  are  relatively  long  visible  light  waves;  the  compression  of  light  waves  that  occurs 
when  the  source  moves  along  the  line  of  sight  toward  an  observer  is  called  a  blueshift ,  because 
blue  light  waves  are  relatively  short.  The  wavelength  change,  from  the  stationary  to  moving 
condition,  is  called  the  Doppler  shift ,  and  its  size  provides  a  measurement  of  radial  velocity,  or 
the  speed  of  the  component  of  the  source’s  motion  along  the  line  of  sight.  The  Doppler  effect  is 
named  after  the  Austrian  physicist  Christiaan  Doppler  (1803-1853),  who  first  considered  it  in 
1842 

away,  the  wavelength  becomes  longer  (Fig.  4.2).  We  notice  the  effect  for  sound 
waves  when  listening  to  the  changing  pitch  of  a  passing  ambulance  siren.  The  tone  of 
the  siren  is  higher  as  the  ambulance  approaches  and  lower  when  it  moves  away. 

If  the  spectral  line  is  emitted  at  a  specific  wavelength,  by  a  source  at 

rest,  the  wavelength,  2 observed,  observed  from  a  moving  source  is  given  by  the 
relation: 

^ observed  ^ emitted  Vr  r  T  t  /  A  A\ 

- - - =  —  for  h  «  c,  (4.14) 
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where  the  quantity  z  is  known  as  the  redshift,  Vr  is  the  speed  of  the  source’s  radial 

motion  along  the  line  of  sight  away  from  the  observer;  since  the  speed  and 

direction  are  known,  Vr  denotes  the  radial  velocity.  The  speed  of  light 

8  1 

c  =  2.9979  x  10  m  s  .  The  parameter  z  is  called  the  redshift  since  the  Doppler 
shift  is  toward  the  longer,  redder  wavelengths  in  the  visible  part  of  the  electro¬ 
magnetic  spectrum.  When  the  motion  is  toward  the  observer,  Vr  is  negative  and 
there  is  a  blue  shift  to  shorter,  bluer  wavelength.  The  greater  the  speed  along  the 
line  of  sight  in  either  direction,  the  bigger  the  wavelength  shifts. 

The  notation  Vr  «  c  in  our  equation  for  the  redshift,  z,  means  that  the  formula 
applies  for  radial  velocities,  Vn  much  less  than  the  speed  of  light,  c.  This  is  the 
case  for  the  motions  of  stars  in  our  Milky  Way.  Nevertheless,  remote  collections  of 
billions  of  stars,  the  galaxies,  are  moving  at  radial  velocities  that  increase  with 
their  distance,  and  the  exceptionally  remote  ones  can  have  radial  velocities  that 
approach  the  speed  of  light,  or  Vr  «  c.  In  this  case  our  equation  has  to  be  modified 
(see  Sect.  14.3). 


4.3.5  Observed  Proper  Motions  of  Stars 

The  star  with  the  largest  proper  motion  races  across  the  sky  at  about  10.4  s  of  arc, 
denoted  as  10.4",  each  year.  This  is  Barnard’s  star ,  named  after  the  American 
astronomer  Edward  E.  Barnard  (1857-1923)  who  discovered  it  (Barnard  1916).  In 
our  lifetime  this  star  will  move  by  roughly  half  the  angular  diameter  of  the  Moon; 
however,  because  it  is  a  dim,  faint  star  a  telescope  is  required  to  see  it.  Barnard’s 
star  is  1  /27th  of  the  brightness  of  the  faintest  star  that  can  be  seen  with  the  unaided 
eye.  It  is  a  relatively  nearby  star,  located  at  a  distance  of  just  5.98  light-years,  or 
1.834  pc,  and  its  large  proper  motion  is  attributed  to  both  a  high  transverse  speed 
and  the  closeness  of  the  star. 

Barnard’s  star  moves  across  the  line  of  sight  at  a  speed  of  V±  =  90.4  km  s-1. 
When  combined  with  its  radial  velocity  of  Vr  =  —  1 10.6  km  s  \  with  the  negative 
sign  indicating  that  the  star  is  approaching  us,  a  space  velocity  of 
Vs  =  142.7  km  s-1  relative  to  the  Sun  is  obtained,  from  V|  =  V,  +  Vj_.  At  its 
radial  velocity,  Barnard’s  star  will  move  one  light-year  closer  to  us  in  about 
2,100  years,  using  1  light-year  =  9.461  x  10  m  and  1  year  =  3.156x10  sto 
convert  between  units. 

Example:  How  fast  does  Barnard’s  star  move? 

Barnard’s  star  has  an  annual  proper  motion  of  ju  =  10.4"  year-1.  Its  dis¬ 
tance,  D ,  inferred  from  its  parallax  (see  Sect.  10.1)  is  D  =  5.98  light- 
years  =  1.834  pc,  where  1  pc  =  3.26  light-years.  The  star’s  transverse 
velocity,  perpendicular  to  the  line  of  sight,  is  Vj_  =  4.74  jiD  = 
90.4  km  s-1.  The  star’s  redshift  is  z  =  —3.689  x  10-4  =  —0.0003689,  so 
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its  radial  velocity  is  Vr  =  z  x  c  =  —110.6  km  s  ',  where  the  speed  of  light 
c  =  2.9979  x  105  km  s  '.  The  space  velocity,  V^,  of  Barnard’s  star  is 
inferred  from  Vf  =  V±  +  V2,  or  Vs  =  142.8  km  s-1. 


The  closest  star,  Proxima  Centauri,  is  just  4.24  light-years  away,  and  at  a 
distance  of  5.98  light-years  Barnard’s  star  is  nearly  that  close.  However,  Proxima 
Centauri  is  also  moving  closer,  with  a  radial  velocity  of  —21.7  km  s-1,  so  it  will 
keep  its  status  as  the  closest  star  for  a  very  long  time  to  come.  The  proper  motion 
of  Proxima  Centauri  is  3.85"  year-1. 

The  star  with  the  second  largest  proper  motion,  at  8.7"  yr-1,  is  Kapteyn’s  star, 
named  for  the  Dutch  astronomer  Jacobus  C.  Kapteyn  (1851-1922),  who  first 
catalogued  it  (Kapteyn  1898).  It  has  a  distance  of  12.8  light-years  or  3.92  pc,  so  it 
is  moving  across  the  line  of  sight  at  a  speed  of  V±  =  162  km  s-1.  Kapteyn’s  star 
has  a  radial  velocity  of  Vr  =  245.5  km  s-1,  giving  it  a  true  space  velocity  relative 
to  the  Sun  of  Vs  =  293.6  km  s-1.  This  intriguing  star  moves  around  the  center  of 
the  Milky  Way  in  the  opposite  direction  to  the  other  nearby  stars.  It  may  have 
originated  outside  the  Milky  Way  disk  and  is  now  hurtling  through  it. 

Most  proper  motions  are  exceedingly  small  and  usually  measured  in  seconds  of 
arc  per  century,  or  milliarcseconds  per  year,  which  means  the  same  thing.  Due  to 
atmospheric  blurring  the  angular  resolution  of  the  best  telescope  at  the  best 
location  on  the  Earth  is  only  about  0.2",  and  we  would  have  to  wait  more  than 
20  years  to  measure  a  proper  motion  of  this  size.  However,  the  effect  is  cumu¬ 
lative;  therefore  successive  generations  of  astronomers  can  measure  proper 
motion.  After  20  centuries,  the  proper  motion  of  many  stars  might  be  20",  which 
explains  why  Halley  was  able  to  detect  the  effect  using  ancient  observations. 

It  is  much  easier  to  measure  proper  motion  from  space,  outside  the  Earth’s 
atmosphere.  Instruments  aboard  the  HIPPARCOS  satellite  have  pinpointed  the 
positions  and  established  the  proper  motions  of  more  than  100  thousand  stars  with 
an  astonishing  precision  of  0.001".  The  stellar  distances  are  inferred  from  parallax 
measurements,  and  that  explains  the  spacecraft’s  name,  an  acronym  for  High 
Precision  PARallax  Collecting  Satellite.  The  perpendicular  velocities  can  be 
determined  from  the  proper  motions  and  distances. 

Astronomers  specify  the  proper  motion  in  right  ascension  a  and  the  proper 
motion  fis  in  declination  3.  The  magnitude  of  the  total  proper  motion,  /i,  is  given 
by  the  vector  addition  of  its  components  =  /i2  -f  //2  cos2  3 ,  where  the  cos  3 
factor  accounts  for  the  projection  of  /ia  on  the  celestial  sphere.  The  components  of 
proper  motion  and  the  radial  velocities  of  stars  with  exceptionally  high  proper 
motion  are  listed  in  Table  4.2,  where  the  proper  motions  are  in  units  of  milli- 
arcseconds  per  year,  or  10  year  ,  and  abbreviated  mas  year  ,  and  the  +  or 
—  sign  of  the  radial  velocity  indicates  motion  away  or  toward  the  observer, 
respectively. 
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Table  4.2  Stars  with  the  highest  proper  motiona 


Star 

g^cosd  (mas 
year-1) 

gs  (mas 
year-1) 

Parallax 

(mas) 

Radial  velocity 
(km  s-1) 

Barnard’s  star 

-798.71 

1,0337.77 

549.30 

-110.6 

Kapteyn’s  star 

6,500.34 

-5,723.17 

255.12 

+245.5 

Groombridge 

1830 

4,003.69 

5,814.64 

109.22 

-98.0 

Lacaille  9352 

6,766.3 

1,327.99 

303.89 

+9.7 

Gliese  1  (GJ  1) 

5,633.95 

-2,336.69 

229.32 

+23.6 

a  The  designation  mas  is  short  for  milliarcseconds  or  0.001  =  10  3  s  of  arc 


4.3.6  Motions  in  Star  Clusters 

Gravitation  can  constrain  the  paths  of  stars  that  are  congregated  within  star  clusters 
(Table  4.3).  As  many  as  1  million  stars,  for  example,  are  crowded  together  in  a 
typical  globular  star  cluster.  The  cluster  is  tightly  bound  by  gravity,  which  gives  it 
a  distended  spherical  shape  and  relatively  high  stellar  density  toward  the  center 
(Figs.  4.3,  4.4).  The  name  of  this  category  of  star  cluster  is  derived  from  the  Latin 
globules ,  for  “a  small  sphere”.  Another  type  of  stellar  grouping,  known  as  an  open 
star  cluster,  includes  up  to  a  few  thousand  stars  that  were  formed  at  the  same  time, 
but  are  only  bound  loosely  to  one  another  by  mutual  gravitational  attractions 
(Fig.  4.5).  Unlike  globular  star  clusters,  which  can  be  held  together  by  its  stars’ 
mutual  gravitational  pull  for  tens  of  billions  of  years,  an  open  star  cluster  will 
disperse  within  a  few  million  years. 

The  stars  in  a  globular  cluster  are  moving  around  like  a  swarm  of  bees,  or  like 
hot,  subatomic  particles  inside  a  star.  The  stellar  motions  oppose  the  combined 
gravitational  attraction  of  all  of  the  stars,  preventing  them  from  gathering  together 
and  collapsing  to  the  center  of  the  star  cluster. 

In  a  short  elegant  discussion,  the  great  British  astronomer  Arthur  Stanley 
Eddington  (1882-1944)  demonstrated  that  the  internal  kinetic  energy  of  a  star 
cluster  is  half  its  gravitational  potential  energy  (Eddington  1916).  He  also  pointed 
out  that  this  result  could  have  been  obtained  at  once  from  what  is  known  as  the 
virial  theorem,  a  formula  whose  previous  use  had  been  almost  entirely  restricted  to 
gases.  In  Eddington’s  application,  stars  replace  the  atoms  and  molecules  of  a  gas. 

The  virial  theorem  describes  the  stability  of  a  finite,  self-gravitating  collection 
of  particles,  either  atoms  or  stars,  which  is  bound  by  gravitational  forces.  It  states 
that  the  total  kinetic  energy  averaged  over  time  is  just  equal  to  half  the  total 

Table  4.3  Physical  properties  of  star  clusters 

Open  star  cluster  Globular  star  cluster 

Ns  =  total  number  of  stars  =  100  to  1,000  Ns  =  total  number  of  stars  =  104  to  106 

Rc  =  radius  =  1  to  10  pc  «  (3  to  31)  x  1016  m  Rc  =  radius  =  10  to  100  pc  «  (3  to  31)  x  1017  m 

Age  =  107  to  109  year  Age  =  (10  to  14)  x  109  year  =  10  to  14  Gyear 
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Fig.  4.3  Globular  star  cluster  NGC  6934  Several  hundred  thousand  stars  swarm  around  the 
center  of  the  globular  star  cluster  NGC  6934,  which  lies  at  a  distance  of  about  50,000  light-years 
from  the  Earth.  These  ancient  stars  are  estimated  to  be  about  10  billion  years  old.  This  sharp 
image,  obtained  from  the  Hubble  Space  Telescope ,  is  about  3.5  min  of  arc  and  50  light-years 
across.  (Courtesy  of  NASA/ESA.) 


gravitational  potential  energy.  For  Ns  stars  of  individual  mass  Ms  the  star  cluster 
has  a  total  mass  of  Mc  =  Ns  Ms ,  and  it  will  be  gravitationally  bound  together  in  a 
stable  configuration  if: 


1 

-My  <  VS  > 


GMCMS 

2RC 


(4.15) 


and 


<VS> 


GNSMS 


1/2 


(4.16) 
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Fig.  4.4  Faint  stars  in  a 
globular  cluster  This  five- 
day  exposure  from  an 
instrument  aboard  the  Hubble 
Space  Telescope  includes  the 
faintest  detectable  stars  in  the 
globular  star  cluster  NGC 
6397,  which  is  located  about 
8,500  light-years  away  from 
the  Earth.  Some  of  these 
objects  are  white  dwarf  stars, 
the  collapsed,  bumed-out 
relics  of  former  stars  like  the 
Sun.  White  dwarfs  cool  down 
at  a  predictable  rate,  which 
can  be  used  to  measure  the 
age  of  this  globular  cluster, 
estimated  to  be  about 
12  billion  years.  The  crossed 
lines  radiating  from  the  bright 
stars  are  diffraction  spikes 
caused  by  the  struts  that 
support  the  telescope  mirror. 
(Courtesy  of 

NASA/ESA/Harvey  Richer, 
University  of  British 
Columbia.) 


where  Vesc  denotes  the  escape  velocity  of  the  cluster,  Rc  is  the  radius  of  the  star 
cluster,  Vs  is  a  star’s  velocity  and  the  brackets  <  >  denote  a  time  average  with  a 
time-averaged  stellar  speed  of  <VS>  ,  and  the  Newtonian  gravitational  constant 
G  =  6.674  x  10-11  m3  kg-1  s”2. 

Example:  How  fast  do  stars  move  in  a  bound  star  cluster? 

The  number  of  stars,  Ns,  in  a  globular  star  cluster  can  be  about  a  million,  or 

/r 

Ns  =  10  ,  each  with  a  mass,  Ms,  about  equal  to  that  of  the  Sun 

op 

Ms  =  Mq  =  1.989  x  10  kg.  They  are  apparently  bound  together  in  a 
sphere  with  a  radius  of  Rc  =  10  pc  =  3.086  x  10  m.  According  to  the 
virial  theorem,  the  kinetic  energy  of  the  stars,  moving  at  an  average  star 
velocity  <VS>,  must  balance  just  half  the  gravitational  pull  of  all  the  stars  on 
any  one  star,  or  that  Ms  <VS>2  /  2  =  GNSM2S/(2RC),  where  the  Newtonian 

1  i  o  _  i  _ o 

gravitational  constant  G  =  6.674  x  10  m  kg  s  .  Substituting  the 
numbers  into  this  equation  we  obtain  <VS>  =  2.07  x  1 04  m  s  1  = 
20.7  km  s'1. 


114 


4  Cosmic  Motion 


Fig.  4.5  Open  star  cluster  NGC  265  A  brilliant  cluster  of  bright  blue  stars  is  located  in  the 
Small  Magellanic  Cloud,  about  200,000  light-years  away  and  about  65  light-years  across.  This 
Hubble  Space  Telescope  image  subtends  an  angle  of  about  70  s  of  arc.  (Courtesy  of 
NAS  A/ESA/E. Olszewski  University  of  Arizona.) 

If  the  stars  move  on  average  at  a  slower  speed  than  <VS>,  they  will  be  pulled 
gravitationally  into  each  other  and  the  cluster  will  collapse.  If  the  stars  move  at  an 
average  speed  that  is  faster  than  <VS>,  they  eventually  will  disperse  because  the 
cluster  cannot  hold  together.  This  is  what  is  happening  to  open  star  clusters,  and  to 
star  associations  that  are  bound  even  more  loosely.  In  fact,  some  stars  are  moving 
out  of  certain  stellar  associations  at  unexpectedly  rapid  speeds. 


4.3.7  Runaway  Stars 

Some  stars  race  through  space  with  an  abnormally  high  velocity  relative  to  the 
surrounding  interstellar  medium.  These  high-speed  stars  are  known  as  runaway 
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Fig.  4.6  Runaway  star  A 

high-speed  star  slams  into 
dense  interstellar  gas, 
creating  a  bow  shock  wave 
that  may  be  a  million 
kilometers  wide.  The  star  is 
thought  to  be  relatively 
young,  only  millions  of  years 
old.  Moving  at  a  speed  of 
about  100  km  s-1,  it  has 
journeyed  160  light-years 
since  its  birth,  most  likely  in  a 
loosely  bound  stellar 
association.  (Courtesy  of 
NASA/ESA/R.  Sahai/JPL.) 


stars ,  because  they  are  moving  away  from  their  place  of  origin.  They  are  former 
members  of  very  loose  star  clusters,  known  as  stellar  associations,  containing 
bright,  and  relatively  hot,  massive  and  young  stars,  that  are  designated  as  O  and  B 
stars. 

As  first  noticed  by  the  Armenian  astronomer  and  statesman  Viktor  Ambartsumian 
(1908-1996),  these  O  and  B  stars  are  expanding  away  from  one  another  and  from  a 
common  origin  at  speeds  of  about  10  km  s-1  (Ambartsumian  1949).  The  associa¬ 
tions  are  now  dispersing  and  disintegrating,  but  still  moving  together  in  a  roughly 
spherical  shape  due  to  their  relatively  young  age.  They  are  not  expected  to  stay 
together  for  longer  than  a  few  tens  of  millions  of  years. 

Runaway  stars  are  moving  with  faster  speeds  than  other  stars  in  the  OB  asso¬ 
ciations  but  with  proper  motions  that  often  point  away  from  the  stellar  association 
to  which  they  once  belonged.  These  runaways  are  most  likely  escaped  members  of 
former  binary  star  systems  that  once  belonged  in  the  association,  until  one  of  the 
two  stars  exploded.  As  described  by  the  Dutch  astronomer  Adriaan  Blaauw 
(1914-2010),  runaway  stars  are  very  massive  stars  whose  high  space  velocities  are 
comparable  to  the  orbital  velocities  expected  for  massive  binary-star  systems 
(Blaauw  and  Moran  1954;  Blaauw  1961,  1964).  Because  massive  stars  burn  their 
thermonuclear  fuel  faster,  and  have  a  shorter  lifetime  than  normal  stars,  one 
member  of  such  a  binary  system  will  quickly  exhaust  its  thermonuclear  reserves 
and  explode  as  a  supernova,  thereby  releasing  the  other  member  as  a  high-velocity 
star.  The  evolution  and  explosive  fate  of  such  massive  stars  is  considered  in 
Sect.  13.5. 

The  Hubble  Space  Telescope  has  captured  striking  images  of  runaway  stars 
plowing  through  regions  of  dense  interstellar  gas  and  creating  brilliant  bow- shock 
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structures  and  trailing  tails  of  glowing  gas  (Fig.  4.6).  These  features  are  formed 
when  the  stars’  powerful  stellar  winds  slam  into  the  surrounding  gas.  The  shocks 
indicate  that  the  runaway  stars  are  traveling  at  speeds  between  50  and  100  km  s-1 
relative  to  the  dense  gas  through  which  they  are  moving.  This  is  five  or  ten  times 
faster  than  the  expansion  speeds  of  the  stellar  associations  or  the  average  speeds  of 
stellar  motions  with  respect  to  nearby  stars  or  the  local  interstellar  medium. 


4.4  Cosmic  Rotation 

In  addition  to  moving  through  space,  an  astronomical  object  also  rotates  or  spins 
about  its  axis.  This  rotation  often  can  be  traced  back  to  an  origin  from  a  more 
distended  object  of  slower  spin,  but  sometimes  it  is  related  to  a  glancing  collision 
in  the  past. 

The  period  of  rotation  is  the  time  it  takes  to  complete  one  revolution,  or  the 
time  for  the  planet  or  star  to  spin  into  the  same  orientation  in  space.  For  the  planets 
and  the  Sun,  this  intrinsic  rotation  period  is  known  as  the  sidereal  rotation  period, 
from  fixed  star  to  fixed  star;  it  has  been  corrected  for  any  observational  effects  such 
as  the  Earth’s  orbital  motion  around  the  Sun. 


4.4.1  Unexpected  Planetary  Rotation 

For  solid  rocky  planets,  the  rotation  period  is  everywhere  the  same  on  the  planet’s 
surface.  The  Earth,  for  example,  rotates  once  very  24  h  or  86,400  s  at  all  latitudes, 
or  at  every  angular  distance  north  or  south  of  the  equator.  As  a  result,  all  points  of 
the  globe  take  the  same  amount  of  time  to  complete  one  rotation  and  a  day  lasts 
24  h  everywhere  on  the  planet.  If  the  rotation  period  differed  at  different  latitudes, 
the  solid  planet  would  break  apart. 

You  might  think  that  it’s  easy  to  determine  the  rotation  period  of  a  planet.  All 
you  need  to  do  is  watch  how  long  it  takes  for  a  prominent  surface  feature  to  spin 
around  behind  the  planet  and  reappear.  But  this  was  not  the  case  for  Mercury, 
which  is  so  close  to  the  Sun  that  most  people  have  never  even  seen  the  planet, 
let  alone  resolved  anything  on  its  surface.  And  the  situation  was  even  worse  for 
cloud-covered  Venus,  whose  surface  can  never  be  seen. 

Astronomers  once  supposed  that  solar  tides  in  the  body  of  Mercury  would  cause 
the  planet  to  rotate  on  its  axis  once  every  87.97  Earth  days,  in  step  with  its  orbital 
period.  Just  as  the  Earth’s  Moon  always  presents  the  same  face  to  the  Earth,  it  was 
thought  that  one  side  of  Mercury  was  always  turned  toward  the  Sun.  To  test  this 
idea,  the  Italian  astronomer  Giovanni  Schiaparelli  (1835-1910)  monitored 
Mercury’s  surface  markings  seen  though  his  0.46-m  (18-inch)  telescope,  and  he 
concluded  that  the  same  side  of  the  planet  did,  indeed,  always  face  the  Sun 
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Fig.  4.7  Radar  probes  of  Mercury  A  radio  signal  spreads  out  as  a  spherical  wave,  and  Mercury 
intercepts  a  small  fraction  of  them.  As  the  wave  sweeps  by  the  planet,  it  is  reflected  in  spherical 
wavelets  whose  wavelengths  are  Doppler- shifted  by  the  rotational  motion  of  Mercury’s  surface. 
The  waves  from  the  receding  side  are  red-shifted  toward  longer  wavelengths  and  those  from  the 
approaching  side  are  blue-shifted  to  shorter  wavelengths.  The  total  amount  of  wavelength  change, 
from  red  to  blue,  reveals  the  speed  of  rotation,  and  the  rotation  period  can  be  obtained  by  dividing 
the  planet’s  circumference  by  this  speed 

(Schiaparelli  1889;  Defrancesco  1988).  For  three-quarters  of  a  century,  telescopic 
observers  agreed  with  his  conclusion.  All  of  these  astronomers  were  dead  wrong! 

In  1967,  Mercury’s  true  rotational  period  was  determined  with  radio  signals  that 
rebounded  from  the  planet  (Dyce  et  al.  1967).  The  world’s  largest  radio  telescope, 
located  in  Arecibo,  Puerto  Rico,  was  used  to  transmit  2  million  watts  of  pulsed 
radio  power  at  the  planet,  and  to  receive  the  faint  echo.  This  technique  is  known  as 
radio  detection  and  ranging,  abbreviated  radar,  and  it  is  also  used  to  locate  and 
guide  airplanes  near  airports. 

Each  pulse  was  finely  tuned,  within  a  narrow  range  of  wavelengths  around  0.497  m, 
and  emitted  for  only  about  a  millisecond.  Upon  hitting  the  planet,  its  rotation  de-tuned 
the  pulse,  slightly  spreading  it  over  a  wider  range  of  wavelengths  (Fig.  4.7).  One  side  of 
the  globe  was  rotating  away  from  the  Earth,  while  the  other  side  was  rotating  toward 
our  planet.  These  motions  produced  slight  changes  in  the  wavelength  of  the  echo, 
which  arrived  back  at  Arecibo  shortly  before  the  next  radar  pulse  was  sent.  The 
rotational  velocity  and  period  were  calculated  from  the  broadened,  wavelength-shape 
of  the  return  echo,  using  the  well-known  expression  for  the  Doppler  effect. 

A  rotating  object  will  produce  a  blueshift  on  the  side  spinning  toward  an 
observer,  and  a  redshift  on  the  opposite  side.  Their  combined  effect  will  broaden  a 
narrow  spectral  line  or  a  finely  tuned  radio  pulse  at  wavelength  2  by  an  amount  Ak 
given  by  the  expression: 


Ak  =  Vrot 
k  c 

The  period,  P,  of  rotation  for  an  object  of  radius  R  is  P  =  2nR/Vrot ,  and  Vrot  is  the 
rotation  velocity. 


(4.17) 
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The  radar  result  for  Mercury  came  as  an  unexpected  surprise.  Its  rotation  period 
was  58.646  Earth  days,  or  exactly  two-thirds  of  the  87.969-day  period  that  had 
been  accepted  so  long.  Thus,  with  respect  to  the  stellar  background,  Mercury  spins 
on  its  axis  three  times  during  two  full  revolutions  about  the  Sun,  which  follows 
from  3  x  58.646  =  2  x  87.969,  and  it  is  technically  known  as  spin-orbit  cou¬ 
pling.  In  comparison,  the  Earth’s  Moon  has  a  1:1  spin-orbit  resonance  in  which  its 
rotation  period  is  equal  to  its  orbital  period. 

The  Italian  scientist  Giuseppe  Colombo  (1930-1984)  provided  an  explanation 
for  this  result  in  terms  of  the  Sun’s  varying  tidal  forces  as  Mercury  revolves  about 
its  elongated  orbit  (Colombo  and  Shapiro  1966;  Goldreich  and  Peale  1966).  The 
solar  gravity  pulls  hardest  on  Mercury  when  the  planet  is  closest  to  the  Sun,  at 
perihelion,  and  least  at  the  opposite  side  of  its  eccentric  orbit,  at  aphelion.  This 
extra  gravitational  pull  of  the  Sun  at  perihelion  gives  an  abrupt  twist  to  Mercury’s 
non-spherical  body,  speeding  up  the  rotation  rate  and  forcing  it  into  synchronism 
at  perihelion  with  the  3:2  resonance.  If  Mercury’s  orbit  around  the  Sun  were  much 
closer  to  a  circular  shape,  like  the  nearly  round  orbit  of  the  Moon  around  the  Earth, 
then  the  Sun’s  tidal  forces  would  have  slowed  Mercury’s  rotation  into  synchro¬ 
nism  with  its  orbital  motion,  in  a  1:1  resonance  with  a  rotation  period  equal  to  its 
orbital  period. 

No  human  eye  has  ever  gazed  on  the  surface  of  Venus,  which  is  forever  hidden 
by  a  thick  overcast  of  impenetrable  clouds,  but  radio  waves  can  penetrate  this 
obscuring  veil  and  touch  the  landscape  hidden  beneath.  By  bouncing  pulses  of 
radio  radiation  off  the  surface  of  Venus,  the  radar  astronomers  also  discovered  in 
1967  that  this  planet  spins  in  the  backward  direction,  opposite  to  that  of  its  orbital 
motion.  That  is,  unlike  the  other  terrestrial  planets,  Venus  does  not  rotate  in  the 
direction  in  which  it  orbits  the  Sun. 

The  radar  observations  also  showed  that  Venus  spins  with  a  period  longer  than 
any  other  planet,  at  243.018  Earth  days.  This  rotation  period  is  even  longer  than 
the  planet’s  224.7  Earth-day  period  of  revolution  around  the  Sun,  so  the  day  on 
Venus  is  longer  than  its  year.  Tides  raised  by  the  Sun  in  the  planet’s  thick 
atmosphere  may  explain  why  Venus  turns  very  slowly  and  in  the  wrong  way,  but  it 
might  have  alternatively  been  knocked  into  a  backwards  rotation  by  a  collision 
with  a  planet-sized  object  in  the  early  epochs  of  the  solar  system,  when  such 
collisions  were  more  common. 

Unlike  the  rocky  terrestrial  planets,  the  gaseous  giant  planets  do  not  rotate  at  a 
uniform  rate,  and  this  results  in  a  non-spherical  shape.  The  outward  force  of 
rotation  opposes  the  inward  gravitational  force,  and  this  reduces  the  pull  of  gravity 
in  the  direction  of  spin.  As  a  result,  the  giant  planets  rotate  faster  in  their  equatorial 
middle,  where  there  is  a  perceptible  bulge,  and  slower  at  the  flattened  poles.  So 
they  have  an  oblate  shape  that  is  elongated  along  the  equator  (Table  4.4).  The 
same  thing  even  happens  to  the  Earth,  but  by  a  relatively  small  amount  since  it  is 
solid  instead  of  gaseous  inside. 

The  apparent  outward  force  that  draws  a  rotating  body  away  from  the  center  of 
gravitational  acceleration  is  known  as  centrifugal  force ,  from  the  Latin  centrum 
meaning  “center”  and  fugere,  meaning  “to  flee”.  It  tends  to  push  the  equatorial 
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Table  4.4  Oblateness  of  the  giant  planets  and  the  Eartha 


Planet 

Equatorial  radius,  Re  (km) 

Polar  radius,  Rp  (km) 

Oblateness  (Re-Rp)/Re 

Earth 

6,378.140 

6,356.755 

0.003353 

Jupiter 

71,492 

66,854 

0.0649 

Saturn 

60,268 

54,364 

0.0980 

Uranus 

25,559 

24,973 

0.0229 

Neptune 

24,766 

24,342 

0.0171 

a  The  radii  are  given  in  units  of  kilometers,  abbreviated  km.  The  radii  of  the  giant  planets  are 
those  at  the  level  where  the  atmospheric  pressure  is  equal  to  one  bar,  the  pressure  of  air  at  sea 
level  on  Earth 


regions  out.  The  ratio  of  the  centrifugal  acceleration  at  the  equator  to  the  gravi¬ 
tational  acceleration  at  the  equator  is 


cd2R 3  4tz2R3 
GM  ~  P2GM 


(4.18) 


for  a  planet  rotating  at  an  angular  velocity  co  =  2i i/P,  rotation  period  P,  radius 
R  and  mass  M,  where  the  universal  constant  of  gravitation  G  =  6.674  x 
10“ 11  m3  kg  1  s-2. 

The  rapid  rotation  of  a  planet  or  star  might  push  the  equatorial  regions  out  so  far 
that  it  rips  the  object  apart,  and  this  provides  an  upper  limit  to  the  possible  rotation 
speed  and  a  lower  limit  to  the  rotation  period  (Focus  4.1).  If  the  rotation  is  too  fast, 
and  the  equatorial  push  is  too  much,  there  is  nothing  left  to  rotate. 


Focus  4.1  How  fast  can  a  planet  or  star  rotate? 

The  rotation  of  a  planet  or  star  forces  its  equatorial  regions  out,  and  if  the 
speed  of  rotation  is  too  fast  the  object  will  fall  apart.  This  will  happen  if  the 
equatorial  rotation  velocity  of  the  star,  Vrot ,  exceeds  the  escape  velocity, 
Vesc,  which  for  an  object  with  mass,  M,  and  radius,  R,  occurs  when: 


2nR 


>V< 


2  GM\  1/2 

~R~) 


(4.19) 


where  the  symbol  >  denotes  greater  than  or  equal,  the  constant 
7i  «  3.14159  and  the  Newtonian  gravitation  constant  G  =  6.674  x 

_  11  Q  _  1  _ O 

10  m  kg  s  .  Collecting  terms  in  this  equation,  we  see  that  the  break 
up  happens  for  rotation  periods  P  of 


P< 


2  n2R3\  1/2 

GM ) 


(4.20) 


where  the  symbol  <  denotes  less  than  or  equal.  We  would  obtain  the  same 
condition,  within  a  factor  of  the  square  root  of  two,  or  if  we  set  the  ratio 
of  the  centrifugal  acceleration  at  the  equator  equal  to  the  gravitational 
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acceleration  at  the  equator.  For  a  planet  or  star  of  uniform  mass  density 
p  =  M/(4nR ~  /3),  the  mass  per  unit  volume,  the  upper  limit  to  the  period  of 
rotation  is: 


P< 


(4.21) 


_ Q 

For  a  rocky  planet  with  a  mass  density  of  3,000  kg  m  ,  like  that  of  the 
Earth’s  crust,  the  fastest  possible  rotation  period  is  P  ^  6,470  s  «  1.8  h. 
Most  asteroids,  for  example,  do  not  rotate  faster  than  once  every  2.2  h. 

_ Q 

Stars  with  a  mean  mass  density  like  that  of  the  Sun,  at  1,410  kg  m  , 
would  be  expected  to  rotate  with  periods  longer  than  about  2.6  h,  but  stars  do 
not  have  a  uniform  mass  density.  Moreover,  collapsed  stars,  like  neutron 
stars  and  pulsars  have  very  high  mass  densities  approaching  nuclear  densi- 

i  n  _ o 

ties  of  5  x  10  kg  m  ,  which  is  what  happens  when  you  press  a  solar  mass 
into  a  star  about  104  m  in  radius.  Our  equation  then  shows  that  the  fastest 

_ o 

pulsar  probably  has  a  period  of  about  0.5  ms,  or  0.5  x  10  ~  s,  and  thus 
rotates  about  2,000  times  a  second.  Such  stars  also  have  exceptionally  high 
escape  velocities  owing  to  their  compact  size. 

The  instability  of  uniformly  rotating  spherical  masses  was  first  described 
by  the  Scottish  mathematician  Colin  Maclaurin  (1698-1746),  and  the  gen¬ 
eral  result  ever  since  then  (Maclaurin  1742)  is  that  a  rotating  sphere  becomes 
unstable  when  the  angular  rotational  velocity  co  =  In/ P  rises  above  (Gp)1^2 
(Tassoul  1978).  The  detailed  theory  for  rotating  fluid  masses,  as  well  as 
gaseous  ones,  has  a  long,  rich  history  that  can  be  found  in  Todhunter  (1962) 
and  Chandrasekhar  (1969). 


Despite  its  great  size,  Jupiter  rotates  so  fast  that  day  and  night  each  last  about 
5  h  and  its  full  day  is  less  than  one-half  Earth  day.  The  precise  rotation  period  of 
9.9249  h  is  found  by  tracking  radio  bursts  that  are  linked  to  the  planet’s  spinning 
magnetic  field,  which  emerges  from  deep  within  the  planet.  Saturn  rotates  with  a 
day  of  only  10.6562  h,  which  is  also  inferred  from  the  observed  periodic  modu¬ 
lation  in  Saturn’s  radio  emission,  generated  in  its  spinning  magnetic  fields.  The 
visible  clouds  at  different  latitudes  on  both  giant  planets  rotate  at  different  speeds 
and  even  in  different  directions.  The  rotation  periods  of  the  some  planets  and  stars, 
including  the  Sun,  are  given  in  Table  4.5. 


4.4.2  The  Sun’s  Differential  Rotation 

Observations  of  sunspots  have  long  indicated  that  the  visible  solar  disk  rotates 
differently  at  different  latitudes,  with  a  faster  rate  at  the  equator  than  at  higher 
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Table  4.5  Rotation  periods  and  rotation  velocities  of  some  planets  and  stars 


Object 

Rotation  period 

Radiusa  (m) 

Rotation  velocity a  (ms  1 ) 

Earth 

0.99727  Earth  daysb 

6.378  x  106 

4.651  x  102 

Earth’s  Moon 

27.322  Earth  daysc 

1.738  x  10s 

4.627 

Mercury 

58.6462  Earth  days 

2.440  x  106 

3.026 

Venus 

—243.018  Earth  days 

6.052  x  10s 

1.81 

Jupiter 

9.9249  h 

7.149  x  107 

1.26  x  104 

Saturn 

10.6562  h 

6.027  x  107 

9.87  x  103 

Sun  (equator) 

25.67  Earth  days 

6.955  x  108 

1.97  x  103 

Vega 

12.5  h 

1.933  x  109 

2.7  x  105 

White  dwarf  stard 

186.5  s 

6.378  x  10s 

2.1  x  105 

Crab  pulsar 

0.033  s 

104 

1.9  x  106 

a  The  equatorial  radius  is  given  when  the  object  has  a  known  oblate  shape,  and  in  this  situation 
the  equatorial  rotation  velocity  is  provided 

b  One  Earth  day  is  defined  as  the  time  for  our  planet  to  revolve  once  with  respect  to  the  Sun,  and 
such  a  solar  day  is  24  h  or  86,400  s  long.  The  Earth’s  rotation  period  with  respect  to  stars,  or 
sidereal  time,  runs  about  4  min  slower  than  the  solar  day.  The  sidereal  day  lasts  23  h  56  min  04  s 
or  8.6164  x  104  s 

c  The  sidereal  rotation  period  of  the  Earth’s  Moon,  from  fixed  star  to  fixed  star,  is  27.322  Earth 
days.  The  time  from  new  Moon  to  new  Moon,  known  as  the  synodic  month,  is  29.53  Earth  days 
d  The  radius  of  a  white  dwarf  star  is  assumed  to  be  equal  to  that  of  the  Earth  and  its  rotation 
period  inferred  from  the  rotation  period  of  the  Sun  and  conservation  of  angular  momentum,  so  the 
period  scales  as  the  inverse  square  of  the  radius 


Table  4.6  Differential  rotation  of  the  Suna 


Solar  latitude 
(degrees) 

Rotation  period 
(days) 

Rotation  speed 
(km  h-1) 

Rotation  speed 
(m  s'1) 

Angular  velocity 
(nHz) 

0  (Equator) 

25.67 

7,097 

1,970 

451 

15 

25.88 

6,807 

1,891 

447 

30 

26.64 

5,922 

1,645 

434 

45 

28.26 

4,544 

1,262 

410 

60 

30.76 

2,961 

823 

376 

75 

33.40 

1,416 

393 

347 

a  Data  from  the  MDI  instrument  aboard  the  SOHO  spacecraft 


latitudes  (Carrington  1863;  Newton  and  Nunn  1951;  Snodgrass  1983).  This  is 
known  as  differential  rotation,  since  the  rate  of  rotation  differs  at  different  latitudes. 
It  indicates  that  the  Sun  is  not  solid,  for  it  would  be  torn  apart  by  differential  rotation 
if  it  was  solid;  instead,  most  of  the  Sun  is  a  gaseous  plasma.  Gilman  (1974)  has 
provided  a  review  of  the  rotation  of  the  Sun. 

The  rotation  speed  of  the  visible  solar  disk,  the  photosphere,  can  be  inferred 
from  the  Doppler  effect  of  an  absorption  line  originating  there,  and  the  results 
confirm  the  differential  rotation  suggested  by  sunspot  observations  (Table  4.6). 
The  Sun  spins  about  its  axis  with  a  period  of  about  25  days  at  the  equator,  which 
corresponds  to  a  rotation  speed  of  about  2  km  s-1  or  roughly  7,200  km  per  hour. 
But  since  the  Earth  is  orbiting  the  Sun  in  the  same  direction  that  the  Sun  rotates, 
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the  solar  rotation  period  observed  from  the  Earth  is  about  27  days.  The  shorter 
rotation  period  is  the  star’s  true  rotation  period,  and  is  technically  known  as  the 
sidereal  rotation  period,  from  fixed  star  to  fixed  star;  the  longer  period  required  for 
a  fixed  feature  to  rotate  back  to  the  same  position  as  viewed  from  Earth,  is  called 
the  synodic  rotation  period. 

The  English  amateur  astronomer  Richard  Christopher  Carrington  (1826-1875) 
determined  the  solar  rotation  rate  from  low  latitude  sunspots  in  the  1850s,  and 
defined  a  fixed  solar  coordinate  system  that  rotates  once  every  25.38  days 
(Carrington  1863),  which  would  correspond  to  the  sidereal  rotation  period  near  the 
solar  equator.  To  compare  locations  on  the  Sun  over  a  period  of  time,  the  mean 
observed,  or  synodic,  solar  rotation  period  has  been  arbitrarily  taken  to  be 
27.2753  days.  Each  rotation  of  the  Sun  is  then  given  a  unique  number  called  the 
Carrington  rotation  number,  with  rotation  1  beginning  on  9  November  1853. 

The  synodic  equatorial  rotation  period,  as  observed  from  Earth,  is 
26.75  zb  0.05  days,  with  a  differential  synodic  rotation  given  by  (Sheeley  et  al. 
1992): 


co(9)  =  13.46  —  2.7 cos2  9  +  1.2cos4  9  -  3.2cos6(0),  (4.22) 

where  co  is  the  angular  velocity  in  degrees  per  day,  and  6  is  the  co-latitude 
measured  from  the  poles  instead  of  the  equator.  An  angular  velocity  of  13.46 
degrees  per  day  corresponds  to  451  nHz. 

Helioseismologists  have  more  recently  used  5  min  oscillations  of  the  photo¬ 
sphere,  produced  by  internal  sound  waves,  to  investigate  the  internal  structure  of 
the  Sun  (see  Sect.  8.5),  and  the  rotation  rate  inside  the  Sun  has  been  measured  by  a 
change  in  the  periods  of  the  sound  waves.  Waves  propagating  in  the  direction  of 
rotation  are  carried  along  by  the  moving  gas,  and  move  faster  than  they  would  in  a 
non-rotating  Sun.  A  bird  or  a  jet  airplane  similarly  moves  faster  when  traveling 
with  the  wind  and  takes  a  shorter  time  to  complete  a  trip.  The  resonating  sound¬ 
wave  crests  moving  with  the  rotation  therefore  appear,  to  a  fixed  observer,  to  have 
shorter  periods.  Waves  propagating  against  the  rotation  are  slowed  down,  with 
longer  periods.  These  opposite  effects  make  the  observed  solar  oscillation  periods 
divide,  and  such  rotational  splitting  depends  on  both  the  depth  and  the  latitude  of 
the  sound  waves  moving  within  the  Sun. 

The  solar  oscillations  have  a  period  of  about  5  min,  so  the  rotational  splitting  is 
roughly  5  min  divided  by  25  days,  or  about  one  part  in  seven  thousand  since 
1  day  =  1,440  min.  The  photosphere  oscillations  have  to  be  measured  ten  or  a 
hundred  times  more  accurately  than  this  to  determine  subtle  variations  in  the  Sun’s 
rotation,  or  as  accurately  as  one  part  in  a  million. 

The  solar  oscillation  data  indicate  that  differential  rotation,  in  which  the  equator 
spins  faster  than  the  poles,  is  preserved  throughout  the  outer  third  of  the  Sun, 
known  as  the  convective  zone  (Fig.  4.8).  Within  this  zone,  there  is  little  variation 
of  rotation  with  depth,  and  the  inside  of  the  Sun  does  not  rotate  any  faster  than  the 
outside  at  the  same  latitude.  At  greater  depths,  the  interior  rotation  no  longer 
mimics  that  of  sunspots,  and  differential  rotation  disappears.  The  internal  accord 
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Fig.  4.8  Internal  rotation  of  the  Sun  The  rotation  rate  inside  the  Sun,  determined  by 
helioseismology  using  instruments  aboard  the  SOHO  spacecraft.  The  outer  parts  of  the  Sun 
exhibit  differential  rotation,  with  material  at  high  solar  latitudes  rotating  more  slowly  than 
material  at  equatorial  latitudes.  This  differential  rotation  persists  to  the  bottom  of  the  convective 
zone  at  28.7  percent  of  the  way  down  to  the  center  of  the  Sun.  The  rotation  period  in  days  is  given 
at  the  left  axis,  and  the  corresponding  angular  velocity  scale  is  on  the  right  axis  in  units  of 
nanoHertz  (nHz),  where  1  nHz  =  109  Hz,  or  1  billionth,  of  a  cycle  per  second.  A  rotation  rate  of 
320  nHz  corresponds  to  a  period  of  about  36  days  {solar  poles )  and  a  rate  of  460  nHz  to  a  period 
of  about  25  days  {solar  equator).  The  rotation  in  the  outer  parts  of  the  Sun  is  given  at  latitudes  of 
0  {solar  equator),  30,  45,  60,  and  75  degrees.  Just  below  the  convective  zone,  the  rotational  speed 
changes  markedly,  and  shearing  motions  along  this  interface  may  be  the  dynamo  source  of  the 
Sun’s  magnetism.  There  is  uniform  rotation  in  the  radiative  zone,  from  the  base  of  the  convective 
zone  at  0.713  to  about  0.25  solar  radii.  The  sound  waves  do  not  reach  the  central  part  of  the 
energy-generating  core.  (Courtesy  of  Alexander  G.  Kosovichev/convective  zone/Sebastien 
Couvidat,  Rafael  Garcia,  and  Sylvaine  Turck-Chieze/radiative  zone.  SOHO  is  a  project  of 
international  cooperation  between  ESA  and  NASA.) 


breaks  apart  just  below  the  base  of  the  convective  zone,  where  the  rotation  speed 
becomes  uniform  from  pole  to  pole.  Lower  down,  within  the  radiative  zone,  the 
rotation  rate  remains  independent  of  latitude,  acting  as  if  the  Sun  were  a  solid 
body.  Although  gaseous,  the  radiative  interior  of  the  Sun  rotates  at  a  nearly  uni¬ 
form  rate  intermediate  between  the  equatorial  and  high-latitude  rates  in  the 
overlying  solar  material.  Lang  (2009)  provides  detailed  references  to  pioneering 
spacecraft  studies  of  the  internal  rotation  of  the  Sun;  also  see  the  review  by 
Thompson  et  al.  (2003). 

Thus,  the  Sun’s  internal  rotation  velocity  changes  sharply  at  about  one  third  of 
the  way  to  its  center,  where  the  outer  parts  of  the  radiative  interior,  which  rotate  at 
one  speed,  meet  the  overlying  convective  zone,  which  spins  faster  in  its  equatorial 
middle.  The  transition  between  these  two  different  regimes  takes  place  in  a  narrow 
region  of  strong  rotational  shear  that  most  likely  plays  an  important  role  in  the 
generation  of  the  large-scale  solar  magnetic  field.  The  differential  rotation  of  other 
stars  may  also  play  a  significant  role  in  the  generation  of  their  magnetic  fields. 


124 


4  Cosmic  Motion 


4.4.3  Stellar  Rotation  and  Age 

The  rotation  of  stars  other  than  the  Sun  is  inferred  from  the  Doppler  broadening  of 
their  spectral  lines.  However,  astronomers  did  not  realize  at  first  that  some  stars 
rotate  fast  enough  for  such  measurements  to  be  meaningful  (Abney  1877).  The 
Doppler  effect  of  atoms  moving  in  the  hot  stellar  atmosphere  once  was  thought  to 
be  substantially  greater  than  that  of  stellar  rotation.  The  Sun,  for  example,  has  an 
equatorial  rotational  speed  of  about  2  km  s-1,  but  the  thermal  speed  of  hydrogen 
atoms  in  the  Sun’s  visible  disk,  at  a  temperature  of  5,780  K  is  about  12  km  s-1. 
The  Doppler  broadening  of  the  hot,  moving  atoms  would  therefore  be  greater  than 
that  attributed  to  rotation,  and  therefore  make  the  rotational  motion  very  difficult  to 
detect.  The  only  reason  that  solar  rotation  could  be  measured  using  spectral  lines, 
rather  than  sunspots,  was  that  the  Sun  is  resolved  and  one  can  map  out  the  speeds 
at  various  places  along  the  visible  disk,  but  that  is  not  possible  for  almost  all  other 
stars  that  remain  unresolved  with  even  the  best  telescope. 

The  turning  point  came  in  a  seminal  paper  by  two  Russian  astronomers,  Grigory 
Ambramovich  Shajn  (1892-1956)  and  Otto  Struve  (1896-1963),  who  showed  that 
relatively  young  stars  rotate  faster  than  older  ones  and  therefore  exhibit  excep¬ 
tionally  broad  spectral  lines.  They  concluded  that  some  stars  have  equatorial 
rotational  velocities  ranging  up  to  100  km  s-1  (Shajn  and  Struve  1929; 
Struve  1930). 

The  observed  component  of  radial  velocity,  Vn  depends  on  the  inclination,  i,  of 
the  star’s  pole  to  the  line  of  sight,  with  Vr  =  Ve  sin  i,  where  Ve  is  the  rotational 
velocity  at  the  equator.  The  inclination  is  often  unknown,  so  the  measurements 
give  a  minimum  value  for  the  star’s  rotation  velocity,  sometimes  referred  to  as  the 
projected  rotational  velocity. 

Stars  with  rapid  rotation  are  massive,  hot  and  young  (Slettebak  1949,  1954, 
1955).  They  include  the  bright  stars  Achernar,  Alpha  Arae,  Pleione,  and  Vega 
(Peterson  et  al.  2006),  with  respective  equatorial  rotational  velocities  of  up  to  300, 
470,  329,  and  274  km  s-1.  These  stars  are  rotating  so  rapidly  that  their  equators 
bulge  outward,  giving  them  a  flattened  shape.  Achernar,  for  example,  is  thought  to 
have  an  equatorial  diameter  that  is  about  50  %  greater  than  the  distance  between  its 
poles.  The  stars  with  rapid  rotation  exhibit  intense  x-ray  emission,  due  to  hot 
coronae  and  presumably  intense  magnetic  fields  (Pallavicini  et  al.  1981). 

Less  massive,  cooler  and  older  stars  like  the  Sun  rotate  with  much  slower 
speeds  of  10  km  s-1  or  less,  and  do  not  exhibit  a  pronounced  equatorial  bulge. 
These  stars  most  likely  were  formed  with  fast  rotations  like  the  more  massive  stars, 
but  have  slowed  down  as  they  aged.  Stellar  magnetic  fields  coupled  to  the  sur¬ 
rounding  interstellar  material  act  as  magnetic  brakes  over  long  time  intervals.  The 
Sun,  for  example,  probably  originated  4.6  billion  years  ago  with  a  rotation  velocity 
of  about  100  km  s-1;  its  magnetism  helps  to  explain  why  it  is  now  rotating  at 
about  2  km  s  1  (Sects.  12.1  and  12.2). 


Chapter  5 

Moving  Particles 


5.1  Elementary  Constituents  of  Matter 

What  is  matter  made  of?  To  find  out,  we  might  try  breaking  any  material  object 
into  increasingly  smaller  pieces  until  we  reach  a  stage  when  the  smallest  piece 
cannot  be  broken  apart.  The  last  step  in  this  imaginary,  successive  division  of 
matter  suggests  the  existence  of  unseen  atoms ,  a  Greek  word  meaning  “indivisi¬ 
ble,”  or  something  that  cannot  be  divided  further. 

In  the  fifth  century  BC,  for  example,  the  ancient  Greek  philosopher  Democritus 
(ca  470-ca  380  B.C.),  and  his  mentor  Leucippus  proposed  that  all  matter  is 
composed  of  combinations  of  a  small  number  of  separate  atoms  coming  together  in 
different  ways.  They  supposed  that  all  substances  are  composed  of  four  types  of 
elemental  atoms:  of  air,  earth,  fire,  and  water.  Mud,  for  example,  could  be  made 
from  earth  and  water,  and  fire  could  turn  water  into  vapor. 

Then  about  two  millennia  ago,  the  Roman  poet  Titus  Lucretius  Cams  (ca  99 
BC-ca  55  BC),  or  Lucretius  for  short,  wrote  a  wonderful  epic  poem,  in  Latin  De 
Rerum  Natura  or  in  English  On  the  Nature  of  the  Universe ,  which  described  these 
indestructible  atoms  that  are  so  exceedingly  small  that  they  are  invisible  and 
infinitely  vast  in  number  (Lucretius,  55  BC).  To  Lucretius,  atoms  were  the 
building  blocks  of  all  that  exists.  This  fundamental  idea  persists  to  this  day. 
Everything  that  we  see,  from  a  friend  to  a  tree  to  the  Sun  and  the  stars  consist  of 
innumerable  atoms,  all  moving  randomly  about,  colliding,  gathering  together,  and 
breaking  apart  again.  Atoms  are  immortal,  the  ingredients  of  all  that  existed  in  the 
past  and  the  seeds  of  everything  that  will  exist  in  the  future. 

All  ordinary  matter  is  composed  of  elemental  atoms ,  and  there  is  a  limited 
number  -  just  94  naturally  occurring  atoms,  detected  directly  on  the  Earth  or  in 
astronomical  spectra.  These  atoms  are  known  also  as  chemical  elements ,  because 
they  cannot  be  decomposed  by  chemical  means.  They  are  very  small  and 
exceedingly  numerous.  A  simple  drop  of  water  contains  about  100,000  billion, 
billion,  or  10“  atoms  -  close  to  the  number  of  stars  in  the  universe.  An  additional 
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24  atoms  have  been  produced  as  the  result  of  artificial  nuclear  reactions  within 
particle  accelerators  rather  than  by  natural  processes. 

Atoms  combine  to  form  molecules,  and  there  are  many  more  kinds  of  molecules 
than  there  are  atoms.  The  vast  numbers  of  molecules  differ  only  in  the  kind  and 
relative  number  of  the  atoms  of  which  they  are  constructed.  A  molecule  may  be  a 
combination  of  single  chemical  atoms,  such  as  the  oxygen  molecule,  02,  that  we 
breathe,  which  consist  of  two  oxygen  atoms  each  designated  by  O.  A  molecule 
also  may  contain  different  elements,  as  in  water,  designated  H20,  which  is  com¬ 
posed  of  two  atoms  of  hydrogen,  H,  and  one  of  oxygen,  O.  The  Earth’s  transparent 
atmosphere  consists  mainly  of  molecules  of  oxygen,  02  (21  %),  and  nitrogen,  N2 
(78  %),  with  trace  amounts  of  carbon  dioxide,  C02,  and  water  vapor.  Methane, 
CH4,  the  natural  gas  used  in  a  stove,  consists  of  one  atom  of  carbon,  C,  and  four  of 
hydrogen,  H.  Organic  molecules  contain  more  complex  combinations  of  carbon, 
hydrogen,  and  other  atoms. 

But  the  atom  is  not  indivisible  after  all.  Elemental  atoms  can  be  broken  into 
smaller  subatomic  pieces.  The  first  subatomic  particle  was  found  through  inves¬ 
tigations  of  electricity.  The  English  physicist  Michael  Faraday  (1791-1867)  dis¬ 
covered  that  the  electrical  charge  carried  by  different  atoms  is  always  an  integer 
multiple  of  a  basic  amount,  an  atomicity  of  electrical  charge  -  the  electron 
(Faraday  1839,  1844).  The  concept  of  such  an  indivisible  quantity  of  charge  was 
proposed  to  explain  the  chemical  properties  of  atoms. 

It  was  in  1894  that  the  Irish  physicist  George  Johnstone  Stoney  (1826-1911) 
coined  the  word  electron  to  describe  the  fundamental  unit  of  electricity  (Stoney 
1881,  1894),  and  electricity  is  indeed  transferred  by  the  flow  of  electrons.  Inter¬ 
actions  between  electrons  hold  the  atoms  in  a  molecule  together  in  a  chemical 
bond.  Similar  but  much  weaker  interactions  among  electrons  hold  many  molecules 
together. 

The  English  physicist  Joseph  John  Thomson  (1856-1940)  and  his  colleagues 
identified  the  electron  as  a  particle  and  determined  its  charge-to-mass  ratio 
(Thomson  1897a,  b).  Thomson  was  studying  cathode  rays,  which  carry  electrons 
between  electrodes  in  a  tube  of  gas;  he  showed  that  the  electrons  are  deflected 
when  either  an  external  magnetic  field  or  an  electric  field  is  applied,  which  meant 
that  they  are  electrically  charged.  Using  these  curved  trajectories,  Thomson 
showed  that  electrons  are  very  light,  roughly  1/ 1,000th  of  the  mass  of  the  least 

o  o 

massive  atom,  hydrogen.  It  would  take  30  billion,  billion,  billion,  or  3  x  10~  , 
electrons  to  make  a  total  mass  of  just  1  oz,  or  28  g. 

Thomson  received  the  1906  Nobel  Prize  in  Physics  for  his  investigations  of  the 
conduction  of  electricity  by  gases.  A  few  years  later,  the  American  physicist 
Robert  A.  Millikan  (1868-1953)  determined  the  elementary  charge  of  the  electron 
by  measuring  the  electrical  force  on  charged  droplets  of  oil  suspended  against 
gravity  between  two  metal  electrodes  (Millikan  1910,  1913).  He  was  awarded  the 
1923  Nobel  Prize  in  Physics  for  this  and  related  work. 

The  electron,  which  carries  a  negative  electrical  charge  and  has  no  known 
components  or  substructure,  is  believed  to  be  a  truly  elementary  particle.  The 
elementary  charge  of  the  electron,  which  is  denoted  by  e ,  has  a  value  of 
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e  =  1.602  x  10  19  C.  The  negative  electric  charge  of  the  electron  has  a  value  of 
—e,  but  no  one  knows  why  it  has  the  mass  and  charge  it  does.  When  the  electrical 
current  in  a  house  is  turned  on  to  light  a  lamp,  about  1  million  trillion,  or  10  , 
electrons  flow  through  the  wires  every  second. 

In  the  early  20th  century,  the  New  Zealand-born  British  physicist  Ernest 
Rutherford  (1871-1937)  and  his  colleagues  showed  that  radioactivity  is  produced 
by  the  disintegration  of  atoms,  and  they  discovered  that  radioactive  material  emits 
energetic  subatomic  particles.  When  bombarding  gold  leaf  with  beams  of  these 
particles,  they  found  to  their  astonishment  that  about  1  in  20,000  particles  bounced 
right  back  from  where  it  originated,  whereas  all  of  the  others  passed  through  the 
gold.  This  meant  that  atoms  are  largely  empty  space  and  that  most  of  the  mass  of 
an  atom  is  concentrated  in  a  nucleus  that  is  100,000  times  smaller  than  an  atom 
(Rutherford  1911,  1914).  The  nucleus  of  an  atom  contains  less  than  10-15  of  the 
atomic  volume,  but  it  includes  almost  all  the  atom’s  mass. 

Within  a  decade,  Rutherford  was  able  to  show  that  the  nucleus  of  different 
atoms  contains  various  amounts  of  the  nucleus  of  the  simplest  atom,  hydrogen.  He 
named  this  nuclear  building  block  a  proton ,  from  the  Greek  for  “first”,  since  it  was 
the  first  nuclear  particle  to  be  discovered. 

A  proton  is  positively  charged,  with  a  charge  of  +e,  equal  in  amount  to  that  of 
an  electron  but  opposite  in  charge.  It  is  positive.  Now,  particles  with  an  opposite 
sign  to  their  electric  charge  attract  one  another.  So  electrons  and  protons  are 
always  attracted  to  each  other.  Negatively  charged  electrons  surround  positively 
charged  protons  in  an  atom,  and  the  total  positive  charge  of  the  protons  is  equal  to 
the  total  negative  charge  of  the  electrons.  An  atom  has  no  net  electrical  charge  and 
it  is  electrically  isolated  from  external  space. 

Particles  with  the  same  electrical  charge  are  driven  apart  by  an  electrical 
repulsion.  Rutherford  therefore  postulated  the  existence  of  an  uncharged  nuclear 
particle,  later  called  the  neutron ,  to  help  hold  protons  together  in  the  atomic 
nucleus  and  prevent  the  protons  from  dispersing  as  they  repelled  each  other.  After 
an  eleven-year  search,  the  English  physicist  James  Chadwick  (1891-1974)  dis¬ 
covered  the  neutron  (Chadwick  1932a,  b). 

Protons  and  neutrons  collectively  are  named  nucleons ,  because  they  are  the  two 
constituents  of  the  atomic  nucleus.  They  consist  of  yet  smaller  components,  known 
as  quarks.  So,  the  proton  and  the  neutron  are  not  truly  elementary  particles. 
Nevertheless,  the  nuclear  fusion  reactions  that  make  stars  shine  can  be  understood 
by  assuming  that  all  atomic  nuclei  are  composed  of  protons  and  neutrons. 

These  nucleons  are  bound  together  in  an  atomic  nucleus  by  an  exceptionally 
strong  force,  the  nuclear  force  or  nucleon-nucleon  force,  which  allows  them  to 
cling  tightly  to  one  another  and  build  the  dense,  compact  atomic  nucleus.  Although 
powerful,  this  attractive  force  has  a  short  range,  operating  over  very  limited  dis¬ 
tances.  The  strong  force  decreases  to  insignificance  at  distances  greater  than  about 
1  million  billionths,  or  10-15,  of  a  meter,  and  closes  the  nucleus  in  at  an  atom’s 
center.  The  nucleons  cannot  be  pushed  any  closer  together,  and  this  sets  the 
physical  size  of  the  nucleus. 
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Therefore,  an  atom  is  largely  empty  space,  like  the  room  in  which  we  are 
sitting.  A  tiny,  heavy,  positively  charged  nucleus  lies  at  the  heart  of  an  atom, 
surrounded  by  a  cloud  of  relatively  minute,  negatively  charged  electrons  that 
define  most  of  an  atom’s  size  and  govern  its  chemical  behavior. 

The  radius  of  an  atom  is  approximately  10“ 10  m,  and  it  consists  of  a  swarm  of 
electrons  orbiting  a  nucleus,  whose  radius  is  about  10“ 15  m  for  the  nucleus  of  the 
hydrogen  atom  and  about  10“ 14  m  for  the  nuclei  of  the  heaviest  atoms,  such  as 
uranium.  The  fermi  unit  of  length,  where  1  fermi  =  1  fm  =  10“ 15  m,  is  used  in 
nuclear  physics.  The  diameter  of  the  proton  is  1.75  fm,  and  its  radius  is  0.875  fm. 

A  proton  and  a  neutron  have  about  the  same  mass,  which  is  nearly  2,000  times 
that  of  an  electron.  To  be  exact,  the  mass  of  the  proton  and  the  mass  of  the  neutron 
are  respectively  1,836  and  1,839  times  the  mass  of  an  electron. 

The  mass  of  an  atomic  nucleus  is  always  less  than  the  sum  of  the  masses  of  its 
protons  and  neutrons  because  they  have  expended  energy  to  bind  themselves 
together.  It  is  this  binding  energy  that  is  released  in  nuclear  reactions. 

What  holds  solid  objects  together?  Why  doesn’t  a  chair  fall  apart  when  the  wind 
blows  on  it?  All  durable  material  objects  that  surround  us  consist  of  atoms  and, 
given  the  emptiness  of  an  atom,  we  might  wonder  why  we  cannot  easily  crush 
them  into  smaller  entities.  The  answer  is  that  when  we  push  any  two  pieces  of 
material  together  the  forces  of  electrical  repulsion  between  the  atomic  electrons  in 
their  adjacent  surfaces  resist  the  pressure. 

Although  the  kilogram  is  a  useful  unit  of  mass  for  describing  large  objects,  the 
mass  of  an  atom  and  the  mass  of  its  nucleus,  which  contains  nearly  all  of  the 
atom’s  mass,  are  conveniently  measured  in  atomic  mass  units,  abbreviated  a.m.u. 
or  u,  where 

u  =  1  a.m.u.  =  1.660539  x  10_27kg,  (5.1) 

which  is  equal  to  one  twelfth  of  the  mass  of  the  carbon- 12  atom,  denoted  C.  The 
lightest  atom  is  hydrogen  with  an  atomic  weight  of  1.007825  u;  the  heaviest  stable 
atom  is  lead  208  with  an  atomic  weight  of  207.97665  u.  Unstable  radioactive 
atoms  like  uranium  235  are  a  bit  heavier. 

Nuclear  astrophysicists  like  to  use  energy,  E ,  to  express  mass,  m ,  through  the 
E  =  me  relation,  where  c  is  the  speed  of  light,  and  the  convenient  unit  of  nuclear 
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energy  is  the  MeV,  where  1  MeV  =  1.602176  x  10  J,  which  is  also  equal  to 
1,000  keV;  note  that  the  conversion  factor  1.602176  is  the  elementary  charge  of 
the  electron. 

Characteristics  of  these  atomic  and  nuclear  particles  are  given  in  Table  5.1. 

The  simplest  and  lightest  atom  consists  of  a  single  electron  circling  around  a 
nucleus  composed  of  a  single  proton  without  any  neutrons.  This  is  an  atom  of 
hydrogen.  Most  of  the  universe,  and  the  majority  of  the  stars,  is  composed  mainly 
of  hydrogen.  The  nucleus  of  helium,  the  next  most  abundant  atom  in  the  cosmos, 
contains  two  neutrons  and  two  protons;  so  it  naturally  has  two  electrons  that 
balance  the  electrical  charge  of  the  protons  (Fig.  5.1). 

So  an  atom  is  composed  of  a  dense  and  massive  nucleus  containing  protons  and 
neutrons,  and  surrounded  by  electrons.  As  in  the  example  of  helium,  electrically 
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Table  5.1  Physical  properties  of  electrons,  protons,  neutrons,  and  atomsa 
Electron 

me  =  mass  of  electron  =  5.4858  x  10-4  u  =  9.10938  x  10-31  kg  =  0.5109989  MeV/c2 

e  =  elementary  charge  =  1.6022  x  10-19  C;  electron  charge  =  —  e 

re  =  classical  electron  radius  =  2.8179  x  1CT15  m 

oy  =  Thomson  cross  section  'iri~  =  6.65246  x  10-29  m2 

Atomic  nucleus  ( protons ,  neutrons ,  alpha  particles ) 

mP  =  mass  of  proton  =  1.007276466  u  =  1.6726218  x  10“27  kg  =  938.27203  MeV/c2 
e  =  charge  of  proton  =  +1.602  x  10-19  C 

mn  =  mass  of  neutron  =  1.008664916  u  =  1.6749274  x  10-27  kg  =  939.56536  MeV/c2 
my  =  mass  of  alpha  particle  =  mass  of  helium 

nucleus  =  4.001506179  u  =  6.6446567  x  10-27  kg  =  3727.37924  MeV/c2 
Z  =  total  number  of  protons  in  nucleus  =  atomic  number 
A  =  total  number  of  protons  and  neutrons  in  nucleus  =  atomic  mass  number 
R  =  nuclear  radius  =  r()A 1  ^  for  an  atom  with  mass  number  A  and  r0  =  1.25  x  10-15  m 
Atom 

a0  =  Bohr  radius  =  5.2918  x  10_u  m 

u  =  atomic  mass  unit  =  1.66053886  x  10-27  kg  =  931.494061  MeV/c2 

mH  —  mass  of  hydrogen  atom  =  1.007825  u  =  1.6739326  kg 

mHe  =  mass  of  helium  atom  =  4.002602  u  =  6.646476  x  10-27  kg 

A mHe  =  mass  defect  of  helium  atom  =  mHe  —  2mP  —  2 mn  —  2 me  =  0.030378  u 

_  r\  i-i 

c  The  mass,  m,  values  are  given  in  atomic  mass  units  u  =  1.660539  x  10“  kg,  in  kilograms  or 
kg,  and  as  the  rest  mass  energy  divided  by  the  square  of  the  speed  of  light 
c  =  2.9979  x  108  m  s-1;  this  energy  is  given  in  units  of  MeV  =  1.60217646  x  10-14  J 


Helium  Atom 


Electrons 

- 100,000  fm - H 

Fig.  5.1  The  helium  atom  An  atom  of  helium  contains  two  electrons  that  swarm  about  the 
atom’s  nuclear  center  in  a  cloud  of  largely  empty  space.  The  shading  shows  that  the  electrons  can 
be  anywhere  but  are  most  likely  to  be  found  near  the  center  of  the  atom.  The  magnified  nucleus  of 
the  helium  atom  consists  of  two  protons  and  two  neutrons  bound  together  by  a  strong  nuclear 
force.  The  nucleus  and  each  of  its  four  particles  are  spherically  symmetrical.  The  size  of  the 
helium  nucleus  is  about  1  fermi,  or  1  fm,  which  is  equivalent  to  10-15  m.  The  atom  is  about 
100,000  times  bigger  than  the  nucleus,  with  an  atom  size  of  about  105  fm,  or  10_1()  m.  (From 
“The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press, 
2013.  Reprinted  with  permission.) 
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neutral  atoms  have  as  many  electrons  as  there  are  protons  and  are  therefore 
without  net  charge.  An  ionized  atom  has  fewer  electrons  than  protons,  and 
therefore  has  a  positive  charge.  The  atomic  number,  designated  by  the  symbol  Z,  is 
equal  to  the  number  of  protons  in  an  atom’s  nucleus.  Hydrogen  has  an  atomic 
number  of  1,  helium  2,  carbon  6,  nitrogen  7,  oxygen  8,  lead  82,  and  uranium  92. 
The  atomic  mass  number,  designated  A,  is  equal  to  the  total  number  of  protons  and 
neutrons  in  the  nucleus.  The  radius,  R ,  of  the  nucleus  of  an  atom  of  atomic  mass 
number  A  is  R  =  tqA1/3,  where  r0  =  1.25  x  10“ 15  m  =  1.25  fermi  =  1.25  fm, 
so  a  heavier  atom  has  a  bigger,  and  more  massive  nucleus. 

The  atoms  of  a  particular  element  all  have  the  same  number  of  protons  in  their 
nucleus,  but  the  number  of  neutrons  can  vary,  giving  rise  to  different  isotopes  of 
the  same  element.  Elements  of  atomic  number  83-94  are  composed  entirely  of 
radioactive  isotopes  that  are  unstable  and  are  known  to  decay  into  other  elements. 


5.2  Heat,  Temperature,  and  Speed 
5.2.1  Where  Does  Heat  Come  From? 

Heat  is  a  form  of  energy  caused  by  the  motion  of  tiny  unseen  particles,  such  as  the 
molecules  in  a  gas,  which  are  in  a  state  of  ceaseless  motion.  These  particles  move 
randomly  in  all  directions  and  do  not  contribute  to  the  overall  motion  of  the  gas  in 
which  they  reside.  The  faster  they  move,  the  hotter  the  body. 

The  energy  of  motion  is  called  kinetic  energy,  after  the  Greek  word  kinesis 
meaning  “motion”  -  the  word  cinema  has  the  same  root,  referring  to  motion 
pictures.  In  a  star,  individual  gas  particles  are  changing  direction  constantly  in  an 
irregular  zigzag  trajectory,  and  the  heat  can  be  enormous.  However,  all  of  the 
particles  in  a  star  move  together  in  the  same  overall  direction,  and  they  are 
responsible  for  the  bulk  kinetic  energy  of  star  motion  (Fig.  5.2). 

All  gas  molecules  are  always  moving,  and  the  hotter  they  become  the  faster 
they  move  and  the  greater  their  kinetic  energy.  The  lowest  possible  temperature  is 
absolute  zero,  or  zero  on  the  Kelvin  scale  denoted  K  (Kelvin  1848).  At  absolute 
zero,  molecules  cease  to  move  and  are  completely  at  rest;  they  have  no  kinetic 
energy.  At  this  temperature,  the  constituent  particles  stick  together  and  behave  as  a 
frozen  solid,  resembling  ice. 

Raise  the  temperature  above  absolute  zero  and  molecules  move  about  and 
collide,  turning  ice  into  liquid  water.  Further  increase  the  temperature  and  mole¬ 
cules  move  so  fast  that  they  overcome  the  cohesive  forces  that  bind  them  together. 
Gas  is  formed,  capable  of  nearly  unlimited  expansion  in  all  directions  as  when 
water  evaporates. 

So,  by  adding  more  heat  we  can  transform  a  solid  into  a  liquid  and  then  a  gas, 
and  we  can  reverse  the  process  by  removing  heat  and  lowering  the  temperature. 
All  our  familiar  objects  exist  in  one  of  these  three  fundamental  states  -  the  solid, 
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Random  Motion 


Ordered  Motion 


Fig.  5.2  Random  and  ordered  motion  Particles  within  a  hot  gas  (left)  move  here  and  there  in 
random  directions  that  continually  change  as  the  result  of  collisions  between  particles.  This 
supports  the  gas  against  inward  gravitational  forces.  A  planet  or  star  (right)  moves  along  a  well- 
defined,  ordered  trajectory  determined  by  external  gravitational  forces  on  it.  When  a  large 
number  of  stars  has  gathered  together  and  is  confined  within  a  star  cluster,  the  stars  also  move  in 
random  directions,  supporting  their  combined  gravitational  pull.  (From  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


liquid  and  gaseous  ones.  At  higher  temperatures  within  cosmic  objects,  there  is  a 
fourth  state  of  matter,  known  as  plasma,  in  which  the  atoms  are  tom  into  their 
subatomic  ingredients.  The  range  of  temperatures  found  in  the  universe  is  illus¬ 
trated  in  Table  5.2. 

The  German  physician  Jules  Robert  Mayer  (1814-1878)  reasoned  that  heat  is  a 
form  of  energy  (Mayer  1842)  -  generally  called  “force”  in  his  time  and  related  to 
the  motivating  force  of  fire  (Carnot  1824).  Mayer  found  that  heat  energy  can 
change  form,  and  this  had  a  crucial  role  in  the  discovery  of  the  conservation  of 
energy.  Heat  energy  can  be  produced  by  or  transferred  into  another  type  of  energy, 
but  the  energy  never  disappears.  The  total  energy  is  conserved  (Mayer  1842; 
Helmholtz  1847). 

The  English  physicist  James  Prescott  Joule  (1818-1889)  soon  provided 
experimental  verification  of  the  conservation  of  energy  for  particular  cases,  in  a 


Table  5.2  Range  of  cosmic  temperatures 


Location 

Temperature  (K) 

Absolute  zero 

0 

Cosmic  microwave  background  radiation 

3 

Water’s  freezing  (triple)  point 

273 

Water’s  boiling  point 

373 

Incandescent  light  bulb 

2,500 

Visible  solar  disk 

5,780 

Center  of  Sun 

1.5  x  107 

Atom  bomb 

3.5  x  108 

CERN  particle  accelerator  (proton-proton  collision) 

1013 

Big  bang  (at  10-44  s) 

1032 
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lecture  titled  “On  Matter,  Living  Force,  and  Heat”  (Joule  1847,  1850).  Today,  we 
identify  Joule’s  “living  force”  with  kinetic  energy.  The  joule  unit  of  energy, 
abbreviated  J,  is  appropriately  named  after  him. 

The  German  physician  and  physicist  Hermann  von  Helmholtz  (1821-1894) 
provided  the  connection  between  the  conservation  of  energy  and  the  kinetic  energy 
of  motion,  suggesting  that  the  Sun  could  gain  heat  from  its  gravitational  energy  by 
contracting  (Helmholtz  1847,  1856,  1908).  This  mechanism  would  keep  the  Sun 
shining  for  tens  of  million  of  years,  but  it  was  eventually  realized  that  the  Earth 
and  Sun  are  4.6  billion  years  old,  and  another  source  of  energy,  due  to  nuclear 
reactions,  was  required  to  conserve  energy  in  the  Sun  for  such  long  times. 


5.2.2  Thermal  Velocity 


If  we  bring  a  hot  body  into  contact  with  a  colder  one,  the  fast-moving  particles  in 
the  hot  body  collide  at  the  boundary  with  the  slower-moving  particles  of  the  colder 
body.  This  transfers  to  them  a  part  of  the  kinetic  energy.  The  fast-moving  particles 
gradually  slow  down  and  the  slow  ones  accelerate,  averaging  out  the  temperature 
differences  until  a  state  of  thermal  equilibrium  is  obtained,  and  then  a  single 
temperature  characterizes  the  situation. 

The  portion  of  an  object’s  internal  energy  that  is  responsible  for  its  temperature 
is  called  thermal  energy,  and  for  an  equilibrium  temperature,  denoted  by  T,  the 
average  thermal  energy  given  by: 


Thermal  energy  =  -  kT. 


(5.2) 
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where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  .  This  constant  is 
named  after  the  Austrian  physicist  Ludwig  Boltzmann  (1844-1906),  whose  doc¬ 
toral  thesis  pioneered  the  kinetic  theory  of  gases  (Boltzmann  1868). 

A  particle  is  said  to  move  at  the  thermal  velocity  when  its  kinetic  energy  is 
equal  to  its  thermal  energy.  For  a  molecule  or  particle  of  mass  m  and  velocity  V  the 
kinetic  energy  of  motion  is: 


Kinetic  energy 


1  9 

-mV1 . 

2 


When  this  is  set  equal  to  the  thermal  energy,  we  obtain  the  thermal  velocity, 
V thermal,  of  a  particle  of  mass,  m,  at  temperature,  T: 


V thermal  — 


3kT 

"3  kT 

/  m 

m 

1  1/2 


The  magnitude  of  the  thermal  velocity,  which  is  the  thermal  speed,  increases 
with  the  temperature  of  the  gas  and  decreases  with  increasing  particle  mass.  Hotter 
particles  move  faster  and  more  massive  ones  move  slower. 
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Example:  Thermal  velocity  in  our  atmosphere  and  in  the  Sun’s  visible 
disk 

At  sea  level  the  temperature  T  is  about  288  K,  and  our  air  is  predominantly 
composed  of  nitrogen  molecules  of  mass  m  =  2xl4xu,  where  the 
atomic  mass  unit  u  =  1.66054  x  10  “  kg.  Substituting  these  numbers  into 
our  expression  for  thermal  velocity,  V thermal  =  (3 kT/m)1^2,  where  the 

_ oo  _ 1 

Boltzmann  constant  k  =  1.381  x  10  J  K  ,  we  obtain  a  thermal  speed, 
the  magnitude  of  the  thermal  velocity,  of  about  507  ms-1.  For  the  visible 
solar  disk,  the  temperature  T  is  5,780  K  and  the  abundant  hydrogen  atoms  of 
mass  mH  =  1.67  x  10  ^  kg,  have  a  thermal  speed  of  about  12,000  m  s  . 
They  move  faster  because  the  hydrogen  atoms  on  the  Sun’s  disk  are  both 
hotter  and  less  massive  than  the  nitrogen  molecules  in  our  air. 

Protons  and  electrons  are  perpetually  steaming  away  from  the  Sun  in  the 
solar  wind.  How  hot  would  they  have  to  be  to  escape  from  the  Sun’s 
gravitational  pull?  As  we  saw  in  Sect.  4.1,  the  escape  speed  of  the  Sun  at  its 

visible  disk  is  V  =  VescQ  =  (2 GMq/Rq)1^2  =  6.18  x  105ms_1.  If  that 
speed  was  equal  to  the  thermal  velocity  V thermal  =  (3 kT/m)1^2,  then  the 
temperature  T  =  mV  /(3k),  where  the  Boltzmann  constant  k  =  1.381  x 

O  Q  1 

10“"  JK_1  and  m  is  the  mass  of  the  particle.  For  a  proton  of  mass 
mp  =  1.672  x  10-27  kg,  the  temperature  is  TP  =  1.54  x  107  K,  and  for  the 

_ O  1 

electron  of  mass  me  =  9.109  x  10  kg,  the  temperature  is 
Te  =  8.4  x  10  K.  The  outer  solar  atmosphere,  the  corona,  which  lies  just 
above  the  visible  solar  disk,  has  a  million-degree  temperature,  so  it  is  hot 
enough  for  the  electrons  to  escape  from  the  Sun,  and  nearly  hot  enough  for 
the  protons  to  do  so.  Since  the  escape  velocity  falls  off  with  increasing 
distance  from  the  Sun,  this  also  becomes  possible. 

For  the  Earth,  we  have  an  escape  velocity  of  V  =  VescE  = 
C IGMe/Re)11 2  =1.12  x  104  m  s  '.  Hydrogen  atoms  do  not  remain  in  the 
Earth’s  atmosphere,  and  since  their  mass  is  approximately  the  same  as  that 
of  their  nuclear  proton,  the  temperature  required  for  this  escape  is 
T  =  mV  /(3k)  x  5  x  lCf  K.  The  mean  surface  temperature  of  the  Earth  is 
about  281  K,  not  hot  enough  for  hydrogen  to  escape  immediately  from  the 
surface.  However,  the  temperature  of  the  Earth’s  atmosphere  increases  at 
higher  levels  to  more  than  104  K,  in  its  ionosphere,  permitting  hydrogen  to 
leak  off  the  planet  by  thermal  evaporation,  which  has  been  detected  from 
satellite  observations  as  a  geocorona. 


The  equilibrium  temperature  is  sometimes  called  the  kinetic  temperature , 
denoted  by  TK,  and  it  is  given  by: 
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Kinetic  temperature  =  TK  =  — — — . 

^  iC 


5.2.3  Collisions 


The  numerous  particles  in  a  gas  are  always  colliding  with  each  other,  and  another 
German  scientist,  Rudolf  Clausius  (1822-1888),  introduced  the  concept  of  the 
mean  free  path  between  collisions.  He  asked  how  far,  on  average,  can  a  molecule 
move  before  it  comes  under  the  sphere  of  action  of  another  molecule.  When  the 
number  of  particles  per  unit  volume  increases,  for  example,  the  mean  distance 
between  collisions  will  decrease. 

The  average  distance  covered  by  a  moving  particle  between  successive  colli¬ 
sions  is  called  the  mean  free  path ,  denoted  /.  It  is  given  by  (Clausius  1858): 


where  N  is  the  number  density  of  gas  particles  and  the  cross  sectional  area  a  = 
for  collision  radius  r0.  For  an  atom  we  might  take  r0  =  a0  =  5.298  x  10  11  m, 
the  Bohr  radius  to  obtain  /  «  10  N  m.  A  molecule  would  be  somewhat  larger, 
depending  on  the  number  of  atoms  it  contains. 

For  a  gas  at  temperature  T  and  pressure  P, 
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where  the  Boltzmann  constant  k  =  1.38065  x  10  “  J  K  .In  our  atmosphere  at 
sea  level,  with  a  temperature  T  =  288  K  and  a  pressure  P  =  105  Pa,  the  mean  free 
path  between  collisions  of  nitrogen  molecules  with  radius  r0  k  2  x  10-10  m,  is 
about  Ik  3  x  10  m,  and  each  air  molecule  is  subjected  to  many  billions  of 
collisions  every  second. 

The  approximate  mean  time,  t,  between  collisions  is  given  by  the  ratio  of 
/  divided  by  the  thermal  velocity,  or: 


m  \  l/2 


(—) 

\3kTJ 


where  /  is  the  mean  free  path,  m  is  the  particle  mass,  and  k  is  the  Boltzmann 
constant. 
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Fig.  5.3  Maxwell  distribution  of  particle  speeds  The  speeds  of  particles  with  the  same  mass 
and  three  different  temperatures.  The  peak  of  this  distribution  shifts  to  higher  speeds  at  higher 
temperatures.  There  is  a  small  fraction  of  particles  having  high  speed,  residing  in  the  high-speed 
tail  of  the  distribution,  and  this  fraction  increases  with  temperature.  The  fraction  of  particles  with 
low  speed  becomes  smaller  at  higher  temperatures  but  does  not  vanish.  The  peak  also  shifts  to 
higher  speeds  at  lower  mass  when  the  temperature  is  unchanged.  The  Scottish  scientist  James 
Clerk  Maxwell  (1831-1879)  derived  this  distribution  in  1873.  (From  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


5.2.4  The  Distribution  of  Speeds 

Also  in  the  mid-nineteenth  century,  the  Scottish  physicist  James  Clerk  Maxwell 
(1831-1879)  introduced  a  statistical  approach  to  the  kinetic  theory  of  gases,  which 
recognizes  that  every  gas  particle  has  a  different  speed  and  that  each  collision 
between  particles  changes  the  speeds  of  those  particles.  He  proposed  that  the 
numerous  collisions  between  the  large  numbers  of  molecules  in  a  gas  produce  a 
statistical  distribution  of  speeds,  in  which  all  of  the  speeds  might  occur  with  a 
different  and  known  probability.  This  is  called  the  Maxwell  speed  distribution. 

In  thermal  equilibrium,  the  average  value  of  the  kinetic  energy  of  particles  in  a 
gas  might  be  distributed  equally  among  all  of  the  particles,  but  this  equality  is  only 
statistically  true.  Most  particles  move  with  the  same  average  speed,  but  not  all  of 
them.  Some  are  faster  than  average  and  others  slower.  Gas  particles  can  gain  or 
lose  speed  by  collisions  with  one  another,  so  they  do  not  all  move  at  the  same 
average  speed.  In  any  given  instant,  the  speed  and  kinetic  energy  of  most  of  the 
particles  are  close  to  the  average  value,  but  there  is  always  a  small  percentage  that 
moves  faster  or  slower  than  the  average. 

The  Maxwell  speed  distribution,  illustrated  in  Fig.  5.3,  gives  the  fraction  of  gas 
molecules,  or  other  particles,  moving  at  a  particular  speed  (the  magnitude  of  the 
velocity  vector)  at  any  given  temperature  and  mass  (Maxwell  1860).  The  most 
probable  speed  is  close  to  the  average  in  thermal  equilibrium;  therefore,  the  most 
likely  speed  for  any  given  type  of  particle  increases  with  its  temperature.  However, 
there  is  a  range  of  speeds,  both  higher  and  lower  than  the  average  value,  and  this 
range  also  increases  with  the  temperature.  In  other  words,  the  Maxwell  distribution 
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function  becomes  broader  and  its  peak  shifts  to  higher  speeds  when  the  temper¬ 
ature  rises. 

Although  this  distribution  function  appears  to  be  symmetrical,  it  has  a  high¬ 
speed  tail.  Particles  in  the  high-speed  tail  have  greater  kinetic  energy  than  other 
particles  in  the  distribution,  and  they  have  an  important  role  in  the  nuclear-fusion 
reactions  that  make  the  Sun  and  other  stars  shine. 

The  Maxwell  speed  distribution  applies  to  all  types  of  particles  in  thermal 
equilibrium,  as  long  as  there  are  many  of  them.  In  addition  to  atoms  or  molecules, 
it  can  be  used  to  describe  the  speeds  of  numerous  subatomic  particles  inside  the 
Sun,  which  have  been  freed  from  their  atomic  bonds  at  very  high  temperatures. 
The  distribution  also  describes  the  speeds  of  millions  of  stars  that  are  collected 
together  in  star  clusters,  in  which  the  stellar  motions  are  analogous  to  those  of  gas 
particles. 

The  number  of  particles,  N(V)dV ,  with  speeds  between  V  and  V  +  dV  is  given  by: 

N(V)dV  =  Ntotf(V)dV  (5.9) 

where  Ntot  is  the  total  number  of  particles  of  a  given  mass  in  the  system.  The 
Maxwell  speed  probability  density  distribution, /(V),  is  given  by  (Maxwell  1860): 


or  equivalently 
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where  T  is  the  temperature,  m  is  the  mass  of  the  molecule,  or  particle,  under 
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consideration,  and  the  Boltzmann  constant  k  =  1.38065  x  10  JKA  This 
function  has  three  parts,  a  constant  term,  a  V  term,  and  an  exponential  term. 

The  most  probable  speed,  VP ,  is  obtained  by  setting  the  differential  df  (V)/dV 
equal  to  zero  and  solving  for  V,  which  yields 


(5.12) 


The  most  probable  speed  is  the  speed  associated  with  the  highpoint  in  the 
Maxwell  distribution,  and  it  is  the  speed  most  likely  to  be  possessed  by  any 
particle  of  mass  m  at  temperature  T.  For  the  diatomic  nitrogen  molecule  at  sea 
level  temperature  of  288  K,  we  have  a  most  probable  speed  of  VP  =  414  m  s_1, 
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since  the  mass  m  =  2  x  14  x  u  for  atomic  mass  unit  u  =  1.66054  x  io-^  kg. 

Only  a  small  fraction  of  the  particles  have  the  most  probable  speed,  since  there 
is  also  a  range  of  speeds,  both  higher  and  lower  than  the  most  probable  one,  and 
this  range  also  increases  with  the  temperature.  In  other  words,  the  Maxwell  dis¬ 
tribution  function  becomes  broader  and  its  peak  shifts  to  higher  speeds  when  the 
temperature  rises. 
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A  measure  of  the  range  of  possible  speeds,  or  the  spread  of  the  Maxwell 
distribution,  is  the  full  width  to  half  maximum,  AV,  of  the  exponential  term,  which 
is  a  Gaussian  function  of  the  form  exp  [— V  /(2<r )]  of  standard  deviation  a.  Such  a 
function  has  AV  =  2.355  cr,  and  for  the  Maxwell  distribution  we  have: 

AV  «  2355(kT/m)l/2^  l.66Vp.  (5.13) 

This  is  only  approximate  since  there  is  an  enhancement  at  high  velocities  owing 
to  the  V  term  in  the  Maxwell  distribution,  but  the  main  point  is  that  both  the  most 
probable  velocity  and  the  spread  in  the  Maxwell  distribution  increase  with 
temperature. 

The  most  probable  speed  is  very  close  to  the  average  speed  in  thermal  equilib- 
rium,  since  VP  =  (2/3)1/2  Vthermai  =  0.816  Vthermal.  The  mean  speed  <V  >  is  the 
mathematical  average,  denoted  by  <  >  ,  of  the  speed  distribution.  It  is  given  by: 


<V  > 


f  Vf{V)dV 
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<y  > 


(5.14) 
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The  root  mean  square  speed,  Vrms,  is  the  square  root  of  the  average  squared 
speed 


or 


(5.16) 
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The  Maxwell  speed  distribution  can  be  expressed  as  an  energy  distribution  for 
the  kinetic  energy  E  =  mV  / 2.  The  number  N(E)  of  gas  particles  with  a  kinetic 
energy  between  E  and  E  +  dE  is  given  by: 


N(E)dE  =  2N,ot  3  E$  exp 
7i2(kT)2 


dE  =  Ntotf(E)dE. 


(5.18) 


The  total  internal  energy,  U,  of  a  total  of  Ntot  particles  in  thermal  equilibrium  is 
given  by  (Maxwell  1860;  Boltzmann  1868): 
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Table  5.3  Atmospheres  of  Venus,  Mars,  and  Eartha 


Venus 

Mars 

Earth 

Constituent 

Carbon  dioxide,  C02 

96 

95 

0.038 

Nitrogen,  N2 

3.5 

2.7 

78 

Argon,  Ar 

0.007 

1.6 

0.93 

Water  vapor,  H20 

0.010 

0.03  (variable) 

1  (variable) 

Oxygen,  02 

0.003 

0.13 

21 

Surface  pressure  (bar) 

92 

0.007-0.010 

1.0  (at  sea  level) 

Surface  temperature  (K) 

735 

183-268 

184-330 

Mean  210 

Mean  287.2 

a  Percentage  composition  of  atmospheric  constituent 

Table  5.4  Atmospheres  of  the  giant  planets  and  the  Suna 

Constituent 

Sun 

Jupiter 

Saturn 

Uranus 

i  Neptune 

Hydrogen,  H2 

84 

86.4 

97 

83 

79 

Helium,  He  (atom) 

16 

13.6 

3 

15 

18 

Water,  H20 

0.15 

Methane,  CH4 

0.07 

0.21 

0.2 

2 

3 

Ammonia,  NH3 

0.02 

0.07 

0.03 

a  The  percentage  abundance  by  number  of  molecules  for  the  Sun,  cooled  to  planetary  temper¬ 
atures  so  that  the  elements  combine  to  form  the  compounds  listed,  and  for  the  outer  atmospheres 
of  the  giant  planets  below  the  clouds.  Blanks  indicate  unobserved  compounds.  (Courtesy  of 
Andrew  P.  Ingersoll.) 


5.3  Molecules  in  Planetary  Atmospheres 

The  molecular  ingredients  of  an  atmosphere  can  be  determined  by  observing  the 
unique  spectral  signatures  of  different  molecules.  The  atmospheres  of  the  eight 
major  planets  in  our  solar  system  are  mainly  composed  of  molecules  of  the  cos- 
mically  abundant  atoms  -  hydrogen,  H,  carbon,  C,  oxygen,  O,  and  nitrogen,  N,  but 
in  various  percentages  given  in  Tables  5.3  and  5.4.  The  main  ingredient  of  the 
atmospheres  of  Venus  and  Mars  is  carbon  dioxide,  abbreviated  C02.  Our 
breathable  air  contains  21  %  oxygen  molecules,  denoted  02,  and  78  %  nitrogen 
molecules,  abbreviated  N2.  The  atmospheres  of  the  giant  planets  are  mainly 
composed  of  molecular  hydrogen,  denoted  H2. 

As  suggested  by  George  Johnstone  Stoney  (1826-1911),  the  ability  of  a  planet 
or  satellite  to  retain  an  atmosphere  depends  on  both  the  temperature  of  that 
atmosphere  and  the  gravitational  pull  of  the  planet  or  satellite  (Stoney  1898,  1900). 
An  atmosphere  is  held  near  a  planet  by  its  gravity,  but  since  gas  has  a  natural 
tendency  to  expand  into  space,  only  planets  with  a  sufficiently  strong  gravitational 
pull  can  retain  an  atmosphere  of  a  given  composition.  The  ability  of  a  planet  to 
retain  an  atmosphere  also  depends  on  its  temperature,  determined  by  both  the 
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planet’s  distance  from  the  Sun  and  the  atmospheric  greenhouse  effect.  Lighter, 
hotter  molecules  will  move  faster  than  heavier,  colder  ones,  and  the  fast  ones  will 
be  more  likely  to  escape  the  planet’s  gravitational  grasp. 

If  the  gas  is  hot,  the  molecules  move  about  with  a  greater  speed  and  are  more 
likely  to  escape  the  gravitational  pull  of  the  planet.  This  is  one  of  the  reasons  that 
Mercury,  the  closest  planet  to  the  Sun  and  therefore  the  hottest,  has  no  atmosphere. 
The  other  reason  is  that  Mercury  has  a  relatively  small  mass,  as  far  as  planets  go, 
and  thus  has  a  comparatively  low  gravitational  pull.  On  the  other  hand,  a  planet 
with  a  larger  mass  is  more  likely  to  retain  an  atmosphere,  which  helps  explain  why 
massive  Jupiter  retains  the  lightest  element,  hydrogen.  Jupiter  is  also  relatively  far 
away  from  the  Sun’s  heat,  so  molecules  in  Jupiter’s  atmosphere  move  at  a  rela¬ 
tively  slow  speed. 

As  shown  by  John  S.  Lewis  (1941-  ),  the  composition  of  the  planets  and  their 
atmospheres  is  intimately  connected  with  their  distance  from  the  Sun  (Lewis  1974, 
2004). 

An  atom,  ion,  or  molecule  moves  about  because  it  is  hot.  Its  kinetic  tempera¬ 
ture,  T,  is  used  to  define  its  thermal  velocity,  Vthermah  given  by  equating  the 
thermal  energy  to  the  kinetic  energy  of  motion,  or 


L thermal  — 


J3kT- 

"3  kT 

V  m 

m 

1  1/2 


(5.20) 
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where  the  Boltzmann  constant  k  =  1.3806  x  10  “  J  K  ,  and  the  particle’s  mass 
is  denoted  by  m.  We  see  right  away  that  at  a  given  temperature,  lighter  particles 
move  at  faster  speeds.  Colder  particles  of  a  given  mass  travel  at  slower  speeds. 
Anything  will  cease  to  move  when  it  reaches  absolute  zero  on  the  kelvin  scale  of 
temperature. 

The  thermal  velocity  can  be  compared  to  the  planet’s  escape  velocity,  Vesc , 
given  by 


/2GM 

2GM 

V  D 

D 

1  1/2 


(5.21) 


where  M  is  the  planet’s  mass,  the  universal  gravitational  constant  is 
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G  =  6.674  x  10“ 11  m  kg“A  s  ,  and  D  is  the  distance  between  the  center  of  the 
planet  and  the  gas  particle.  The  escape  velocities  at  the  surfaces  or  cloud  tops  of 
the  planets  range  between  4  and  60  km  s“\ 

A  planet  tends  to  retain  molecules  that  are  moving  at  velocities  less  than  the 
planet’s  escape  velocity,  and  a  molecule’s  velocity  increases  with  temperature. 
At  a  given  temperature,  a  molecule’s  velocity  increases  with  decreasing  molecular 
mass,  so  lighter  molecules  move  at  faster  speeds  and  are  more  likely  to  escape  a 
given  planet  or  satellite  than  heavier  ones.  The  high-speed  tail  in  the  Maxwell- 
Boltzmann  distribution  means  that  at  any  instant,  a  tiny  fraction  of  the  molecules 
are  moving  fast  enough  to  escape  even  when  the  average  thermal  velocity  is  less 
than  the  escape  velocity. 
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When  the  thermal  velocity  exceeds  the  escape  velocity  for  a  given  type  of 
molecule,  all  of  those  molecules  will  promptly  flow  out  into  space,  and  if  this 
happens  for  every  type  of  molecule,  an  airless  body  is  left  behind  -  like  Mercury, 
the  Earth’s  Moon,  and  every  natural  planetary  satellite  in  the  solar  system  except 
Titan. 

For  the  Earth,  Mars  and  Venus,  the  thermal  velocity  of  all  molecules  is  smaller 
than  the  escape  velocity,  but  the  lightest,  fastest  molecules  can  still  slowly  leak  out 
or  evaporate  from  the  top  of  the  atmosphere  where  collisions  no  longer  dominate 
the  velocity  distribution.  At  lower  altitudes,  collisions  confine  the  particles,  but 
above  a  certain  altitude  known  as  the  exobase,  the  atmosphere  is  so  tenuous  that 
gas  particles  hardly  ever  collide.  Nothing  stops  an  atom  or  molecule  with  sufficient 
velocity  from  flying  away  from  the  exobase  into  space. 

The  method  of  molecular  escape  from  the  exobase  is  known  as  Jeans  escape , 
after  the  British  scientist  James  Jeans  (1877-1946)  who  introduced  it  (Jeans  1916). 
It  is  also  known  as  thermal  evaporation  since  it  is  analogous  to  the  slow  evapo¬ 
ration  of  water  from  the  ocean.  On  average,  the  molecules  in  ocean  water  do  not 
have  enough  thermal  energy  to  escape  from  the  liquid,  but  some  of  them  acquire 
enough  as  the  result  of  collisions  near  the  ocean’s  surface. 

At  and  below  the  exobase  in  the  atmosphere,  collisions  between  particles  drive 
the  speed  distribution  into  a  Maxwellian  one,  while  above  the  exobase  collisions 
are  essentially  absent  and  particles  that  have  velocities  greater  than  the  escape 
velocity  may  leave  the  planet.  The  upward  moving  atoms  in  the  high-speed  tail  of 
the  Maxwell  distribution  can  leave  or  exit  the  planet,  hence  the  name  exobase. 

The  lightest  element,  hydrogen,  is  the  one  that  most  easily  overcomes  the 
gravity  of  a  terrestrial  planet,  but  first  it  must  reach  the  exobase.  On  Earth,  the 
exobase  is  located  about  500  km  above  the  surface,  and  calculations  indicate  that 
about  a  billion,  billion,  billion,  or  10“  ,  hydrogen  atoms  are  still  being  lost  from  the 
Earth’s  exobase  every  second.  This  value  is  confirmed  by  satellite  ultraviolet 
observations  of  hydrogen  escaping  from  the  Earth’s  upper  atmosphere.  So 
hydrogen  gas  is  very  rare  in  the  Earth’s  atmosphere,  present  at  about  1  part  per 
million  by  volume.  Nevertheless,  it  is  still  the  third  most  abundant  element  in  the 
Earth’s  surface,  in  the  form  of  chemical  compounds  such  as  hydrocarbons  and 
water. 

Notice  that  lighter  particles  are  lost  by  thermal  evaporation  at  a  much  faster  rate 
than  heavier  ones.  Even  over  the  Earth’s  lifetime  of  4.6  billion  years,  the  total 
mass  of  all  the  hydrogen  atoms  lost  by  thermal  evaporation  is  2  x  10  kg,  and  the 
amount  lost  by  heavier  molecules  would  be  much  less.  By  way  of  comparison  the 
total  mass  of  the  Earth’s  atmosphere  is  about  5  x  10  kg.  Lammer  (2008)  has 
reviewed  the  atmospheric  escape  and  evolution  of  the  terrestrial  planets  and 
satellites. 
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5.4  Gas  Pressure 

5.4.1  What  Keeps  Our  Atmosphere  Up? 

Why  doesn’t  the  sky  fall  down,  as  Chicken  Little  once  said  was  happening?  After 
all,  the  Earth’s  atmosphere  is  pulled  down  by  the  planet’s  relentless  gravity.  The 
answer  is  that  the  atmosphere  is  warmed  by  the  Sun,  so  its  molecules  are  in 
continuous  motion  and  collide  with  one  another,  producing  a  gas  pressure  that 
prevents  them  from  falling  to  the  ground.  So,  the  atmosphere  holds  itself  up. 

The  British  scientist  Robert  Boyle  (1627-1691)  likened  air  to  a  “heap  of  little 
bodies,  lying  one  upon  another,”  which  acted  like  springs  that  resist  compression. 
He  discovered  that  the  total  pressure  exerted  by  all  the  little  springs  is  inversely 
proportional  to  the  volume  of  space  in  which  they  are  confined  (Boyle  1660). 
Other  things  being  equal,  the  product  of  pressure  and  volume  is  conserved,  so 
when  a  gas  is  compressed  into  a  smaller  volume  the  pressure  rises. 

When  you  compress  a  gas,  the  molecules  move  about  more  rapidly  and  collide 
more  often.  They  are  resisting  being  crowded  together  and  pushed  into  a  confining 
place.  If  you  remove  the  confinement,  the  gas  will  expand  out  into  surrounding 
space. 

Radiation,  wind,  or  a  magnetic  field  also  can  produce  a  pressure,  known  as 
radiation  pressure,  wind  pressure  or  magnetic  pressure.  For  example,  radiation 
pressure  of  sunlight  (Debye  1909)  can  be  used  to  propel  a  spacecraft  by  using  a 
solar  sail,  and  both  solar  radiation  pressure  and  solar  wind  pressure  push  different 
types  of  comet  tails  away  from  the  Sun.  The  solar  radiation  pressure  can  also  cause 
a  dust  grain  to  slowly  spiral  into  the  Sun;  a  drag  due  to  the  component  of  radiation 
pressure  tangential  to  the  grain’s  motion.  This  drag  is  known  as  the  Poynting- 
Robertson  effect  (Poynting  1904;  Robertson  1937). 

The  SI  unit  of  pressure  is  the  pascal,  abbreviated  Pa,  which  is  a  force  per  unit 

_ o 

area  with  1  Pa  =  1  N  m  ,  where  the  newton,  abbreviated  N,  is  the  unit  of  force. 
Atmospheric  pressure  on  the  Earth  and  other  planets  often  is  measured  in  the  bar 
unit  of  pressure,  where  1  bar  =  100,000  Pa  =  105  Pa.  The  Earth’s  standard 
atmospheric  pressure  at  sea  level  is  defined  as  1.01325  bar. 

The  radiation  pressure  of  sunlight  on  a  perfectly  reflecting  surface  at  the  Earth’s 
distance  from  the  Sun  is  about  10-5  Pa.  The  pressure  of  the  Sun’s  winds  just 

1  O  _  IQ 

outside  our  planet  is  a  startling  10  Pa,  but  it  drops  to  about  10  Pa  when  the 
winds  spread  out  to  their  boundary  with  interstellar  space.  The  pressure  of  intense 
magnetic  fields  just  above  the  visible  solar  disk  is  about  30  Pa. 

Some  representative  pressures,  displaying  their  enormous  range  in  the  universe, 
are  given  in  Table  5.5. 
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Table  5.5  Range  of  cosmic  pressures 


Location 

Pressurea  (Pa) 

Interstellar  space 

1(T13 

Sunlight  radiation  pressure  at  Earth  orbit 

10“5 

Beneath  foot  of  a  tarantula 

1 

Visible  solar  diskb 

10 

Atmospheric  pressure  on  Mars 

103 

Earth’s  atmosphere  at  sea  level0 

105 

Inside  a  champagne  bottle 

5  x  105 

Surface  pressure  of  atmosphere  on  Venus 

9  x  106 

Inside  a  fully  charged  scuba  tank 

107 

Center  of  the  Earth 

4  x  10“ 

Center  of  Jupiter 

7  x  1012 

Center  of  the  Sun 

2  x  1016 

a  Unless  otherwise  stated  in  the  location,  the  pressure  is  the  gas  pressure 
b  The  gas  pressure  at  the  visible  solar  disk,  known  as  the  photosphere,  is  about  the  same  as  the 
vacuum  pressure  inside  an  incandescent  light  bulb 

c  The  standard  atmosphere  of  Earth  has  a  sea-level  pressure  of  1.01325  bar  =  101,325  Pa 


5.4.2  The  Ideal  Gas  Law 

Under  most  conditions,  an  atmosphere  behaves  like  an  ideal  gas,  in  which  the 
randomly-moving  molecules  have  no  volume,  like  a  point,  and  do  not  interact 
with  each  other,  except  by  collisions.  The  ideal  gas  law  for  the  gas  pressure,  Pg,  is 
given  by: 


PgV  =  NtotkT ,  (5.22) 

where  V  is  the  volume,  Ntot  is  the  total  number  of  molecules,  the  Boltzmann 
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constant  k  =  1.3806  x  10  J  K  ,  and  T  is  the  temperature.  Such  a  relation 
between  pressure,  volume  and  temperature  is  known  as  an  equation  of  state. 

Using  the  particle  number  density  N  =  Ntot/V ,  the  ideal  gas  law  can  also  be 
written 


Pg  =  NkT. 


(5.23) 


Example:  Gas  pressure  and  number  of  molecules  at  sea  level  in  the 
Earth’s  atmosphere 

The  standard  atmospheric  pressure  on  Earth  at  sea  level  is  equal  to 
1.01325  x  105  Pa.  The  gas  temperature  is  288  K,  so  the  number  density  of 
atmosphere  molecules  at  sea  level  is  N  =  Ntot/V  =  P/kT  =  2.5  x  10  m_  . 


The  ideal  gas  law,  also  known  as  the  perfect  gas  law,  is  a  simplified  equation  of 
state  that  is  a  good  approximation  to  many  gases  under  many  different  conditions. 
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It  describes  the  pressure  within  the  Earth’s  atmosphere,  and  it  is  also  used  to 
specify  the  gas  pressure  exerted  by  subatomic  particles  in  the  hot  interiors  of  stars. 

s 

The  ideal  gas  law  was  first  stated  by  the  French  engineer  Emile  Clapeyron 
(1799-1864),  and  derived  from  kinetic  theory  by  the  German  physicists  August 
Kronig  (1822-1879)  and  Rudolf  Clauisus  (1822-1888)  (Clapeyron  1834,  1856; 
Clausius  1850,  1857,  1870). 

An  alternative  expression  for  the  ideal  gas  law  is: 

pkT  pkT 
8  m  pmH 

for  a  gas  of  mass  density  p  and  average  mass  per  particle  given  by: 

in  =  p/N  =  pmH  (5.25) 

for  mean  molecular  weight  p  or  the  mean  particle  mass  in  units  of  the  mass  of  the 
hydrogen  atom  of  mH  =  1.00794  u  «  1.67  x  10  "  kg,  which  is  roughly  equal  to 
the  atomic  mass  unit  u  =  1.6605  x  10  ^  kg  and  good  enough  for  order  of 
magnitude  estimates. 

For  a  diatomic  molecule  composed  of  atoms  of  atomic  mass  number  A,  the 
mean  molecular  weight  is  p  =  2 A,  which  is  28  for  molecular  nitrogen  N2,  where 
the  mass  number  of  the  nitrogen  atom  14N  is  14. 

Stellar  interiors  consist  of  ionized  gas.  For  a  fully  ionized  hydrogen  gas,  there 
will  be  an  equal  number  of  protons  and  electrons,  so  the  mean  mass  will  be: 


(5.24) 


me  +  mp  mpf 


where  the  mass  of  the  electron,  me ,  is  negligible  when  compared  to  the  mass  of  the 

_ O  '~J 

proton,  mP ,  which  is  equal  to  that  of  the  hydrogen  atom  mH  =  1.67  x  10-"'  kg. 
For  a  fully  ionized  helium  gas  in  =  4mH/3. 

When  the  ionized  gas  is  composed  of  three  different  kinds  of  particles,  each 
with  its  own  mass  ra*  and  number  density  Nt  denoted  by  subscript  i,  the  mean 
particle  mass  is 


N\m\  +  A^2^2  +  N 31713  p 

m  = - =  — 

N1+N2+N3  N  ’ 


(5.27) 


For  an  ionized  gas  containing  hydrogen,  H ,  helium,  He ,  and  an  element  of  mass 
number  A, 


m  p  2 

niH  ~  NmH  ~  1  +  3A  +  0.5y 


(5.28) 


where  X,  T,  and  Z  represent  the  concentration  by  mass  of  hydrogen,  helium  and 
heavier  elements  and  X  +  Y  +  Z  =  1.  Their  number  densities  are  given  by: 


Xp 

—  ,NHe 
mH 


Yp 
4  mH 


ZAp 

AmH 


1 


(5.29) 
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where  the  atomic  number  ZA  «  A/2  is  the  mass  abundance  of  an  element  of 
atomic  mass  number  A.  As  an  example,  the  abundances  observed  in  the  disk  of  the 
Sun  are  X  =  0.71,  Y  =  0.27  and  Z  =  0.02;  in  the  solar  core  nuclear  fusion  reac¬ 
tions  have  converted  about  half  the  hydrogen  into  helium  and  X  =  0.34,  Y  =  0.64 
and  Z  =  0.02. 

Any  hot  gas  exerts  gas  pressure,  and  the  gas  pressure  will  vary  with  distance 
from  whatever  is  heating  the  gas.  The  Earth’s  atmosphere,  for  example,  is  heated 
from  below,  at  the  warm  ground,  and  from  above,  by  the  Sun’s  radiation.  Unlike 
our  atmosphere,  the  Sun  is  heated  from  inside,  at  the  center  of  its  hot  dense  core. 


5.4.3  The  Earth’s  Sun-Layered  Atmosphere 

Our  thin  atmosphere  is  pulled  close  to  the  Earth  by  its  gravity  and  suspended 
above  the  ground  by  molecular  motion.  And  because  air  molecules  are  mainly  far 
apart,  our  atmosphere  is  mostly  empty  space,  and  it  always  can  be  squeezed  into  a 
smaller  volume.  The  atmosphere  near  the  ground  is  compacted  to  its  greatest 
density  and  pressure  by  the  weight  of  the  overlying  air.  Yet,  even  at  the  bottom  of 
the  atmosphere  the  density  is  only  about  1  /  1000th  of  that  of  liquid  water;  an  entire 
liter  of  this  air  weighs  only  1  g. 

At  greater  heights  there  is  less  air  pushing  down  from  above,  so  the  com¬ 
pression  is  less  and  the  pressure  and  density  of  air  gradually  fall  off  into  the 
vacuum  of  space.  There  is  a  simple  formula  that  expresses  the  drop  in  the  atmo¬ 
sphere  pressure  at  increasing  distance,  or  radius  from  a  planet  center.  The  gas 
pressure,  Pg(r ),  at  radius,  r,  is  given  by  the  barometric  equation ,  also  known  as  the 
barometric  law : 


pg{r)  =  Pg(R)  exp 

\  (r-R)' 

(5.30) 

H 

where  Pg  (R)  is  the  surface  pressure  at  radius  R ,  the  height  above  the  ground  is  r  —  R, 
and  H  is  the  atmosphere  scale  height  given  by: 

kT 

H  = 

mg 

(5.31) 

and 

kTR2 

H  S3 

inGM 

(5.32) 

where  k  is  the  Boltzmann  constant,  m  «  pmn  is  the  mean  molecular  mass  for  mean 
molecular  weight  /i,  where  mH  =  1.67  x  10  ~  kg  is  the  mass  of  the  hydrogen  atom, 
and  the  g  is  the  local  acceleration  of  gravity  given  by  g  =  GM(r)/r  «  GM/R  fora 
planet  of  mass  M,  at  least  as  long  as  r  —  R  is  much  less  than  R. 


5.4  Gas  Pressure 


145 


<D 

S-i 


3 


c/2 

C/2 


<D 

S-H 


Ph 


Fig.  5.4  Earth’s  layered  atmosphere  The  atmospheric  pressure  ( right  scale)  decreases  with 
altitude  ( left  scale).  This  is  because  fewer  particles  are  able  to  overcome  the  Earth’s  gravitational 
pull  and  reach  higher  altitudes.  The  temperature  ( bottom  scale)  also  decreases  steadily  with 
height  in  the  ground-hugging  troposphere,  but  the  temperature  increases  in  two  higher  regions 
that  are  heated  by  the  Sun:  the  stratosphere ,  with  its  critical  ozone  layer ,  and  the  ionosphere.  The 
stratosphere  is  heated  mainly  by  ultraviolet  radiation  from  the  Sun,  and  the  ionosphere  is  created 
and  modulated  by  the  Sun’s  x-rays  and  extreme  ultraviolet  radiation,  which  breaks  apart 
atmospheric  molecules,  and  strips  electrons  from  atoms  to  produce  ions.  The  process  of 
ionization  by  the  Sun’s  invisible  rays  releases  heat  to  warm  the  ionosphere,  so  the  temperature 
rises  with  altitude.  In  the  ionosphere,  at  about  100-500  km  above  the  ground,  temperatures 
skyrocket  to  higher  than  anywhere  else  in  the  atmosphere.  At  higher  altitudes,  the  atmosphere 
thins  out  into  the  exosphere,  or  the  “exit  to  the  outside  sphere.”  The  temperature  is  so  hot  out 
there  and  the  particles  move  so  fast  that  some  atoms  and  molecules  slowly  evaporate 


The  temperature  of  the  atmosphere  also  decreases  at  increasing  height  near  the 
ground;  however,  it  is  not  a  simple  fall- off  when  we  keep  going  upward.  It  falls 
and  rises  in  two  full  cycles  as  we  move  off  into  space  (Fig.  5.4).  That’s  because  the 
Earth’s  atmosphere  is  heated  from  both  the  warm  ground  below  and  from  above  by 
ultraviolet  and  x-ray  radiation  from  the  Sun. 

The  temperature  decreases  steadily  with  height  in  the  lowest  regions  of  our 
atmosphere,  since  it  is  heated  from  the  warmer  ground  below.  When  warm  cur¬ 
rents  move  up  from  the  Earth’s  surface,  they  expand  in  the  lower  pressure  and 
become  cooler.  The  average  air  temperature  drops  below  the  freezing  point  of 
water  of  273  K  at  only  a  kilometer  or  two  above  the  Earth’s  surface,  and  bottoms 
out  at  a  height  of  about  12  km  above  sea  level.  This  is  the  greatest  height  achieved 
by  the  air  currents.  All  our  weather  occurs  below  this  altitude,  controlled  by  visible 
sunlight. 

Global  atmospheric  circulation,  driven  by  differential  solar  heating  of  the 
equatorial  and  polar  surfaces,  creates  complex,  wheeling  patterns  of  weather  in  this 
region,  leading  to  the  designation  troposphere,  from  the  Greek  tropo  for  “turning.” 
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The  average  extent  of  the  ground-hugging  troposphere  varies  with  latitude,  from 
about  16  km  above  the  warm  equator  to  roughly  8  km  over  the  cold  poles. 

The  vertical  extent  of  the  troposphere  was  detected  near  the  end  of  the  nine¬ 
teenth  century  when  Leon  Philippe  Teisserenc  de  Bort  (1855-1913),  a  French 
meteorologist,  launched  hundreds  of  unmanned  balloons  that  carried  thermometers 
and  barometers  to  altitudes  as  great  as  15  km  (de  Bort  1902).  At  this  height,  the 
temperature  no  longer  decreased  with  altitude,  and  seemed  to  remain  nearly 
constant.  If  the  temperature  was  unchanging,  the  ingredients  of  the  atmosphere 
above  the  troposphere  might  settle  down  into  layers,  or  strata,  depending  on  their 
weight,  so  de  Bort  named  this  region  the  stratosphere. 

Contrary  to  everyone’s  expectations,  the  temperature  increases  at  greater 
heights  within  the  stratosphere,  rising  to  nearly  ground-level  temperatures  at  about 
50  km  above  the  Earth’s  surface;  but  we  still  use  the  name  stratosphere  to  des¬ 
ignate  the  layer  of  the  atmosphere  that  lies  immediately  above  the  troposphere. 
The  Sun’s  invisible  ultraviolet  radiation  is  largely  absorbed  in  the  stratosphere, 
where  the  radiation  warms  the  gas  and  helps  make  ozone. 

Above  the  stratosphere  we  come  to  the  mesosphere,  from  the  Greek  meso  for 
“intermediate.”  The  temperature  declines  rapidly  with  increasing  height  in  the 
mesosphere,  from  temperatures  of  about  265  K  at  50  km  altitude  to  far  below 
freezing  at  about  85  km,  where  the  temperature  reaches  the  lowest  levels  in  the 
entire  terrestrial  atmosphere. 

The  mesosphere  has  been  known  as  the  “ignorosphere”  because  it  is  too  high  to 
be  reached  by  airplanes  and  too  low  to  be  studied  by  most  spacecraft.  The  air  at 
this  height  is  too  thin  to  support  research  balloons  or  aircraft,  but  thick  enough  for 
atmospheric  friction,  or  air  drag,  to  cause  satellites  to  decay  quickly  from  orbit. 
Sounding  rockets  pass  through  this  region  too  rapidly  to  permit  detailed  study. 

Instruments  aboard  rockets,  as  well  as  radio  signals  from  the  ground,  have  been 
used  to  examine  the  higher  levels  of  the  Earth’s  atmosphere,  where  the  temper¬ 
atures  rise  to  above  those  on  the  ground  owing  to  extreme  ultraviolet  and  x-ray 
radiation  from  the  Sun.  This  radiation  is  energetic  enough  to  break  the  atmospheric 
molecules  apart,  and  to  strip  electrons  off  their  component  atoms,  producing  ions. 
They  reside  in  the  ionosphere  (Focus  5.1),  where  the  temperature  can  rise  to  about 
1200  K. 


Focus  5.1  The  Earth’s  ionosphere 

On  December  12,  1901,  the  Italian  electrical  engineer  Guglielmo  Marconi 
(1874-1937)  startled  the  world  by  transmitting  a  radio  signal  in  Morse  code 
across  the  Atlantic,  from  England  to  Newfoundland.  Marconi  became  an 
international  hero  and  established  the  American  Marconi  Company,  which 
later  evolved  into  the  Radio  Corporation  of  America,  abbreviated  RCA.  In 
1909  Marconi  and  the  German  inventor  Karl  F.  Braun  (1850-1918)  received 
the  Nobel  Prize  in  Physics  for  “their  contribution  to  the  development  of 
wireless  telegraphy.” 


5.4  Gas  Pressure 


147 


Because  radio  waves  travel  in  straight  lines,  and  cannot  pass  through  the 
solid  Earth,  no  one  expected  that  Marconi  could  send  a  radio  signal  halfway 
around  the  world.  Radio  waves  get  around  the  Earth’s  curvature  by  reflection 
from  an  electrically  charged  layer,  now  called  the  ionosphere,  extending  into 
space  from  roughly  70  to  500  km  above  the  Earth’s  surface.  The  atoms  in 
the  ionosphere  are  highly  ionized,  and  many  of  their  electrons  have  therefore 
been  set  free  from  atomic  bonds.  As  independently  shown  by  Arthur  E. 
Kennelly  (1861-1939),  then  at  the  Harvard  School  of  Engineering,  and 
Oliver  Heaviside  (1850-1925)  in  England,  these  electrons  give  the  iono¬ 
sphere  a  high  electrical  conductivity,  which  enables  them  to  turn  the  radio 
signals  back  toward  the  ground,  reflecting  them  like  a  metal  mirror  and  not 
allowing  the  radio  signals  to  pass  through  (Kennelly  1902). 

The  rapid  expansion  of  radio  broadcasting  in  the  1920s,  as  well  as  the 
concurrent  development  of  pulsed  radio  signals,  helped  specify  the  structure 
and  daily  variation  of  the  ionosphere.  Edward  Appleton  (1892-1965)  and  his 
students  measured  the  height  of  the  reflecting  layer  by  determining  the 
elapsed  time  between  transmitting  a  radio  pulse  and  receiving  its  echo  from 
the  ionosphere;  like  all  electromagnetic  radiation,  the  radio  waves  travel  at 
the  speed  of  light.  They  showed  that  there  are  at  least  three  such  reflecting 
layers,  now  labeled  D,  E  and  F,  at  respective  altitudes  of  70,  100  and 
200-300  km  (Appleton  1932;  Appleton  and  Barnett  1925). 

In  1947,  Sir  Appleton  was  awarded  the  Nobel  Prize  in  Physics  for  his 
investigations  of  the  physics  of  the  upper  atmosphere,  especially  for  his 
discovery  of  the  so-called  Appleton  layer. 


The  mystery  of  exactly  what  produces  and  controls  the  ionosphere  was  not 
solved  until  after  World  War  II,  when  captured  German  V-2  rockets  were  brought 
to  the  United  States.  These  and  subsequent  rockets,  built  by  American  engineers, 
were  used  by  the  Naval  Research  Laboratory  to  loft  detectors  above  the  atmo¬ 
sphere,  showing  that  the  Sun  emits  very  energetic  radiation  at  invisible  x-ray  and 
extreme  ultraviolet  wavelengths  (Byram  et  al.  1956).  When  this  radiation  reaches 
the  upper  atmosphere  it  breaks  the  nitrogen  and  oxygen  molecules  into  their 
constituent  atoms  and  ionizes  them,  producing  free  electrons  and  atomic  ions.  The 
ionosphere  above  your  head  therefore  develops  as  the  Sun  rises  and  decays  as  the 
Sun  sets;  it  lingers  on  during  the  night  but  is  not  energized  then. 

The  process  of  ionization  by  the  Sun’s  invisible  rays  releases  heat  to  warm  the 
ionosphere,  so  the  temperature  increases  with  altitude  in  it.  Within  the  ionosphere, 
the  temperatures  rise  to  higher  values  than  anywhere  else  in  the  entire  atmosphere. 
Indeed,  some  scientists  prefer  to  call  this  region  the  thermosphere,  or  “hot” 
sphere.  The  thermosphere  overlaps  with  the  E  and  F  regions  of  the  ionosphere, 
beginning  at  about  90  km  and  extending  upward  to  about  500  km. 

At  higher  altitudes,  the  atmosphere  thins  out  into  the  exosphere ,  or  the  “exit  and 
outside  sphere.”  In  the  exosphere  the  gas  density  is  so  low  that  an  atom  can 
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completely  orbit  the  Earth  without  colliding  with  another  atom.  The  temperature 
is  so  hot  out  there,  and  the  atoms  move  so  fast,  that  some  atoms  can  escape  the 
Earth’s  gravitational  pull  and  travel  into  outer  space.  It  is  therefore  the  thermo¬ 
sphere  at  the  top  of  the  ionosphere  that  caps  our  Sun-layered  atmosphere  and 
provides  the  Earth’s  threshold  into  space. 


5.4.4  Pressure,  Temperature,  and  Density  Inside  the  Sun 

Our  Sun  is  a  giant  sphere  of  extremely  hot  gas,  rarefied  on  the  outside  and 
compacted  on  the  inside.  Unlike  the  Earth,  it  has  no  solid  surface.  The  high 
compressibility  of  the  solar  gas  brings  about  a  rapid  increase  in  density  as  we  go 
from  its  visible  disk  to  its  center,  and  as  the  result  of  this  crowding,  the  gas 
particles  collide  more  frequently  with  higher  speeds  than  elsewhere  in  the  Sun. 
The  compacted  gas  particles  also  push  more  vigorously  outward,  producing  strong 
gas  pressure  that  keeps  the  Sun  from  collapsing. 

Within  the  Sun’s  dense,  central  core,  the  density  has  increased  to 

_ Q 

151,300  kg  m  ,  which  greatly  exceeds  that  of  any  solid  or  liquid  bodies  in  our 
everyday  environment.  The  central  density  is  more  than  ten  times  greater  than  the 

_ Q 

density  of  solid  lead,  at  11,340  kg  m  ,  but  still  behaving  like  a  gas. 

The  center  of  the  Sun  is  just  slightly  more  than  100  times  denser  than  the  Sun 

_ Q 

taken  as  a  whole,  at  a  mean  value  of  1409  kg  m  .  That’s  the  number  you  get 
when  dividing  the  Sun’s  mass  M0  =  1.989  x  1030  kg  by  its  volume,  4nR3Q/3,  for  a 
solar  radius  of  RQ  =  6.955  x  108  m.  The  mean  density  of  the  Sun  is  near  that  of 

_ o 

water,  at  1 ,000  kg  m  ,  but  it  is  an  average  density  and  the  Sun  is  much  too  hot  to 
be  solid  or  liquid  anywhere  inside. 

The  central  temperature  can  be  estimated  by  assuming  that  a  proton  at  the 
center  of  the  Sun  is  hot  enough  and  moving  fast  enough  to  counteract  the  gravi¬ 
tational  compression  it  experiences  from  the  rest  of  the  star.  When  you  do  the 
arithmetic,  this  balanced  condition  occurs  at  a  central  temperature  of  15.6  million 
K  (Sect.  8.2).  That  is  how  hot  the  center  of  the  Sun  has  to  be  to  avoid  collapsing 
under  its  own  weight,  and  something  has  to  keep  it  that  hot.  The  heat  is  supplied  by 
nuclear  reactions  at  the  core  of  the  Sun  (Chap.  8). 

At  this  temperature  and  the  central  density,  the  central  pressure  needed  to  resist 
the  weight  of  the  overlying  gas  is  2  x  10  Pa,  or  200,000  million  times  the 
pressure  of  our  atmosphere  at  sea  level. 


Example:  Gas  pressure  at  the  center  of  the  Sun 

c  _ o 

The  mass  density  at  the  center  of  the  Sun  is  p  =  1.51  x  10  kg  m  ,  and 
the  mass  is  provided  by  protons  of  mass  mP  =  1.67  x  10  kg.  So  the 
number  density  of  protons  at  the  center  of  the  Sun  is  N  =  p/mP  =  0.90  x 

o  o  _ o  n 

10  m  .  The  central  solar  temperature  isTc=1.56x  10  K.  So  the  gas 

1  6 

pressure  at  the  center  of  the  Sun  is  P  =  NkT  =  1.9  x  10  Pa,  using  the 


5.4  Gas  Pressure 


149 


Boltzmann  constant  k  =  1.38065  x  10-23  J  K_1.  This  is  about  200  billion, 
or  2  x  1011,  times  as  great  as  the  atmospheric  pressure  at  sea  level  on  Earth, 
at  about  105  Pa.  That’s  because  there  is  both  a  much  higher  temperature  and 
a  much  higher  particle  number  density  in  the  core  of  the  Sun. 


In  contrast  to  both  the  central  and  mean  densities,  the  outer  layers  of  the  Sun  are 
quite  rarefied.  This  is  because  there  is  less  overlying  material  to  support  at  greater 
distances  from  the  center,  so  there  is  a  drop  in  pressure,  density  and  temperature. 
The  compression  is  less,  so  the  gas  gets  thinner  and  cooler  (Fig.  5.5).  Halfway 
from  the  center  of  the  Sun  to  the  visible  disk,  the  density  is  the  same  as  that  of 
water,  and  about  nine  tenths  of  the  distance  from  the  center  to  the  Sun’s  appar¬ 
ent  edge,  we  find  material  as  tenuous  as  the  transparent  air  that  we  breathe  on 
Earth. 

At  the  visible  solar  disk,  the  rarefied  gas  is  about  one  thousand  times  less  dense 
than  our  atmosphere  at  sea  level.  Out  there,  in  the  more  rarefied  outer  parts  of  the 
Sun,  the  temperature  has  fallen  to  5,780  K.  Examination  of  this  outer,  cooler  solar 
atmosphere  tells  us  about  the  elemental  constituents  of  the  Sun.  But  in  most  of  the 
Sun,  from  its  core  to  its  upper  atmosphere,  there  are  no  atoms;  at  these  hot 
temperatures  the  atoms  have  been  tom  by  collisions  into  their  subatomic  ingre¬ 
dients,  mainly  protons  and  electrons,  to  make  an  ionized  gas  known  as  plasma. 


5.5  Plasma 

5.5.1  Ionized  Gas 

Plasma  has  been  called  the  fourth  state  of  matter ,  to  distinguish  it  from  the  solid, 
liquid  and  gas  states  (Fig.  5.6).  The  plasma  is  a  hot,  completely  ionized  gas, 
consisting  of  ions  and  electrons  that  have  been  pulled  free  of  atoms.  The  high 
temperatures  result  in  all  the  atoms  losing  their  normal  complement  of  electrons  to 
leave  positively  charged  ions  behind,  and  it  is  too  hot  for  the  free  electrons  and 
ions  to  join  together  and  form  permanent  atoms.  Because  the  total  negative 
electrical  charge  of  the  free  electrons  is  equal  to  the  total  positive  charge  of  the 
ions,  plasma  is  electrically  neutral  over  a  sufficiently  large  volume. 

The  ionosphere  is  plasma.  The  interiors  of  most  stars  are  plasma  consisting 
mainly  of  electrons  and  protons,  but  this  plasma  still  behaves  like  a  gas  that  is 
described  by  thermal  equilibrium  and  the  Maxwellian  speed  distribution.  Most  of 
the  matter  in  the  universe  is  in  the  plasma  state. 

Ions  are  atoms  that  have  lost  one  or  more  electrons,  and  the  ionization  energy  is 
the  amount  of  energy  needed  to  remove  electrons  from  an  atom.  This  energy  is 
also  known  as  the  ionization  potential,  and  in  atomic  physics,  the  ionization 
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Fig.  5.5  Internal  compression  of  the  Sun  The  variation  of  pressure,  temperature,  and  mass 
density  with  fractional  radial  distance  from  the  Sun’s  center  (left)  to  its  visible  disk  (right).  At  the 

_ Q 

center  of  the  Sun,  the  temperature  is  15.6  million  K  and  the  mass  density  is  151,300  kg  m  ;  the 
central  pressure  is  2.33  x  1016  Pa,  or  233  billion  times  that  of  the  Earth’s  atmosphere  at  sea  level 
(one  bar  is  equivalent  to  100,000  Pa).  Nuclear  reactions  occur  only  in  the  central  core  to  about 
25  %  of  the  Sun’s  radius.  The  energy  produced  in  the  core  is  transported  by  radiation  to  71  %  of 
the  star’s  radius,  where  the  temperature  has  dropped  to  about  2  million  K  and  the  density  has 

_ o 

fallen  to  about  200  kg  m  .  The  energy  then  is  transported  by  convection  out  to  the  Sun’s  visible 
disk,  known  as  the  photosphere,  where  the  temperature  is  5,780  K,  and  the  pressure  and  density 
have  dropped  off  the  scales  of  the  graph.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R. 
Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


energy,  or  potential,  is  measured  using  the  unit  of  electron  volt,  abbreviated  eV. 
By  definition,  the  electron  volt  is  the  amount  of  kinetic  energy  gained  by  a  single 
unbound  electron  when  it  is  accelerated  through  an  electric  potential  difference  of 
1  V.  It  is  the  energy  an  electron  gains  when  it  passes  across  the  terminals  of  a  1  V 
battery. 
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Gaseous  Solid  (or  Liquid)  Plasma 


Fig.  5.6  States  of  matter  The  locations  of  atoms,  large  dashed  circles',  their  component 
electrons,  small  filled  circles',  and  central  nucleus,  large  filled  circles,  for  the  gaseous  {left),  liquid 
or  solid  {center),  and  plasma  {right)  states  of  matter.  In  the  gaseous  state,  the  atoms  are  widely 
separated  and  free  to  move  about.  The  atoms  are  practically  touching  one  another  in  the  solid  and 
liquid  states.  At  sufficiently  high  temperature  and  pressure,  the  atoms  cease  to  exist  and  the 
plasma  state  is  created.  The  atoms  are  torn  into  their  constituents  by  frequent  collisions  at  high 
temperatures.  Plasma  consists  of  bare  nuclei  and  unattached  electrons  moving  about  in  random 
directions  within  the  former  empty  space  of  atoms.  In  the  plasma  state,  matter  regains  the 
compressibility  of  the  gaseous  state  and  plasma  behaves  like  a  gas.  (From  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 

The  eV  unit  of  energy  is  also  used  to  describe  x-rays,  which  were  first  produced 
by  connecting  a  high-voltage  power  supply  across  the  ends  of  an  evacuated  glass 
tube.  The  x-rays  are  in  the  1-100  keV  range  of  energy,  where  1  keV  =  1,000  eV. 

The  conversion  from  the  eV  unit  to  the  joule  unit  of  energy  is 

1  eV  =  1.602176  x  10"19J«  1.602  x  10"19J,  (5.33) 

which  is  numerically  equal  to  the  elementary  charge,  in  coulombs,  of  the  electron. 

The  amount  of  energy  required  to  remove  the  least  tightly  bound  electron  from 
a  neutral  atom  is  called  the  first  ionization  potential,  denoted  by  the  Roman 
numeral  I.  The  additional  energy  needed  to  remove  the  next  least  tightly  bound 
electron  is  the  second  ionization  potential.  More  generally,  the  nth  ionization 
potential,  or  the  nth  ionization  energy,  is  the  energy  required  to  strip  off  the  nth 
electron  after  the  first  n  —  1  electrons  have  been  removed. 

For  the  one  and  only  electron  of  the  hydrogen  atom,  the  first  ionization  potential 
is  13.5984  eV.  The  first,  second  and  third  ionization  potentials  of  atomic  oxygen 
are  13.6181,  35.117  and  54.934  eV.  The  temperature,  T,  required  to  ionize 
hydrogen,  ripping  off  its  sole  electron  and  leaving  a  proton  behind,  is  obtained  by 
dividing  its  first  ionization  potential  by  the  thermal  energy  kT ,  for  the  Boltzmann 
constant  k  =  1.38065  x  10-23  J  K  \  giving  a  temperature  T  «  1.58  x  105  K. 
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At  this  temperature,  or  any  higher  one,  hydrogen  atoms  will  be  completely  torn 
apart,  into  their  subatomic  ingredients,  the  electrons  and  protons,  which  happens 
inside  most  stars. 


5.5.2  Plasma  Oscillations  and  the  Plasma  Frequency 

If  all  the  electrons  in  plasma  were  displaced  by  a  small  amount  with  respect  to  the 
ions,  the  force  of  electrical  attraction  between  the  electrons  and  ions  would  pull  the 
electrons  back,  but  when  pushed  the  displacement  can  continue.  The  back  and 
forth  motion  is  a  natural  oscillating  one,  with  electrons  moving  at  the  plasma 
frequency ,  designated  vP,  given  by  (Tonks  and  Langmuir  1929): 

1/2 

=  8.98  Ay 2  Hz.  (5.34) 
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where  the  electron  density  Ne  is  in  units  of  m  ,  the  electron  charge 
e  =  1.602  x  10“ 19  C,  the  electron  mass  me  =  9.1094  x  10-31  kg,  the  permit¬ 
tivity  of  free  space  is  s0  =  8.8542  x  10-12  F  m_1,  and  n  «  3.14159. 

The  ionosphere  reflects  radio  waves  at  the  plasma  frequency  Vp  or  at  the  plasma 
wavelength,  denoted  2P.  Since  the  radio  radiation  travels  at  the  speed  of  light,  c, 
we  have  2P  x  vP  =  c  =  2.9979  x  108  m  s-1. 


vP 


ezNe 


47i28ome 


Example:  The  temperature,  origin,  and  plasma  frequency  of  the  Earth’s 
ionosphere 

The  F  layer  of  the  ionosphere,  located  about  200  km  above  our  heads, 
contains  oxygen  atoms  that  are  missing  two  electrons  and  a  number  density 

1  o  _ o 

of  free  electrons  of  Ne  =  10  "  m  .  The  temperature,  T,  required  to  create 
these  ions  can  be  estimated  by  equating  the  thermal  energy  3kT/2  to  the 
third  ionization  potential  for  oxygen  atoms  of  54.934  eV,  using  the 
Boltzmann  constant  k  =  1.38065  x  10-23  J  K-1  and  1  eV  =  1.6022  x 
10“ 19  J.  The  result  is  T  =  4.229  x  105  K.  If  the  photon  energy  hv  of  the 
incident  solar  radiation  is  equal  to  this  thermal  energy,  where  the  Planck 

_ Q  A 

constant  h  =  6.626  x  10  J  s,  then  the  wavelength  of  the  radiation  is 
2  =  c/v  =  2hc/(3kT)  «  2.3  x  10-8  m,  or  an  x-ray  wavelength,  so  x-rays 
from  the  Sun  can  produce  the  ionization.  The  plasma  wavelength  kP  cor¬ 
responding  to  the  plasma  frequency  vP  in  this  layer  of  the  ionosphere  is 

O  _  1 

2p  =  c/Vp  «  33  m,  where  the  speed  of  light  c  =  2.9979  x  10  m  s  ,  and 
Vp  =  8.98  NlJ2  =  8.98  x  106  Hz.  Radio  transmissions  from  the  Earth  at 
this  long  wavelength  are  reflected  back  down  to  the  ground,  and  cannot  get 
through  the  ionosphere.  They  can  nevertheless  be  reflected  at  an  angle, 
enabling  long-distance  radio  communications. 


5.5  Plasma 
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We  can  infer  the  height  and  electron  density  of  a  given  layer  in  the  ionosphere 
by  sending  radio  signals  up  into  the  atmosphere  at  successively  longer  wave¬ 
lengths  or  shorter  frequencies.  The  ionosphere  will  not  mirror  radio  waves,  and 
send  a  signal  back,  unless  their  wavelength  is  longer  than  the  plasma  wavelength 
or  their  frequency  is  less  than  the  plasma  frequency.  This  provides  a  measure  of 
the  electron  density  in  the  ionosphere,  the  shorter  the  reflection  wavelength  or  the 
higher  the  reflection  frequency,  the  larger  the  electron  density.  Radio  waves  with 
wavelengths  that  are  less  than  the  plasma  wavelength  can  pass  right  through  the 
ionosphere,  because,  roughly  speaking,  they  are  short  enough  to  pass  among  the 
electrons.  These  shorter  wavelengths  are  used  in  communications  with  satellites  or 
other  spacecraft  in  outer  space  beyond  the  ionosphere.  The  longer  wavelengths  are 
used  in  radio  communications  around  the  Earth  through  reflection  off  the 
ionosphere. 


Example:  Sputnik 

The  F  layer  in  the  ionosphere  has  an  electron  density  of  about 

1  _ Q 

Ne  k,  10  "  m  ;  therefore,  it  has  a  plasma  frequency  of 

/r 

vp^8.98xl0  Hz  «  8. 98  MHz  with  a  corresponding  wavelength  of 
4p  =  c/vp  «  33  m.  Radio  or  microwave  signals  with  wavelength  shorter 
than  this  value  can  see  through  the  ionosphere  and  communicate  with  sat¬ 
ellites  in  and  above  it.  At  longer  wavelengths,  radio  signals  sent  up  into  the 
ionosphere  are  reflected  back  down  to  the  ground  and  this  is  how  the  ion¬ 
osphere’s  electron  densities  are  measured.  The  orbit  of  the  first  artificial 
satellite,  Sputnik ,  which  was  launched  on  October  4,  1957,  had  a  semi-major 
axis  of  6,955  km,  placing  it  about  584  km  above  the  mean  radius  of  the 
Earth;  the  mean  radius  is  6,371  km.  Sputnik  therefore  orbited  within  the 
outer  ionosphere;  the  atmospheric  friction  led  to  rapid  orbital  decay  and  the 
demise  of  the  satellite  in  3  months.  Amateur  radio  operators  monitored  the 
beep  of  its  radio  signals  throughout  the  world.  At  a  signal  frequencies  of  20.0 
and  40.0  MHz,  or  wavelengths  of  about  15  and  7.5  m,  the  radio  signals  from 
Sputnik  were  just  short  enough  in  wavelength  to  pass  through  the  ionosphere. 


5.5.3  Atoms  are  Tom  Apart  into  Plasma  Within  the  Sun 

Whole  atoms  are  only  found  in  the  outer  visible  layers  of  the  Sun,  where  the 
temperature  is  a  relatively  cool  5,780  K.  Raise  the  temperature  by  just  a  factor  of 
three,  to  about  17,000  K,  which  happens  just  beneath  the  solar  disk  we  see  with 
our  eyes,  and  the  Sun’s  hydrogen  atoms  are  stripped  bare,  losing  their  identity. 

The  hot  atoms  move  rapidly  here  and  there,  colliding  with  each  other  at  high 
speeds,  and  the  violent  force  of  these  collisions  is  enough  to  fragment  the  atoms 
into  their  subatomic  constituents.  And  since  the  Sun  is  mostly  hydrogen,  its 
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interior  consists  mainly  of  protons,  the  nuclei  of  hydrogen  atoms,  and  free  elec¬ 
trons  that  have  been  torn  off  the  atoms  by  innumerable  collisions  and  set  free  to 
move  throughout  the  Sun. 

What  is  left  is  plasma,  a  seething  mass  of  electrically  charged  particles,  the 
electrons  and  protons.  The  electrical  charge  of  the  protons  balances  and  cancels 
that  of  the  electrons,  which  have  been  removed  from  atoms  to  also  release  the 
protons,  so  the  plasma  has  no  net  electric  charge.  The  Sun  is  just  one  huge  mass  of 
incandescent  plasma,  compressed  on  the  inside  and  more  tenuous  further  out. 

Plasma  can  be  packed  more  tightly  than  complete  atoms.  This  is  because  the 
electrons  in  an  atom  are  located  at  relatively  remote  distances  from  the  atomic 
nuclei,  so  atoms  are  largely  empty  space  and  once  the  electrons  are  removed  the 
protons  can  be  compressed  together  more  than  atoms  can. 


5.6  Sound  Waves  and  Magnetic  Waves 
5.6.1  Sound  Waves 


Sound  is  transmitted  in  waves  that  are  produced  by  perturbations  in  an  otherwise 
undisturbed  gas  or  liquid.  These  waves  can  be  described  as  a  propagating  change 
in  the  mass  density.  For  a  fluid  medium,  which  can  be  either  a  gas  or  a  liquid,  we 
assume  an  initial  equilibrium  in  which  the  fluid  is  at  rest,  with  initial  velocity 
V0  =  0  and  a  constant  mass  density  p0  and  pressure  P0,  where  the  subscript  0 
denotes  the  initial  undisturbed  condition.  We  then  assume  a  perturbation  px  in  the 
mass  density  p  that  becomes  p  =  p0  +  px,  the  perturbation  velocity  is  denoted  as 
Vi,  and  the  perturbation  pressure  designated  Px  with  a  subscript  1  for  the  perturbed 
condition.  The  equations  of  hydrodynamics  have  a  plane  wave  solution  for  the 
perturbed  density  in  the  v  direction  given  by: 


px  oc  exp 


(5.35) 


where  the  frequency,  co ,  is  related  to  the  wavelength,  2,  by: 


(5.36) 


The  pressure,  Px,  and  velocity,  Vj,  also  satisfy  the  wave  equation.  The  French 
mathematician  and  astronomer  Pierre-Simon  Laplace  (1749-1827)  used  the  ideal 
gas  law  to  describe  the  pressure,  by  P  =  NkT  and  P/ p  =  kT/m  to  obtain  the  speed 
of  sound,  cs  (Laplace  1816): 


OJA 


2n 


Po 


m 


(5.37) 
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where  y  is  the  adiabatic  index.  For  a  monatomic  gas,  the  y  is  5/3  =  1.667  and  for  a 
diatomic  gas  y  =  7/5  =  1.400.  The  Boltzmann  constant  k  =  1.38065  x 
10  J  K  .  The  mean  molecular  mass  is  m  =  p  x  u  where  the  mean  molecular 
weight  is  p  and  the  atomic  mass  unit  u  =  1.66054  x  10  kg.  Isaac  Newton 
(1642-1727)  considered  the  speed  of  sound  in  an  isothermal  calculation  in  his 
Principia  ( Book  //,  Proposition  49),  instead  of  an  adiabatic  one,  obtaining 
essentially  the  same  result  with  a  y  =  1.000  for  an  isothermal  perturbation. 

The  speed  of  sound  in  an  ideal  gas  is  proportional  to  the  square  root  of  the 
temperature,  but  it  is  nearly  independent  of  pressure  or  mass  density  for  a  given 
gas.  At  a  constant  temperature,  the  ideal  gas  pressure  has  no  effect  on  the  speed  of 
sound  because  the  pressure  and  the  density,  which  is  also  proportional  to  pressure, 
have  equal  but  opposite  effects  on  the  speed  of  sound,  and  the  two  contributions 
cancel  out  exactly. 

Our  atmosphere  consists  mainly  of  diatomic  molecules  N2  at  78  %  and  oxygen 
02  at  21  %.  For  diatomic  molecules  y  =  1.400,  and  the  mean  molecular 
weight  p  =  2 A  for  a  diatomic  molecule  composed  of  atoms  of  atomic  mass 
number  A.  Nitrogen  and  oxygen  have  A  =  14  and  A  =  16,  respectively;  there¬ 
fore,  the  mean  molecular  weight  of  our  atmosphere  is  2  x  14  x  0.78  +  2  x  16  x 
0.21  =  28.56.  Substituting  these  numbers  with  the  other  known  constants  into  the 
equation  we  find  that  sound  moves  though  the  air  at  a  speed  of  about  20 
Tl/2  m  s"1,  or  to  be  precise: 


Coir  =  20.0457  [T 
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ms 
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(5.38) 


where  T  is  temperature  on  the  kelvin  scale,  and  T  =  273.15  +  Tc  if  you  are  using  a 
temperature  Tc  in  °C.  At  sea  level,  the  temperature  is  288  K  or  15  °C  and  the 
sound  speed  is  about  340  ms-1.  The  speed  of  sound  decreases  with  altitude,  due 
to  lower  temperatures  found  there,  but  even  at  the  cruising  altitudes  of  most 
aircraft  the  temperature  is  less  than  about  216  K  corresponding  to  a  speed  of  sound 
of  less  than  294  km  s-1. 

The  speed  of  motion  divided  by  the  speed  of  sound  is  called  the  Mach  number, 
and  anything  that  moves  at  a  speed  greater  than  Mach  1  is  said  to  be  traveling  at 
supersonic  speed.  Most  modern  fighter  aircraft  are  supersonic.  Such  aircraft  have 
broken  the  sound  barrier  and  can  produce  a  sonic  boom. 


Example:  Sound  waves  in  the  Earth’s  atmosphere 

Assuming  that  the  Earth’s  atmosphere  is  mainly  composed  of  diatomic 

nitrogen  molecules,  with  an  adiabatic  index  of  y  =  7/5  and  a  molecular 

_ 

mass  of  2  x  14  u,  where  the  atomic  mass  unit  u  =  1.660539  x  10“"  kg, 
the  sound  speed  at  ground  level,  where  the  temperature  is  T  =  288  K, 
is  about  cs=  [ykT/( 28u)]^2  «  346  m  s-1,  where  the  Boltzmann 

_ OQ  _  1 

constant  k  =  1.38065  x  10  ^  J  K  .  The  first  shout  from  a  drowning  man, 
located  1  km  out  in  the  ocean,  would  be  heard  at  the  beach  in  just  2.9  s. 
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We  detect  sound  by  small  changes  in  the  sound  pressure  against  our  vibrating 
eardrum,  above  and  below  the  normal  atmospheric  pressure.  The  threshold  of 
hearing  for  most  individuals  is  a  sound  pressure  of  2  x  10-5  Pa.  Sound  pressure  is 
inversely  proportional  to  distance  from  the  source  of  the  sound,  unlike  sound 
intensity  that  falls  off  with  the  inverse  square  of  the  distance. 

At  sea  level,  sound  waves  move  about  4.3  times  faster  in  water  than  in  air.  The 
larger  mass  density  of  water  slows  the  sound  waves  in  water  relative  to  air,  and  this 
nearly  makes  up  for  the  compressibility  differences  of  the  two  media.  Submarines 
use  pulses  of  sound  waves  to  detect  the  direction  and  distance  of  ships  or  the  depth 
of  the  ocean  floor.  The  acronym  SONAR  is  used  for  such  SOund  Navigation  And 
Ranging. 

Sound  waves  are  generated  by  turbulence  (Lighthill  1952,  1954;  Proudman 
1952),  and  it  is  such  turbulent  motions  that  give  rise  to  the  roar  of  a  jet  airplane 
engine.  The  convective  rise  and  fall  of  gas  in  the  outer  layers  of  the  Sun  also  create 
sound  waves  (Biermann  1948;  Schwarzschild  1948;  Schatzman  1949)  that  are 
produced  by  the  turbulent  motion  (Goldreich  and  Kumar  1990).  Most  of  the  solar 
sound  waves  are  trapped  inside  the  Sun,  and  they  are  used  to  investigate  its  internal 
properties  (Sect.  8.5). 

Within  the  Sun  we  can  assume,  to  a  first  approximation,  that  it  consists  of  a 
fully  ionized  monatomic  gas  with  an  adiabatic  index  of  y  =  5/3  =  1.667  and  the 

_ ‘"S'-J 

mass  is  dominated  by  the  protons,  with  a  mass  mP  =  1.67  x  10  "  kg.  Similar 
assumptions  apply  to  the  hot  expanding  solar  atmosphere,  resulting  in  the  super¬ 
sonic  solar  wind. 


Example:  Sound  speed  in  the  solar  wind 

A  perpetual  wind  of  protons  and  electrons  is  blowing  out  from  the  Sun  in  all 
directions  through  interplanetary  space.  At  its  origin  near  the  Sun,  the 

z 

temperature  is  10  K,  and  the  highly  conducting  wind  stays  nearly  that  hot 
all  the  way  to  the  Earth  and  beyond.  Using  this  temperature  with  y  =  1.667, 
a  proton  mass  of  mP  =  1.6726  x  10  kg,  and  the  Boltzmann  constant 
k  =  1.38065  x  10  J  K  ,  we  find  that  the  sound  speed  in  the  solar  wind 
is  cs  ~  (■ ykT/mP )1^2  «  105  m  s”1  or  100  km  s_1.  The  solar  wind  has  a  fast 
component  moving  at  a  speed  of  about  750  km  s-1  and  a  slow  one  moving 
at  about  half  that  speed,  so  the  solar  wind  is  always  supersonic,  with  a  speed 
exceeding  the  speed  of  sound. 


5.6.2  Magnetic  Waves 

In  addition  to  ponderous  material  particles,  like  atoms,  molecules,  electrons,  and 
ions,  there  are  also  magnetic  fields  that  permeate  the  universe.  The  trajectories  of 
charged  particles,  the  electrons  and  the  ions,  are  guided  by  these  magnetic  fields, 
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which  act  as  a  wall  to  them.  The  charges  can  spiral  around  the  magnetic  fields  but 
cannot  cross  them. 

The  theory  that  deals  with  the  interaction  of  a  hot  gas,  or  plasma,  and  a 
magnetic  field  is  called  magnetohydrodynamics,  or  MHD  for  short.  As  the  pon¬ 
derous  name  suggests,  the  equations  are  a  combination  of  those  of  electromag¬ 
netism  and  fluid  mechanics.  The  Swedish  electrical  engineer  Hannes  Alfven 
(1908-1995)  pioneered  the  study  of  MHD,  receiving  the  1970  Nobel  Prize  in 
Physics  for  this  work. 

Alfven  proposed  the  possible  existence  of  oscillations  produced  by  magnetic 
tensions,  which  are  now  known  as  Alfven  waves.  These  waves  propagate  in  the 
direction  of  the  magnetic  field  with  the  Alfven  velocity  VA  given  by  (Alfven 

1942a,  b): 


(5.39) 


where  B  is  the  magnetic  field  strength  in  tesla,  p  is  the  mass  density  in  units  of 

_ o  _ /:  _ o 

kg  m  ,  and  the  magnetic  permeability  p0  =  1.2566  x  1(TD  N  A  . 

Neglecting  the  contribution  of  electrons  to  the  mass  density  and  assuming  that 
there  is  a  single  ion  species  we  obtain: 


B 


(5.40) 


_ o 

where  Nt  is  the  number  density  of  the  ions  in  m  with  mass  mt  in  kg,  for  protons 
nii  =  1.6726  x  10  ^  kg.  The  ion  mass  density  provides  the  inertia  for  the 
oscillations  and  the  magnetic  field  provides  the  restoring  force.  The  ion  motion  and 
the  magnetic  field  perturbations  are  in  the  same  direction,  both  transverse  to  the 
direction  of  propagation. 

The  kinetic  energy  density  of  the  ions  is  given  by: 


\mv\Ni 


B2 

2/V 


(5.41) 


r\ 

which  is  equal  to  the  magnetic  field  energy  density  B  /( 2p0). 

Cosmic  magnetic  fields  are  always  being  jostled,  twisted  and  stirred  around, 
and  tension  acts  to  resist  the  motions  and  pull  the  disturbed  magnetism  back.  This 
generates  Alfven  waves  that  propagate  along  magnetic  fields,  somewhat  like  a 
vibrating  string.  These  waves  do  not  form  shocks,  and  once  generated  they  can 
propagate  large  distances,  directing  their  energy  along  open  magnetic  fields. 
Alfven  suggested  that  these  waves  could  contribute  to  the  heating  of  the  outer  solar 
atmosphere  (Alfven  1947). 

Instruments  aboard  Mariner  5  detected  magnetic  fluctuations  attributed  to 
large-amplitude  Alfven  waves  in  the  interplanetary  medium  during  the  space¬ 
craft’s  voyage  to  Venus  in  1967  (Belcher  et  al.  1969).  The  pressure  of  these  waves 
may  push  the  solar  wind  to  a  higher  speed  than  it  would  otherwise  have  (Cranmer 
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Fig.  5.7  Alfven  waves  The  velocity  amplitudes,  or  speeds,  of  Alfven  waves,  expressed  as 
transverse  velocities  of  the  oscillating  magnetic  held  lines,  versus  distance,  D,  above  the  visible 
disk  of  the  Sun  or  photosphere,  given  in  units  of  the  solar  radius  RQ.  The  solid  curve  fits  the  data 
observed  from  spacecraft  whose  instruments  are  specified  by  their  acronym.  The  left-most  two 
sets  of  data  (« dotted  and  dashed  curves )  represent  radial  motions  and  may  not  correspond  directly 
to  the  transversely  oscillating  Alfven  waves  (adapted  from  a  figure  provided  by  Steven  R. 
Cranmer) 

and  van  Ballegooijen  2005).  The  magnetic  waves  have  been  measured  for  more 
than  40  years  throughout  the  plane  of  our  solar  system,  both  near  to  and  far  from 
the  Sun  (Fig.  5.7).  Magnetometers  aboard  the  Ulysses  spacecraft  have  detected  the 
effects  of  Alfven  waves  above  the  Sun’s  polar  regions,  and  observations  from  the 
Hinode  spacecraft  have  found  their  signatures  near  the  visible  solar  disk,  with 
perhaps  enough  energy  to  power  the  Sun’s  winds  (De  Pontieu  et  al.  2007). 


Example:  Alfven  waves  in  the  interplanetary  medium 

Spacecraft  measure  an  interplanetary  magnetic  field  strength  just  outside  the 
Earth  of  B  =  2.5  x  10-9  T.  The  observed  proton  density  in  the  solar  wind  at 

/T  _ Q 

the  Earth’s  orbit  is  NP  =  5  x  10  m  for  protons  of  mass  mP  =  1.67  x 
10-27  kg.  The  associated  Alfven  velocity  is  VA  =  B/(iUoNpmp)1^2  = 
2.44  x  104  m  s'1  =  24  km  s-1,  with  /i0  =  1.2566  x  10~6  N  A-2.  The 
magnetic  fields  are  tied  to  the  Sun  at  one  end,  and  stretch  out  into  interplanetary 
space  at  the  other,  with  a  strength  that  is  inversely  proportional  to  the  distance. 
The  proton  density  is  proportional  to  the  inverse  square  of  the  distance,  but  the 
Alfven  velocity  goes  as  the  inverse  square  root  of  this  density.  So  the  two 
distance  variations  cancel,  and  we  expect  comparable  Alfven  velocities 
throughout  interplanetary  space,  which  have  in  fact  been  observed  from  10  to 
100  km  s'1  (Fig.  5.7). 


Chapter  6 

Detecting  Atoms  in  Stars 


6.1  What  is  the  Sun  Made  Out  Of? 

When  sunlight  is  spread  out  into  its  different  colors  or  wavelengths,  it  is  cut  by 
several  dark  gaps.  They  were  first  noticed  by  the  English  astronomer  William 
Hyde  Wollaston  (1766-1828)  in  1802  (Wollaston  1802),  and  then  investigated  in 
greater  detail  by  German  astronomer  Joseph  von  Fraunhofer  (1787-1826). 
Fraunhofer  had  detected  and  catalogued  more  than  300  gaps,  assigning  Roman 
letters  to  the  most  prominent  (Fraunhofer  1817). 

By  directing  the  incoming  sunlight  through  a  slit  and  then  dispersing  it  with  a 
prism,  Fraunhofer  was  able  to  overcome  the  blurring  of  colors  from  different  parts 
of  the  Sun’s  disk,  discovering  numerous  dark  features  in  this  spectral  display. 
When  coarse  wavelength  resolution  is  used,  adjacent  bright  emission  obscures  the 
dark  places,  which  are  no  longer  found. 

The  dark  gaps  of  missing  colors  found  in  a  display  of  the  Sun’s  radiation 
intensity  as  a  function  of  wavelength,  or  in  its  spectrum,  are  now  called  lines 
because  they  each  look  like  a  line  in  the  spectral  display.  They  are  designated 
further  as  absorption  lines  because  they  are  produced  when  atoms  in  a  cool, 
tenuous  gas  absorb  the  radiation  of  hot,  dense  underlying  material.  They  are  also 
known  as  Fraunhofer  absorption  lines,  in  recognition  of  his  work.  Such  lines  also 
can  appear  in  emission  when  a  gas  is  heated,  and  they  are  known  as  emission  lines. 

Both  absorption  and  emission  lines  (Fig.  6.1)  identify  the  ingredients  of  the 
cosmos.  For  stars,  astronomers  mainly  use  absorption  lines  to  determine  the 
composition  of  their  relatively  cool,  outer  atmospheres.  In  contrast,  emission  lines 
reveal  the  ingredients  of  the  hotter,  rarefied  emission  nebulae. 

The  Sun  is  so  bright  that  its  light  can  be  spread  out  into  small  wavelength 
intervals  with  enough  intensity  to  be  detected,  thereby  displaying  numerous  dark 
absorption  lines  (Fig.  6.2).  One  instrument  used  to  make  and  record  such  a 
spectrum  is  called  a  spectroheliograph,  a  composite  word  consisting  of  spectro  for 
“spectrum,”  helio  for  the  “Sun,”  and  graph  for  “record”  (Fig.  6.3).  It  uses  the 
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Fig.  6.1  Absorption  and  emission  lines  The  spectrum  of  a  star  or  other  cosmic  object  displays 
the  intensity  of  its  radiation  as  a  function  of  wavelength,  denoted  by  the  Greek  symbol  lambda,  L 
The  object’s  continuum  radiation  can  be  detected  at  all  wavelengths.  Atoms  can  produce 
absorption  at  a  specific  wavelength.  This  feature  is  called  an  absorption  line  because  it  looks  like 
a  line  in  the  spectrum.  When  atoms  are  excited  at  high  temperatures,  they  can  radiate  an  emission 
line.  The  motion  of  the  absorbing  or  emitting  atoms  broadens  these  lines  (see  Fig.  6.6).  The  line 
wavelength  indicates  which  atom  is  responsible  for  the  absorption  or  emission,  and  the  intensity 
of  a  stellar  line  is  related  to  both  the  number  of  atoms  and  the  physical  conditions  in  the  star’s 
atmosphere.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge 
University  Press,  2013.  Reprinted  with  permission.) 

grooves  of  a  diffraction  grating  to  reflect  sunlight  into  different  locations  according 
to  color  or  wavelength,  similar  to  what  a  rainbow  and  compact  disk  do. 

The  Sun’s  absorption  lines  provided  the  first  clues  to  the  composition  of  the 
stars.  In  the  mid- 19th  century,  the  German  physicist  Gustav  Kirchhoff 
(1824-1887)  and  his  chemist  colleague  Robert  Bunsen  (1811-1899)  showed  that 
every  chemical  element  when  burned  and  vaporized  into  a  gas  emits  brightly 
colored  lines.  And  the  unique  wavelengths  of  these  lines  coincide  with  those  of  the 
dark  absorption  lines  in  the  Sun’s  spectrum. 

When  Kirchhoff  and  Bunsen  vaporized  an  individual  element  in  a  flame,  the  hot 
vapor  produced  a  distinctive  pattern  of  sharply  defined,  bright  lines.  Moreover,  when 
the  light  produced  by  a  hot  radiating  object,  such  as  a  tungsten  lamp,  was  passed 
through  the  cooler  vaporized  gas,  dark  lines  were  produced  at  exactly  the  same 
locations.  Kirchhoff  generalized  this  into  a  law  stating  that  the  powers  of  emission 
and  absorption  of  a  body  at  any  particular  wavelength  are  the  same  at  any  given 
temperature.  He  also  concluded  that  the  visible  solar  disk  was  hot  and  incandescent, 
producing  a  continuum  spectrum  (the  sort  without  lines),  which  became  crossed  by 
the  dark  Fraunhofer  lines  when  passing  through  cooler  overlying  gas. 

By  comparing  the  Sun’s  absorption  lines  with  the  emission  lines  of  elements 
vaporized  in  the  laboratory,  Kirchhoff  identified  in  the  solar  atmosphere  several 
elements  known  on  the  Earth.  The  lines  designated  by  Fraunhofer  with  the  letters 
H  and  K  were  associated  with  calcium,  and  iron  was  assigned  the  letter  E. 
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Fig.  6.2  Solar  spectrum  The  visible  portion  of  the  Sun’s  radiation  has  been  dispersed  into  its 
spectral  components,  displaying  radiation  intensity  as  a  function  of  wavelength.  When  we  pass 
from  long  wavelengths  to  shorter  ones  ( left-right ,  top— bottom),  the  spectrum  ranges  from  red 
thorough  orange,  yellow,  green,  blue  and  violet.  Dark  gaps  in  the  spectrum,  called  Fraunhofer 
absorption  lines,  are  due  to  absorption  by  atoms  in  the  Sun.  The  wavelengths  of  these  absorption 
lines  can  be  used  to  identify  the  elements  in  the  Sun,  and  the  relative  darkness  of  the  lines  helps 
establish  the  relative  abundance  of  these  elements.  This  high-resolution  version  of  the  spectrum 
of  our  Sun  was  created  from  a  digital  atlas  observed  with  the  Fourier  Transform  Spectrometer  at 
the  McMahon-Pierce  Solar  Facility  at  the  National  Solar  Observatory  on  Kitt  Peak,  near  Tucson 
Arizona.  (Courtesy  N.  A.  Sharp,  NOAO/NSO/Kitt  Peak  FTS/AURA/NSF.) 

The  letter  b  was  attributed  to  magnesium,  and  the  close  pair  of  dark  lines  in  the 
yellow,  specified  by  the  letter  D,  was  attributed  to  sodium;  they  produce  the 
distinctive  yellow  color  of  sodium  vapor  streetlights  used  at  the  time.  This  sug¬ 
gested  that  the  Sun  -  and  presumably  all  stars  -  are  composed  of  terrestrial 
elements  (Kirchhoff  and  Bunsen  1860,  1861;  Kirchhoff  1861a,  b). 

o 

The  Swedish  spectroscopist  Anders  Jonas  Angstrom  (1814-1874)  subsequently 
identified  hydrogen  in  the  solar  spectrum;  it  is  associated  with  Fraunhofer’s  letters 

o 

C  and  F.  Angstrom  published  a  comprehensive  atlas  of  more  than  1 ,000  absorption 
lines  in  the  Sun’s  spectrum,  identifying  them  with  hydrogen,  sodium,  calcium, 
barium,  strontium,  magnesium,  copper,  iron,  chromium,  nickel,  cobalt,  zinc,  and 

o 

gold  (Angstrom  1868).  His  scale  of  wavelengths  for  measuring  the  spectral  lines  is 

o  o 

still  used  by  some  astronomers;  it  is  now  named  the  Angstrom,  abbreviated  A,  with 
1  A  =  10-10  m  =  0.1  nm. 

Unfortunately,  the  Fraunhofer  lines  designated  A  and  B  are  not  related  to  the 
composition  of  the  Sun.  They  only  appear  in  spectra  gathered  beneath  the  Earth’s 
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Sun 


Fig.  6.3  Spectroheliograph  A  small  section  of  the  Sun’s  image  at  the  focal  plane  of  a  telescope 
is  selected  with  a  narrow  entry  slit,  Si,  and  this  light  passes  to  a  diffraction  grating  to  produce  a 
spectrum.  A  second  slit,  S2,  at  the  focal  plane  selects  a  specific  wavelength  from  the  spectrum.  If 
the  plate  containing  the  two  slits  is  moved  horizontally,  then  the  entrance  slit  passes  adjacent 
strips  of  the  solar  image.  The  light  leaving  the  moving  exit  slit  then  builds  up  an  image  of  the  Sun 
at  a  specific  wavelength 


atmosphere,  and  are  caused  by  oxygen  in  the  terrestrial  atmosphere.  The  oxygen 
molecules  absorb  sunlight  at  the  wavelengths  of  the  A  and  B  Fraunhofer  lines, 
creating  the  dark  lines  that  are  superposed  on  the  Sun’s  spectrum. 

Still,  there  are  some  very  strong  absorption  lines  that  are  due  to  the  Sun,  and 
they  extract  large  amounts  of  energy  from  sunlight.  They  are  produced  by 
hydrogen,  sodium,  magnesium,  calcium,  and  iron  (Table  6.1),  but  iron  accounts 
for  more  lines  than  any  other  element.  Because  abundant  heavy  iron  accounts  for 
the  Earth’s  high  average  mass  density  and  because  most  of  the  other  solar  lines 
corresponded  to  elements  known  on  the  Earth,  it  was  initially  thought  that  the  Sun 
is  made  of  the  same  material  as  the  Earth;  but  this  turned  out  to  be  only  partly  true. 
Many  of  the  visible  spectral  lines  are  associated  with  hydrogen,  a  relatively  rare 
element  on  Earth  (Abundant  hydrogen  is  found  bound  up  in  the  Earth’s  water,  but 
hydrogen  gas  escapes  from  the  Earth  and  is  not  found  in  noticeable  quantities  in  its 
atmosphere.).  Hydrogen  is  nevertheless  the  most  abundant  element  in  the  universe. 

The  Earth  is  primarily  made  of  heavy  elements  that  are  relatively  uncommon  in 
the  Sun  and  the  rest  of  the  universe.  Hydrogen  is  about  1  million  times  more 
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Table  6.1  Prominent  absorption  lines  and  elements  detected  in  sunlight"1 


Wavelength  (nm) 

Fraunhofer  letter 

Element  symbol  and  name 

393.368 

K 

Ionized  calcium,  Ca  II 

396.849 

H 

Ionized  calcium,  Ca  II 

410.175 

h 

Hydrogen,  H§  B  aimer  delta  transition 

422.674 

g 

Calcium,  Ca  I 

431.0 

G 

CH  molecule 

434.048 

Hydrogen,  H„,  B aimer  gamma  transition 

438.356 

d 

Iron,  Fe  I 

486.134 

F 

Hydrogen,  Up,  B aimer  beta  transition 

516.733 

b4 

Magnesium,  Mg  I 

517.270 

b2 

Magnesium,  Mg  I 

518.362 

hi 

Magnesium,  Mg  I 

526.955 

E 

Iron,  Fe  I 

588.997b 

d2 

Sodium,  Na  I 

589.594 

D, 

Sodium,  Na  I 

656.281 

c 

Hydrogen,  Ha,  B aimer  alpha  transition 

686.719 

B 

Molecular  oxygen,  02,  in  the  Earth’s  atmosphere 

759.370 

A 

Molecular  oxygen,  02,  in  the  Earth’s  atmosphere 

a  The  photosphere  is  the  visible  solar  disk.  The  wavelengths  are  in  nanometer  units,  where 
1  nanometer  =  1  nm  =10  m.  Astronomers  have  often  used  the  Angstrom  unit  of  wave- 

o  o 

length,  where  1  Angstrom  =  1  A  =  0.1  nm.  Joseph  von  Fraunhofer  used  letters  to  designate  the 
spectral  lines  before  they  were  chemically  identified,  and  the  subscripts  denote  components  that 
were  not  resolved  by  Fraunhofer.  A  Roman  numeral  I  after  an  element  symbol  denotes  an 
electrically  neutral,  or  unionized,  atom,  with  no  electrons  missing,  whereas  the  Roman  numeral  II 
denotes  a  singly  ionized  atom  with  one  electron  missing.  The  lines  A  and  B  are  produced  by 
molecular  oxygen  in  the  terrestrial  atmosphere 

b  Fraunhofer’s  D  line  includes  the  two  sodium  lines,  designated  and  D2,  and  the  helium  line  at 
587.6  nm,  designated  D3 


abundant  than  iron  in  the  Sun,  but  iron  is  a  main  constituent  of  the  Earth,  which 
cannot  even  retain  hydrogen  gas  in  its  atmosphere  for  any  significant  length  of 
time.  Asplund  et  al.  (2009)  have  reviewed  observations  of  the  chemical  compo¬ 
sition  of  the  Sun. 

Helium,  the  second-most  abundant  element  in  the  Sun,  is  so  rare  on  the  Earth 
that  it  was  first  discovered  in  the  Sun.  The  French  astronomer  Pierre  Jules  Cesar 
Janssen  (1824-1927)  observed  an  unidentified  yellow  emission  line,  with  a 
wavelength  of  587.49  nm,  in  the  solar  spectrum  during  the  solar  eclipse  on  August 
18,  1868,  which  he  observed  from  India  (Janssen  1868).  The  emission  originated 
in  the  chromosphere,  a  thin,  slightly  hotter  layer  of  gas  that  lies  just  above  the 
visible  solar  disk,  or  photosphere,  which  became  visible  when  the  Moon  blocked 
the  bright  glare  of  the  photosphere. 

On  October  20,  1868,  the  British  astronomer  Sir  Joseph  Norman  Lockyer 
(1836-1920)  found  the  same  yellow  line  in  the  solar  spectrum,  but  he  didn’t  need  a 
solar  eclipse  (Lockyer  1869,  1887).  It  was  probably  not  until  the  following  year 
that  Lockyer  became  convinced  that  the  yellow  line  he  saw  could  not  be  identified 
with  any  known  element  on  Earth,  and  named  the  element  “helium”  after  the 
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Table  6.2  The  twenty  most  abundant  elements  in  the  Sun 


Element 

Symbol 

Atomic  number,  Z 

Abundance21  (logarithmic) 

Discovery  on  Earth 

Hydrogen 

H 

1 

12.00 

1766 

Helium 

He 

2 

[10.93  ±  0.01] 

1895b 

Carbon 

C 

6 

8.43  ±  0.05 

(ancient) 

Nitrogen 

N 

7 

7.83  ±  0.05 

1772 

Oxygen 

O 

8 

8.69  ±  0.05 

1774 

Neon 

Ne 

10 

[7.93  ±  0.10] 

1898 

Sodium 

Na 

11 

6.24  ±  0.04 

1807 

Magnesium 

Mg 

12 

7.60  ±  0.04 

1755 

Aluminum 

Al 

13 

6.45  ±  0.03 

1827 

Silicon 

Si 

14 

7.51  ±  0.03 

1823 

Phosphorus 

P 

15 

5.41  ±  0.03 

1669 

Sulfur 

S 

16 

7.12  ±  0.03 

(ancient) 

Chlorine 

Cl 

17 

5.50  ±  0.30 

1774 

Argon 

Ar 

18 

[6.40  ±  0.13] 

1894 

Potassium 

K 

19 

5.03  ±  0.09 

1807 

Calcium 

Ca 

20 

6.34  ±  0.04 

1808 

Chromium 

Cr 

24 

5.64  ±  0.04 

1797 

Manganese 

Mn 

25 

5.43  ±  0.04 

1774 

Iron 

Fe 

26 

7.50  ±  0.04 

(ancient) 

Nickel 

Ni 

28 

6.22  ±  0.04 

1751 

a  Logarithm  of  the  abundance  in  the  solar  photosphere,  normalized  to  hydrogen  =  12.00,  or  an 
abundance  of  1.00  x  10  .  Indirect  solar  estimates  are  marked  with  [].  The  data  are  from  Asplund 

et  al.  (2009) 

b  Helium  was  discovered  on  the  Sun  in  1868,  but  it  was  not  found  on  Earth  until  1895 


Greek  Sun  god,  Helios ,  who  daily  traveled  across  the  sky  in  a  chariot  of  fire  drawn 
by  four  swift  horses. 

Helium  was  not  found  in  the  Earth  until  27  years  after  its  discovery  in  the  Sun. 
Then  the  Scottish  chemist  Sir  William  Ramsay  (1852-1916)  discovered  its 
spectral  signature  in  a  gaseous  emission  given  off  by  the  uranium  mineral,  cleveite, 
when  it  was  heated  (Ramsay  1895).  Ramsay  received  the  Nobel  Prize  in  Chemistry 
in  1904  for  his  discovery  of  noble,  or  inert,  gaseous  elements  in  the  air;  they 
include  helium,  neon,  argon,  krypton,  xenon  and  radon.  These  so-called  noble 
gases  do  not  combine  with  most  other  chemical  elements,  which  is  one  reason  it 
took  so  long  to  discover  helium  on  Earth. 

Today,  helium  is  used  on  the  Earth  in  a  variety  of  ways,  including  inflating 
party  balloons  and,  in  its  liquid  state,  to  keep  sensitive  electronic  equipment  cold. 
Although  plentiful  in  the  Sun,  helium  is  almost  nonexistent  on  the  Earth.  It  is  so 
terrestrially  rare  that  we  are  in  danger  of  running  out  of  helium  during  this  century. 
There  have  been  reports  that  Japanese  scientists  plan  to  mine  helium  from  the 
Moon’s  surface,  where  it  has  been  implanted  by  winds  from  the  Sun. 

Altogether,  92.1  %  of  the  atoms  of  the  Sun  are  hydrogen  atoms,  7.8  %  are 
helium  atoms,  and  all  of  the  heavier  elements  comprise  only  0.1  %.  In  contrast,  the 
main  ingredients  of  the  Earth  are  the  heavier  elements  such  as  silicon  and  iron, 
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Atomic  Number,  Z 

Fig.  6.4  Elemental  abundance  in  solar  photosphere  The  relative  abundance  of  the  elements  in 
the  Sun’s  visible  disk,  the  photosphere,  plotted  as  a  function  of  atomic  number.  The  atomic  number, 
denoted  by  Z,  is  the  number  of  protons  in  an  atom’s  nucleus,  or  roughly  half  the  atomic  weight. 
Heavy  elements,  with  high  atomic  numbers,  are  less  abundant  than  light  ones,  with  low  atomic 
numbers;  the  most  abundant  element  in  the  Sun  is  the  lightest  element,  hydrogen.  Helium  is  the 
second  most  abundant  solar  element.  The  abundance  data  are  plotted  in  a  logarithmic  scale 
normalized  to  1  million  million,  or  1.0  x  10  ,  for  hydrogen.  (Adapted  from  Asplund  et  al.  2009.) 


which  explains  the  Earth’s  high  mass  density  -  about  four  times  that  of  the  Sun, 
which  is  only  about  as  dense  as  water. 

By  mass,  hydrogen  accounts  for  71.54  %  of  the  Sun;  the  helium  amounts  to 
27.03  %  by  mass,  because  the  helium  atom  is  about  four  times  more  massive  than  the 
hydrogen  atom.  All  of  the  heavier  solar  elements  amount  to  only  1.42  %  by  mass. 

The  abundance  of  the  20  most  abundant  elements  in  the  Sun  are  listed  in 
Table  6.2,  normalized  to  a  hydrogen  abundance  of  1  million  million,  or  1012. 
There  is  a  systematic  decrease  in  the  abundance  of  solar  elements  with  increasing 
atomic  number  (Fig.  6.4),  but  with  a  noticeable  gap  of  unexpectedly  low  abun¬ 
dance  for  the  light  elements  between  helium  and  carbon. 


6.2  Quantization  of  Atomic  Systems 

Most  of  the  mass  of  an  atom  is  concentrated  in  its  relatively  small  nucleus,  which 
is  surrounded  by  electrons  (Sect.  5.1).  The  nucleus  has  a  positive  charge  due  to  the 
protons  in  it  and  is  about  100,000  times  smaller  than  the  atom.  The  negatively 
charged  electrons  keep  the  atom  distended,  enlarging  its  shape.  As  a  result,  an 
atom  is  mostly  empty  space. 
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Fig.  6.5  Balmer  lines  of  hydrogen  The  spectrum  of  the  Sun’s  optically  visible  light  exhibits 
four  strong  absorption  lines  that  are  attributed  to  hydrogen,  whose  line  wavelengths  are  spaced 
closer  together  at  shorter  wavelengths  (top).  These  lines  are  designated  Ha  at  a  red  wavelength  of 
656.3  nm,  Up  at  a  wavelength  of  486.1  nm,  Hy  at  the  blue  434.1  nm  and  He)  at  the  violet 
410.2  nm,  where  1  nanometer  =  1  nm  =  10-9  m.  These  spectral  features  originate  when  an 
electron  in  a  hydrogen  atom  moves  from  a  low  to  a  high  electron  orbit,  the  orbital  energy  of 
which  is  a  function  of  the  integer  n  (bottom).  They  were  named  Balmer  lines  after  the  Swiss 
mathematics  teacher  Johann  Balmer  (1825-1898),  who  first  derived  an  equation  that  describes 
their  wavelengths  in  terms  of  integers.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang, 
published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


According  to  Rutherford’s  model  of  the  atom,  the  electrons  revolve  around  the 
central  nucleus,  somewhat  like  the  planets  that  endlessly  whirl  around  the  Sun. 
Unlike  a  planet,  an  electron  is  electrically  charged,  and  a  revolving  charge  emits 
electromagnetic  radiation.  That  is  how  radio  signals  are  broadcast,  by  moving 
electrons  through  wires  to  generate  radiation. 

This  means  that  something  was  wrong  with  Rutherford’s  model.  An  electron 
revolving  in  an  atom-sized  orbit  radiates  light  waves,  and  as  a  result  of  this 
emission,  it  will  lose  its  kinetic  energy  of  motion.  As  a  result,  an  electron  cannot 
be  perpetually  moving  around  the  nucleus.  An  atomic  electron  should  lose  its 
orbital  motion  and  spiral  into  the  atom’s  nucleus  in  less  than  1  s.  So,  the  idea  of 
electron  orbits  did  not  seem  to  work. 

An  unexpected  feature  of  the  Sun’s  hydrogen  lines,  which  are  produced  by  an 
orbital  change  of  the  hydrogen  atom’s  single  electron,  provided  a  clue  to  this 
enigma.  That  is,  the  wavelengths  of  spectral  lines  from  hydrogen  indicate  that 
electrons  have  to  follow  certain  rules  if  they  want  to  belong  to  an  atom.  Not  just 
any  behavior  is  allowed  and  only  certain  orbits  are  permitted. 

Adjacent  hydrogen  lines  in  the  spectrum  of  the  Sun  or  any  other  cosmic  object 
systematically  crowd  together  at  shorter  wavelengths  (Fig.  6.5).  The  Swiss 
mathematics  teacher  Johann  Balmer  (1825-1898)  found  a  simple  equation  that 
describes  their  regular  spacing.  The  four  lines  in  the  Sun’s  visible  hydrogen 
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Balmer  Series 


Lyman  Series 


Fig.  6.6  Bohr  atom  In  this  model,  proposed  in  1913  by  the  Danish  physicist  Niels  Bohr 
(1885-1962),  a  hydrogen  atom’s  one  electron  revolves  around  the  hydrogen  nucleus,  a  single 
proton,  in  well-defined  orbits  described  by  the  integer  n  =  1,  2,  3,  4,  5,  . . ..  An  electron  absorbs  or 
emits  radiation  when  it  transitions  between  these  allowed  orbits.  The  electron  can  jump  upward, 
to  orbits  with  larger  n,  by  absorption  of  a  radiation  photon  of  exactly  the  right  energy,  equal  to  the 
energy  difference  between  the  orbits;  the  electron  can  jump  down  to  lower  orbits,  of  smaller  n, 
with  the  emission  of  radiation  of  that  same  energy  and  wavelength.  Transitions  that  begin  or  end 
on  the  n  =  2  orbit  define  the  Balmer  series  observed  at  visible  wavelengths.  They  are  designated 
by  Ha,  H/f  Hy,....  The  Lyman  series,  with  transitions  from  the  first  orbit  at  n  =  1,  is  detected  at 
ultraviolet  wavelengths.  The  orbits  are  not  drawn  to  scale  because  the  size  of  their  radius 
increases  with  the  square  of  the  integer  n 


spectrum  have  wavelengths  of  410,  434,  486,  and  656  nm,  and  they  are  still  known 
as  the  Balmer  lines.  Balmer’ s  equation  that  specified  their  wavelengths,  2,  is 
(Balmer  1885): 


A 


=  K 


m 


m2  —  22 


_ n 

where  the  constant  K  =  3.6456  x  10  m  =  364.56  nm,  the  integer  n  =  2  and 
the  integers  m  =  3,  4,  5,  ....  The  most  intense  Balmer  line  is  now  known  as  the 
Balmer  hydrogen  alpha  line ,  designated  Ha,  at  a  red  wavelength  at  656.3  nm, 
where  1  nanometer  =  1  nm  =  10-9  m  =  10  A  =  10  Angstrom. 

A  few  years  later,  the  Swedish  physicist  Johannes  Rydberg  (1854-1919)  gen¬ 
eralized  the  Balmer  equation  for  all  transitions  of  hydrogen,  in  the  Rydberg  for¬ 
mula  (Rydberg  1890): 


1 

I 


where  m  and  n  are  integers,  the  Rydberg  constant  for  hydrogen 
Roo  =  10,973,731.57  m-1  =  1.097373157  x  107  m-1  =  4/K  where  the  K  is  in 
Balmer’ s  equation  where  m  =  2,  the  n  =  3,  4,  5  (the  m  and  n  have  been  inter¬ 
changed  from  Balmer’ s  formula.)  Rydberg’s  formula  was  subsequently  used  to 
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predict  spectral  lines  observed  at  the  invisible  infrared,  ultraviolet  and  radio 
wavelengths. 

The  Danish  physicist  Niels  Bohr  (1885-1962)  explained  Balmer’s  equation  by 
a  model  of  the  hydrogen  atom,  now  known  as  the  Bohr  atom  (Bohr  1913).  In  this 
model,  a  single  electron  in  a  hydrogen  atom  revolves  about  the  nuclear  proton  in 
specific  orbits  with  definite,  quantized  values  of  energy  (Fig.  6.6).  An  electron 
only  emits  or  absorbs  radiation  when  jumping  between  those  allowed  orbits,  each 
jump  associated  with  a  specific  energy  and  a  single  wavelength.  If  an  electron 
jumps  from  a  low-energy  to  a  high-energy  orbit,  it  absorbs  radiation  at  this 
wavelength;  radiation  is  emitted  at  exactly  the  same  wavelength  when  the  electron 
jumps  the  opposite  way.  This  unique  wavelength  is  related  to  the  difference 
between  the  two  orbital  energies.  Bohr  was  awarded  the  1912  Nobel  Prize  in 
Physics  for  these  investigations  of  the  structure  of  atoms  and  the  radiation  ema¬ 
nating  from  them. 

Because  only  quantized  orbits  are  allowed,  spectral  lines  are  produced  only  at 
specific  wavelengths  that  characterize  or  identify  an  atom.  An  atom  or  molecule 
can  absorb  or  emit  a  particular  type  of  light  only  if  it  resonates  to  that  light’s 
energy.  As  it  turns  out,  the  resonating  wavelengths  or  energies  of  each  atom  are 
unique. 

A  little  more  than  a  decade  before  Bohr  introduced  his  model,  Planck  (1901) 
explained  continuum  thermal  radiation,  without  lines,  by  supposing  that  the 
emission  and  absorption  of  light  takes  place  only  in  the  form  of  certain  discrete 
portions,  or  quanta,  of  energy,  now  known  as  photons  (See  Sect.  2.4).  He  had 
quantized  radiation,  giving  a  photon  an  energy  hv  at  frequency  v.  The  constant  h  is 

_ 0  4 

the  Planck  constant,  with  the  value  h  =  6.626  x  10  J  s.  The  energy  of  each 
separate  light  quantum,  or  photon,  is  proportional  to  the  frequency  of  light,  or 
inversely  proportional  to  its  wavelength. 

Bohr  went  one  step  farther  and  quantized  the  energy  of  motion  of  the  electrons 
revolving  in  an  atom.  He  proposed  that  the  mechanical  energy  of  any  moving 
subatomic  particle  could  take  on  only  one  of  a  certain  set  of  discrete  values,  in  an 
entirely  new  quantum  mechanics.  That  is,  he  quantized  the  electron’s  angular 
momentum  and  energy,  using  the  Planck  constant,  h.  Because  the  quantum  of  an 
electron’s  orbital  energy  increases  with  the  decreasing  dimensions  of  the  orbit,  this 
suggested  that  quantum  mechanics  would  become  important  only  for  very  small, 
subatomic  physical  scales. 

In  the  Bohr  atom,  the  single  electron  of  a  hydrogen  atom  is  said  to  orbit  the 
atom’s  nuclear  proton  with  an  angular  momentum,  me\rn ,  that  is  quantized,  or 
(Bohr  1913): 


meV rn  =  hn / (In) ,  (6.3) 

where  the  mass  of  the  electron  me  =  9.109  x  10  kg,  the  electron’s  orbital 
velocity  is  V,  its  orbital  radius  is  rn  for  the  integer  n  =  1,  2,  3  . . .,  Planck’s  constant 
h  =  6.626  x  1CT34  J  s,  and  n  =  3.14159. 
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The  radius,  rn,  of  the  nth  electron  orbit  is  derived  by  equating  the  Coulomb 
force  of  attraction  of  the  proton,  of  charge  e ,  on  the  electron,  also  of  charge  e ,  to 
the  centripetal  force,  or 


to  obtain 


1  e2  _  mV2 
4ns0  r2n  rn 


where  the  radius,  a0,  of  the  first  Bohr  orbit  of  hydrogen  is: 

ao  =  — - — -  ~  5.2918  x  10-11  m,  (6-6) 

nmeez 

_ i  o  _ i 

with  an  electrical  constant  s0  =  8.8542  x  10  “  F  m  ,  the  Planck  constant 

'2/t 

h  =  6.6261  x  10  J  s,  the  constant  n  =  3.14159,  the  electron  mass 
me  =  9.1094  x  10-31  kg,  and  the  elementary  charge  e  =  1.6022  x  10“ 19  C. 

The  electron  orbital  energy  is  also  quantized,  which  can  be  seen  by  solving  for 
its  kinetic  energy  meV  / 2.  That  is,  electrons  can  only  occupy  orbits  with  allowed 
orbital  energy,  En,  given  by: 


mee 4  hcRoQ 

RefyEn2  n2 


where  the  Rydberg  constant,  R is: 


4 

mee 
8sq  h3c 


10,973,731.568  m_1 


«  1.097  x  107m_1, 


and  the  speed  of  light  c  =  2.9979  x  108  m  s”1. 

The  constant  hcR ^  =  13.  60569  eV  =  1  Ry  is  known  as  the  Rydberg  unit  of 
energy,  and  is  often  used  in  atomic  physics.  The  electron  volt  is  equivalent  to 
1  eV  =  1.602176  x  10-19  J. 

The  permitted  orbital  energy  increases  with  decreasing  integer,  or  quantum 
number,  n ,  which  corresponds  to  decreasing  radius  or  size  of  the  electron  orbit. 
The  closer  an  electron  is  to  the  nucleus,  the  greater  is  its  allowed  energy.  Both  the 
energy  and  radius  of  an  electron’s  orbit  vary  as  the  inverse  square  of  the  quantum 
number. 

The  nth  energy  level  of  the  hydrogen  atom  has  energy 
En  =  13.6  6V/n  =  hcR^/n  ,  where  13.6  eV  is  the  ionization  potential  of 
hydrogen  and  1  eV  =  1.602  x  10“ 19  J.  The  temperature  required  to  ionize 
hydrogen,  removing  its  sole  electron  from  its  ground  state  is  about  1.6  x  105  K, 
obtained  from  setting  the  temperature  equal  to  13.6  eV  divided  by  the  Boltzmann 
constant  k  —  1.38  x  10-23  JK”1. 
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The  energy  difference  between  two  energy  levels  specified  by  integers  n  and  m  is: 

En  -  Em  =  (  4  -  ^  ]  x  13.5609  eV.  (6.9) 

\nz  mz  J 

Radiation  can  be  emitted  or  absorbed  by  a  hydrogen  atom  when  the  electron 
changes  between  allowed  orbits,  with  the  frequency  vmn  of  the  m  —  n  transition 
given  by 
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where  1 1  denotes  the  absolute  value,  and  the  wavelength  Amn  of  the  m  —  n  transition 
can  be  inferred  from  the  frequency  by  2mn  =  c/vmn  where  c  =  2.9979  x  108  m  s  1 
is  the  speed  of  light. 

The  wavelength,  2mn,  of  the  radiation  emitted  or  absorbed  in  a  radiative  tran¬ 
sition  between  the  two  levels  will  be: 


he 


X)  X  911.5  A, 

mzJ 


(6.11) 


O  _  1  A  O 

where  1A=10  m  =  0.1  nm  is  one  Angstrom. 

The  most  intense  spectral  line  for  any  given  value  of  n  is  the  transition  with  m  —  n 
=  1 ,  known  as  the  alpha,  or  a,  transition,  and  the  next  most  intense  line  is  for  the  beta, 
or  /?,  transition  with  m  —  n  =  2,  and  so  on  for  increasing  m  —  n.  The  difference  in 
orbital  energies  for  the  a  transition  is  smaller  than  any  other  transition,  and  therefore 
the  easiest  to  accomplish;  more  atoms  will  undergo  this  transition  than  other  atoms, 
and  the  line  is  therefore  the  most  intense.  At  higher  m  —  n,  the  difference  in  orbital 
energies  is  greater  and  the  transition  is  more  energetic;  but  there  are  fewer  atoms 
undergoing  the  transition  and  the  intensity  of  the  line  is  lower. 

For  any  n ,  the  frequency  of  the  a  transition  is  given  by: 


2cRoo 
y  mn  o 


(6.12) 


Example:  Recombination  lines  at  radio  wavelengths 

Electron  transitions  at  high  quantum  numbers  n  have  been  detected  from 
hydrogen  atoms  surrounding  very  hot  stars.  For  the  m  —  n  =  1,  or  the  a 
transition,  radiation  from  the  n  =  109  transition  has  been  observed.  The  fre- 
quency  of  this  transition  is  vmn  =  IcR^/n  =  5.079  x  10  Hz  = 
5,079  MHz,  where  the  speed  of  light  c  =  2.9979  x  108  m  s-1  and  the 
Rydberg  constant  R  ^  =  1.097  x  10  m  .  The  radius  of  the  electron  orbit  is 
rn  =  a0n  «  6.29  x  10  m,  where  the  Bohr  radius  a0  =  5.2918  x 
10-11  m,  so  the  orbit  radius  is  much  larger  than  the  average  radius  of  an  atom, 
usually  about  10“ 10  m. 
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The  a  transition  detected  at  visible  wavelengths  is  called  the  B  aimer  a  transi- 

o 

tion,  at  a  red  wavelength  of  6562.8  A  while  the  ultraviolet  a  transition  is  known  as 
Lyman  a  at  1215.67  A,  where  1  A  =  10-10  m. 

The  Lyman  series,  for  n  =  1,  includes 


Designation 

m 

(a) 

Lyoc 

2 

1,216 

LyP 

3 

1,026 

Lyy 

4 

973 

Lyman  limit 

00 

911.5 

And  the  B aimer  series,  for  n 

=  2,  includes: 

Designation 

m 

^mn  (A) 

Hoc 

3 

6,563 

Hp 

4 

4,861 

Hy 

5 

4,341 

B  aimer  limit 

oo 

3,646 

The  important  hydrogen  transitions  between  low  n  orbits  have  been  given  the 
last  names  of  the  persons  who  first  observed  them.  They  are  called  the  Lyman 
transitions  for  n  =  1  (Lyman  1906),  the  B  aimer  transitions  for  n  =  2  (B  aimer 
1885),  the  Paschen  transitions  for  n  =  3  (Paschen  1908),  the  Brackett  transitions 
for  n  =  4  (Brackett  1922),  and  the  Pfund  transitions  for  n  =  5  (Pfund  1924). 

The  wavelengths  of  the  relevant  m  to  n  transitions  are  given  in  Table  6.3  for 
m  between  1  and  10,  as  well  as  for  m  =  oo,  which  is  the  limiting  wavelength  at 
which  the  lines  merge  together  (Fig.  6.7).  For  wavelengths  shorter  than  this  limit, 


Table  6.3  Wavelengths  of  the  m  to  n  transitions  of  hydrogen  for  n  =  1  to  n  =  5  and  m  =  2  to 
m  =  10a 


Series  (m) 

r 

"C 

3 

P 

3 

12 

II 

B  aimer  ( n  =  2) 

Paschen  ( n  =  3) 

Brackett  ( n  =  4) 

Pfund  (n  =  5) 

2 

121.567 

3 

102.572 

656.280 

4 

97.2537 

486.132 

1875.10 

5 

94.9743 

434.046 

1281.81 

4051.20 

6 

93.7803 

410.173 

1093.81 

2625.20  7 

7457.8 

7 

93.0748 

397.007 

1004.94 

2165.50 

4652.5 

8 

92.6226 

388.905 

954.598 

1944.56 

3739.5 

9 

92.3150 

383.538 

922.902 

1817.41 

3296.1 

10 

92.0963 

379.790 

901.491 

1736.21 

3038.4 

oo 

91.15 

364.6 

820.36 

1458.4 

2278.8 

a  The  wavelengths  are  given  in  nanometers  where  1  nm  =  10  9  m 
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Fig.  6.7  Balmer  limit  As  shown  in  this  spectrum  of  a  star,  the  lines  of  the  Balmer  series  of 
hydrogen  crowd  together  and  merge  at  a  limiting  wavelength,  the  Balmer  limit  at  364.6  nm  or 

o 

approximately  3,700  Angstrom.  At  wavelengths  less  than  this  limit,  the  radiation  of  hydrogen  is 
emitted  at  a  continuous  range  of  wavelengths,  known  as  the  Balmer  continuum,  resulting  from 
transitions  between  states  with  principal  quantum  number  n  =  2  and  states  in  which  the  single 
electron  is  freed  from  the  atom 


the  radiation  is  emitted  at  a  continuous  range  of  wavelengths,  or  as  a  continuum, 
resulting  from  transitions  in  which  the  electron  leaves  the  atom. 

For  an  atom  of  atomic  number  Z,  which  is  the  charge  of  the  nucleus  in  units  of 
the  proton  charge,  the  electron  orbital  energy,  En,  is  given  by 

meeAZ 2  hcRA 

=  =  ^  (6J3) 
where  n  is  an  integer  and  the  atomic  Rydberg  constant,  RA ,  is  given  by: 


(6.14) 


where  the  electron  mass  me  =  5.4858  x  10  4  u,  the  atomic  mass  MA  is  often 

_ 0^7 

given  in  units  of  the  atomic  mass  unit  u  =  1.660539  x  10  kg.  The  electron 
mass  in  atomic  mass  units  is  me  =  0.000548579867  u.  For  hydrogen  Z  =  1  and 
for  helium  Z  =  2.  The  frequency  for  the  transition  from  an  upper  level  m  to  a 
lower  level  n  is  given  by 


v 


mn 


=  cRaZ2 


2  cRaZ" 


(m  —  n ) 


n ' 


(6.15) 
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Table  6.4  Atomic  number  Z,  atomic  mass  MA,  and  atomic  Rydberg  constant  RA  for  the  most 
abundant  atoms  in  the  cosmosa 


Atom 

Atomic  number,  Z 

Atomic  mass,  MA 

V  1 

(u)  Rydberg  constant,  RA  (10  m  ) 

Hydrogen,  H1 

1 

1.007  825 

1.096776 

Helium,  He4 

2 

4.002  603 

1.097223 

Carbon,  C12 

6 

12.000  000 

1.097323 

Nitrogen,  N14 

7 

14.003  074 

1.097330 

Oxygen,  O16 

8 

15.994  914 

1.097335 

Neon,  Ne20 

10 

19.992  440 

1.097343 

_  r\  /-] 

L  The  atomic  mass  is  given  in  atomic  mass  units  u  =  1.660539  x  10  kg 


The  atomic  number  and  Rydberg  constant  for  abundant  atoms  are  given  in 
Table  6.4. 


6.3  Some  Atoms  are  Excited  Out  of  Their  Lowest-Energy 
Ground  State 

Because  atoms  reside  together  in  great  numbers,  we  must  use  a  statistical  approach 
to  determine  their  average  properties.  Their  level  of  excitation  depends  on  the 
temperature  and  the  density,  which  influence  how  often  the  particles  collide  and 
become  excited.  The  higher  the  temperature,  the  faster  particles  move,  and  the 
more  frequent  their  collisions.  When  the  particles  are  packed  together  in  greater 
numbers  per  unit  volume,  with  a  greater  density,  collision  frequency  also 
increases. 

The  number  of  atoms  in  the  lowest  possible  energy  state,  called  the  ground 
state,  is  greater  than  the  number  in  any  other  state  of  energy,  essentially  because  its 
easier  to  stay  in  the  lowest  energy  state.  Because  it  requires  less  energy,  an  atom 
acts  as  if  it  prefers  the  ground  state.  It  is  simply  more  difficult  to  enter  or  remain  in 
an  excited  state  of  energy.  Outside  agencies  are  required  to  sustain  an  excited  state 
of  energy. 

The  allowed  energy  levels  of  the  electron  orbits  within  an  atom  can  be  com¬ 
pared  to  the  rungs  of  a  ladder.  Electrons  can  climb  the  ladder’ s  energy  rungs  when 
an  atom  either  collides  with  other  atoms  or  absorbs  radiation.  Once  an  electron  is 
up  on  a  rung,  it  can  jump  downward,  releasing  the  energy  it  attained  to  get  there. 
Since  it  is  easier  to  stay  on  the  ground  and  never  climb  the  ladder,  most  atoms  are 
usually  in  the  ground  state.  Because  each  type  of  atom  has  a  different  type  of 
ladder,  with  energy  rungs  located  at  different  places,  every  element  exhibits  unique 
absorption  or  emission  lines. 

It  takes  more  energy  to  excite  the  higher  states,  and  the  number  of  atoms  that 
exist  in  a  given  energy  level  varies  inversely  and  exponentially  as  a  function  of  the 
energy.  When  collisions  are  the  dominant  process  that  influences  the  energy-level 
population,  the  ratio  of  the  population  of  two  energy  levels  of  a  given  atom 
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depends  on  the  temperature.  At  higher  temperatures,  more  atoms  are  pumped  up  to 
the  more  energetic  states. 

Under  conditions  of  local  thermodynamic  equilibrium,  the  ratio  of  the  number 
of  atoms  at  two  different  energies  depends  on  their  energy  difference  divided  by 
the  temperature.  At  a  higher  temperature,  there  are  more  atoms  with  higher  energy; 
but  there  always  are  more  atoms  with  the  lowest  possible  energy,  the  ground  state. 
The  number  ratio  is  known  as  the  Boltzmann  distribution ,  named  after  Ludwig 
Boltzmann  (1894-1906)  who  derived  it  (Boltzmann  1872). 

Individual  atoms  in  a  collection  of  atoms  are  always  moving  about,  undergoing 
collisions,  becoming  excited  and  radiating  that  excitation  away.  The  total  energy 
emitted  depends  on  the  number  of  excited  atoms,  which  varies  with  the  temper¬ 
ature,  the  energy  of  the  transition,  and  the  transition  probabilities,  which  are 
tabulated  in  spectroscopic  data  bases  found  on  the  Internet. 

Under  conditions  of  local  thermodynamic  equilibrium  at  a  temperature  T,  the 
number  densities,  Nn,  and  Nm,  of  atoms  in  levels  n  and  m  with  energies  En  and  Em 
are  related  by  the  Boltzmann  distribution  (Boltzmann  1872) 


Nn  §n  (  En  Em\ 


(6.16) 


_ O  O  _  1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  ,  the  gn  is  the  degen¬ 
eracy  of  level  n.  The  degeneracy  of  a  level  is  the  number  of  quantum  states  with 
the  energy  of  that  level.  The  quantity  En  is  the  energy  of  level  n  relative  to  a 
fundamental,  unexcited  ground  state  n  —  1. 

For  the  hydrogen  atom,  we  have  gn  =  2 n  and 


13.6 


1  - 


1 


m 


eV, 


(6.17) 


where  n  is  the  quantum  number  of  the  atomic  energy  level  under  consideration  and 
1  eV  =  1.602  x  10~19  J. 

Example:  Exciting  hydrogen  atoms 

Find  the  temperature  at  which  the  number  density  of  hydrogen  atoms  in 
the  fundamental  ground  state  is  equal  to  that  of  the  second  excited  state  for 
n  =  3.  From  the  Boltzmann  equation,  we  set  Ni/N$  =  (gi/gs)  exp  [—(Ej  — 
E3)/(kT)]  =  1,  and  using  g\  =2,  E±  =  0  for  reference,  g3  =  18, 

E3  =  12.09  eV  =  1.939  x  10“18  J  and  k  =  1.38  x  10“23  J  K“\  we  obtain 
(1.4036  x  105)/T  =  In  (9)  =  2.197,  or  T  x  63,900  K.  (The  In  denotes  the 
natural  logarithm.) 


If  N0  is  the  total  number  of  atoms  or  ions  per  unit  volume  in  the  ground  state 
with  n  —  1,  the  total  number  of  excited  atoms,  NE,  with  energy,  En,  above  the 
ground  state  will  be: 
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Ne 

No 


8 


n 


u 


exp 


(6.18) 


where  the  partition  function  for  the  atom  or  ion  under  consideration  is  given  by 


(6.19) 


and  gn  is  the  degeneracy  of  level  n.  The  fraction  of  atoms  or  ions  in  a  given  energy 
level  is  equal  to  the  portion  of  the  partition  function  related  to  this  level. 

When  the  electron  stays  within  the  atom  or  ion,  going  from  one  bound  state  to 
another,  then  the  transition  is  known  as  a  bound-bound  transition.  If  the  electron 
breaks  free  of  the  atom  or  ion,  then  it  is  called  a  bound-free  transition.  The 
probability  per  unit  time,  Pmn,  that  an  atom  will  undergo  a  bound-bound  transition 
from  a  high  state  of  energy,  Em,  to  a  lower  state  of  energy,  En ,  is  (Einstein  1917a): 


~T  Pmn  U 


v? 


(6.20) 


where  Amn  is  the  Einstein  coefficient  for  spontaneous  transition  between  the  two 
states  (spontaneous  emission),  and  Bmn  is  the  Einstein  stimulated  coefficient  for  a 
transition  induced  by  radiation  of  energy  density  Uv  in  the  frequency  range  v  to 
v  +  dv.  The  probability  per  unit  time  for  a  radiation-induced  absorption  is 


(6.21) 


where  Bnm  is  the  Einstein  coefficient  for  photo-absorption.  The  energy  density  of 
black  body  radiation  at  temperature,  T,  in  the  frequency  range  v  to  v  +  dv  is 
(Planck  1901) 


871  hv3 


c ~ 


exp 


(6.22) 


where  BV(T)  is  the  brightness  of  the  radiator  at  frequency  v  and  the  temperature 
is  T.  Because  the  number  of  downward  transitions  must  equal  the  number  of 
upward  transitions,  and  because  in  thermodynamic  equilibrium  each  state  has  a 
population  determined  by  the  Boltzmann  distribution,  it  follows  that 


and 


87l/zv3 

o  Bynn 
C3 


(6.23) 


nm  7 


(6.24) 


where  gm  is  the  statistical  weight  of  the  m  level. 

The  Amn  is  the  probability  per  second  that  an  atom  with  an  electron  in  level 
m  will  spontaneously  emit  a  photon  of  energy  hvmn  =  Em  —  En  so  the  energy,  E , 
emitted  per  unit  volume  by  the  spontaneous  bound-bound  transition  is: 
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E  NmhvmnA 


mn  7 


(6.25) 


where  Nm  is  the  volume  density  of  atoms  in  level  m  and  hvmn  =  Em  —  En. 


6.4  Ionization  and  Element  Abundance  in  the  Sun 
and  Other  Stars 


Because  a  greater  number  of  atoms  will  absorb  more  light,  the  relative  darkness  of 
the  absorption  lines  in  the  Sun’s  spectrum  should  establish  the  relative  abundance 
of  the  elements  there.  That  is,  darker,  stronger  absorption  lines  generally  indicate 
high  absorption  and  therefore  larger  amounts  of  the  absorbing  element.  However, 
the  strength  of  an  element’s  absorption  lines  depends  only  to  some  extent  on  the 
element’s  abundance. 

For  example,  atoms  exist  in  altered  physical  states  at  the  high  temperatures  that 
prevail  within  stars.  This  can  result  in  a  change  in  the  wavelength  and  intensity  of 
the  spectral  lines  observed  in  stellar  atmospheres.  In  the  latter  part  of  the  19th 
century,  Joseph  Norman  Lockyer  (1836-1920)  had  already  shown  experimentally 
that  elements  exhibit  different  spectra  under  varying  conditions  of  temperature  and 
pressure  in  the  terrestrial  laboratory,  and  that,  in  particular,  the  arc  and  higher 
temperature  spark  spectra  of  the  same  element  differed  (Lockyer  1887).  The  Sun  is 
obviously  hotter  than  typical  laboratory  temperatures,  or  it  would  not  shine  so 
brightly,  and  different  temperatures  and  pressures  prevail  at  various  locations 
within  the  solar  atmosphere.  Astronomers  eventually  realized  that  an  element 
displays  different  spectral  lines  depending  on  the  physical  conditions  of  the  solar 
region  in  which  it  is  located. 

Moreover,  some  stars  show  conspicuous  lines  other  than  the  dominant  lines  in 
the  Sun’s  spectrum,  suggesting  that  different  stars  have  different  compositions. 
Scientists  began  to  think  that  this  was  instead  related  to  the  stellar  temperature. 


Example:  Intensity  of  the  red  hydrogen  alpha  transition  in  the  solar 
atmosphere 

The  intensity  of  a  spectral  line  emitted  by  atoms  or  ions  in  an  excited  state 
s  will  depend  on  the  number  density,  Ns,  of  atoms  or  ions  occupying  the 
excited  state,  the  energy  Es  of  that  state,  and  the  temperature  T.  Under 
conditions  of  local  thermodynamic  equilibrium,  the  Boltzmann  distribution 
indicates  that  (Boltzmann  1872): 


Ntot 


(6.26) 


where  gs  is  the  degeneracy,  or  the  statistical  weight,  of  level  a,  and  the  partition 
function  for  free  particles  of  mass  m  is  U  =  (2nmkT)3^2V/h3  where  V  is  the 
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volume  occupied  by  the  gas,  the  Boltzmann  constant  k  =  1.381  x 
10-23  J  K-1  and  the  Planck  constant  h  =  6.626  x  10-34  J  s. 

For  the  red  hydrogen  alpha  transition,  the  wavelength  2  =  656.28  nm  = 
6.65628  x  10-7  m,  and  the  energy  Es  =  he/ 2  =  3.027  x  10-19  J,  where  the 

o  _ 1 

speed  of  light  c  =  2.9989  x  10  m  s  .  The  number  of  atoms  undergoing  this 
transition  in  the  solar  chromosphere  and  the  photosphere  depends  on  the  tem¬ 
perature,  TP  =  5,780  K  for  the  photosphere  and  Tc  =  2  x  104  K  for  the 
chromosphere.  The  relative  number  of  atoms,  NsC/NsP,  in  the  two  layers  of  the 
solar  atmosphere  is  given  by: 


Kc 

NsP 


7>y/2  exp(jf) 

Tc '  exp  (if) 


(6.27) 


So  there  are  more  atoms  excited  into  this  state  in  the  chromosphere  than  the 
underlying  photosphere,  because  at  higher  temperatures  the  exponential 
function  is  closer  to  one  and  at  lower  temperatures  it  is  a  smaller  number. 


When  the  theory  of  ionization  in  stellar  atmospheres  was  developed,  it  became 
clear  that  the  presence  or  absence  of  specific  spectral  lines  did  not  necessarily 
indicate  the  chemical  composition  of  a  star’s  atmosphere.  In  1920,  Meghnad  Saha 
(1893-1956),  a  young  lecturer  at  the  University  of  Calcutta,  demonstrated  that  the 
spectral  lines  of  different  elements  are  excited  under  different  conditions  of  tem¬ 
perature  and  pressure.  This  set  the  stage  for  showing  that  many  stars  have  similar 
compositions  (Saha  1920). 

In  his  analysis,  Saha  demonstrated  the  analogy  between  the  dissociation  of 
molecules  and  the  ionization  of  atoms.  He  replaced  the  mass  of  the  atom  with  the 
mass  of  the  electron  in  the  expression  for  the  degree  of  dissociation  of  a  molecule, 
thereby  obtaining  his  now-famous  ionization  equation.  This  formula,  known  as  the 
Saha  equation,  relates  the  degree  of  ionization  of  an  atom  to  temperature  and 
pressure,  and  therefore  indicates  that  the  relative  intensities  of  a  star’s  different 
spectral  lines  are  caused,  in  part,  by  differences  in  the  pressure  and  temperature  of 
the  stellar  atmosphere. 

The  fraction  v  of  atoms  that  are  ionized  in  a  gas  at  a  certain  temperature  T  and 
pressure  P  is  given  by: 


l  —  x 


(27 ime)3^2  ( kT )5/2 

P 


exp 


(6.28) 


where  %  is  the  ionization  potential,  or  the  amount  of  energy  to  be  supplied  to  an 
atom  to  ionize  it,  h  is  the  Planck  constant  and  me  is  the  mass  of  the  electron.  The 
ionization  potentials  of  some  cosmically  abundant  atoms  are  given  in  Table  6.5. 
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Table  6.5  Ionization  potentials  %  for  different  stages  of  ionization  of  the  most  abundant  atoms  in 
the  cosmos 


Z  Stage  of  Ionization 


I 

II 

III 

IV 

V 

VI 

VII 

VIII 

1 

H 

13.598 

2 

He 

24.587 

54.416 

6 

C 

11.260 

24.383 

47.887 

64.492 

392.077 

489.981 

7 

N 

14.534 

29.601 

47.448 

77.472 

97.888 

552.057 

667.029 

8 

0 

13.618 

35.116 

54.934 

77.412 

113.896 

138.116 

739.315 

871.4 

Under  conditions  of  local  thermodynamic  equilibrium,  the  number  density,  Nn 
of  atoms  in  the  rth  stage  of  ionization  is  related  to  that  of  the  (r  +  1)  state,  Nr+i,  by 
the  Saha  equation  (Saha  1920,  1921): 

U^lQ^ai  (ZM  (6.29) 

Nr  Ur  h 3  *\kT  )'  V  ' 

where  Ne  is  the  free  electron  density,  Ur  is  the  partition  function  of  the  rth  stage, 
the  Xr  is  the  energy  required  to  remove  an  electron  from  the  ground  state  of  the 
r-times  ionized  atom,  T  is  the  temperature,  me  is  the  electron  mass,  and  h  and  k  are 
respectively  the  Planck  and  Boltzmann  constants.  The  electron  density  Ne  can  be 
expressed  in  terms  of  the  electron  pressure  Pe  =  NekT. 

The  population  density,  Nn,  of  the  nth  quantum  level  is  given  by: 


Nn  =  N,Ni 
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where  Ne  and  Nt  are,  respectively,  the  free  electron  and  ion  densities,  gn  is  the 
statistical  weight  of  the  nth  level,  and  Xn  is  the  excitation  energy  of  the  nth  level 
above  ground  level.  For  hydrogen  like  atoms, 


X„=/Wz2(  l-U  (6.31) 

_  18* 

where  IH  =  13.6  eV  =  2.179  x  10~10  J  is  the  ionization  potential  of  the  hydro¬ 
gen  atom,  and 

gn  =  2  n2.  (6.32) 


Saha  used  his  ionization  equation  in  a  physical  theory  for  stellar  spectra,  speci¬ 
fying  temperatures  of  stars  of  different  spectral  type.  His  result  showed  that  differ¬ 
ences  in  stellar  spectra  are  caused  by  differences  in  excitation  rather  than  in  chemical 
composition.  The  English  astrophysicists  Ralph  A.  Fowler  (1899-1944)  and  Edward 
Milne  (1896-1950)  then  showed  that  the  number  of  atoms  or  ions  responsible  for  the 
production  of  a  spectral  line  can  be  estimated  from  the  line  intensity  once  the 
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temperature  and  pressure  of  the  stellar  atmosphere  are  known  (Fowler  and  Milne 
1924).  This  paved  the  way  for  the  work  of  the  American  astronomer  Cecilia  H.  Payne 
(1900-1979),  who  showed  that  stars  with  different  spectra  have  essentially  the  same 
composition,  and  it  eventually  led  to  the  realization  that  the  lightest  element, 
hydrogen,  is  by  far  the  most  abundant  element  in  most  stars  (Focus  6.1).  This  dis¬ 
covery  also  had  a  fundamental  role  in  understanding  how  the  Sun  shines  -  by  fusion 
reactions  of  the  nucleus  of  the  hydrogen  atom,  the  proton. 

Focus  6.1  Hydrogen,  the  most  abundant  element  in  the  Sun  and  most 
stars 

In  a  brilliant  doctoral  dissertation  written  in  1925,  the  American  astronomer 
Cecilia  H.  Payne  (1900-1979)  showed  that  the  atmospheres  of  virtually 
every  luminous,  middle-aged  star  have  the  same  ingredients  (Payne  1925). 
Her  calculations  also  indicated  that  hydrogen  is  by  far  the  most  abundant 
element  in  the  Sun  and  most  other  stars.  However,  she  could  not  believe  that 
the  composition  of  stars  differed  so  enormously  from  that  of  the  Earth,  where 
gaseous  hydrogen  is  rarely  found,  so  she  mistrusted  her  understanding  of  the 
hydrogen  atom.  Prominent  astronomers  of  the  time  also  did  not  think  that 
hydrogen  was  the  main  ingredient  of  the  Sun  and  other  stars. 

Subsequent  detailed  investigations  of  the  Sun’s  absorption-line  intensi¬ 
ties,  by  the  German  astronomer  Albrecht  Unsold  (1905-1995),  the  British 
astronomer  William  H.  McCrea  (1904-1999)  and  the  American  astronomer 
Henry  Norris  Russell  (1877-1957)  showed  that  the  Sun  is  composed  mainly 
of  the  lightest  element,  hydrogen,  accounting  for  92.1  %  of  the  number  of 
atoms  in  the  Sun  (Unsold  1928;  McCrea  1929;  Russell  1929).  Hydrogen  is  a 
million  times  more  abundant  than  any  other  element  in  solar  atmosphere, 
and  the  number  density  of  hydrogen  atoms  at  the  base  of  the  chromosphere  is 
about  1018  atoms  m-3. 

The  Danish  astronomer  Bengt  Stromgren  (1908-1987)  next  calculated  the 
hydrogen  content  in  the  interior  of  stars,  assuming  that  they  are  chemically 
homogeneous,  and  showed  that  their  observed  luminosities  require  that  the 
entire  star,  not  just  its  atmosphere,  be  composed  predominantly  of  hydrogen 
(Stromgren  1932). 

We  now  know  that  very  old  stars  have  very  few  elements  other  than 
hydrogen  and  helium;  these  stars  probably  have  existed  since  our  Galaxy 
formed.  Middle-aged  stars,  like  the  Sun,  contain  noticeable,  but  still  small, 
amounts  of  heavier  elements. 

Hydrogen  is  the  most  abundant  element  in  the  stellar  universe,  so  there 
was  nothing  wrong  with  Miss  Payne’s  calculations.  The  Earth  just  does  not 
have  sufficient  gravity  to  retain  hydrogen  in  its  atmosphere  for  any  length  of 
time.  Any  hydrogen  gas  that  our  young  planet  might  have  once  had  must 
have  evaporated  away  while  the  Earth  was  forming  and  has  long  since 
escaped,  or  become  locked  into  water  or  surface  rocks. 
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In  addition  to  specifying  the  compositions  of  stars,  detailed  observations  of 
absorption  or  emission  lines  yield  information  about  the  temperature,  density, 
motion,  and  magnetism  of  the  Sun  and  other  stars,  as  well  as  rarefied  nebulae, 
interstellar  matter,  and  a  various  cosmic  objects. 


6.5  Wavelengths  and  Shapes  of  Spectral  Lines 
6.5.1  Radial  Motion  Produces  a  Wavelength  Shift 

Just  as  a  source  of  sound  can  vary  in  pitch  or  wavelength  depending  on  its  motion, 
the  wavelength  of  electromagnetic  radiation  shifts  when  the  emitting  source  moves 
with  respect  to  the  observer  (See  Sect.  4.2).  Such  a  wavelength  change  is  named 
the  Doppler  effect  after  the  Austrian  scientist,  mathematician  and  schoolteacher, 
Christiaan  Doppler  (1803-1853)  who  first  explained  how  it  works  (Doppler  1842). 

The  Doppler  effect  shows  that  the  wavelength  of  a  spectral  fine  that  an 
astronomer  observes  can  differ  from  the  emitted  fine  wavelength.  The  size  of  the 
wavelength  change  depends  on  the  relative  speed  of  the  radiating  source  along  the 
observer’s  line  of  sight,  known  as  the  radial  velocity.  The  greater  the  radial 
velocity,  the  larger  the  Doppler  shift.  In  1868,  the  English  astronomer  William 
Huggins  (1824-1910)  was  the  first  to  use  this  method  to  determine  the  velocity  of 
a  star  moving  away  from  the  Earth  (Huggins  1868). 

For  a  source  radial  velocity  directed  away  from  the  observer  along  the  line  of 
sight,  the  shift  is  toward  longer,  redder  wavelengths  in  the  visible  part  of  the 
electromagnetic  spectrum,  and  therefore  also  is  known  as  a  redshift.  When  the 
motion  is  toward  the  observer,  there  is  a  blue  shift  to  shorter,  bluer  wavelengths. 
For  a  source  radial  velocity,  Vn  away  from  the  observer  along  the  line  of  sight,  the 
Doppler  redshift  formula  is  applicable: 

_  ^ observed  ^ L  TL  V observed  ^  Vr  r-  nnS 

Z  —  —  — - : - — - ~  —  lor  V r  C,  (0.33 J 

Al  Al  V  observed  C 


or 


and 


f^=l+z  (6.34) 


1  +  — ^  for  Vr  <  <  c 

where  the  observed  line  wavelength  is,  A observed ,  the  emitted  fine  wavelength  is  2L, 
the  z  is  known  as  the  redshift  since  visible  spectral  lines  are  Doppler  shifted  to 
longer,  redder  wavelengths,  v  denotes  frequency  and  the  speed  of  light 
c  =  2.9979  x  108  m  s-1. 


(6.35) 
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For  large  radial  velocities  approaching  c,  the  relativistic  Doppler  effect  yields: 


i 

for  Vr  «  c  (6.36) 


and 


1  +  z  = 


c-F  Vr 
C  —  Vr 


C 


(Z+1)2-1 

(z+  1)2+1 


for  Vr  ~  c. 


(6.37) 


Example:  Redshifting  Lyman  a  into  the  visible 


The  Lyman  a  transition  from  hydrogen  atoms  in  a  star  that  is  moving  at 
relatively  low  speeds  with  respect  to  the  Earth  will  be  emitted  at  the  rest 
wavelength  of  lL  =  121.567  nm,  in  the  ultraviolet  part  of  the  electromag¬ 
netic  spectrum.  If  a  galaxy  is  moving  at  a  high  enough  speed,  it  might 
redshift  the  emission  into  a  red  wavelength  of  lobs  =  600  nm,  with  a  redshift 
z  =  (lobs  ~  II) /II  =  3.936.  For  redshifts  greater  than  one,  the  velocities 

o  _ 1 

approach  that  of  light,  and  the  radial  velocity  Vr=2.76xl0  ms  using 


the  high-speed  equation 


Vj_  _  (z+l)  —  1 

C  _  (z+l)2+l 


of  light  c  =  2.9979  x  108  ms  '. 


for  the  radial  velocity  Vr  with  the  speed 


6.5.2  Gravitational  Redshift 


For  massive,  collapsed  stars,  there  also  is  a  detectable  gravitational  redshift  caused 
by  the  loss  of  radiation  photon  energy  in  overcoming  the  immense  gravitational 
pull  of  a  star.  This  is  a  small  effect  for  stars  like  the  Sun,  whose  gravitational 

_ z: 

redshift  is  about  2  x  10  ,  but  it  increases  for  collapsed  stars. 

When  a  photon  of  energy  hv  =  me  leaves  the  surface  of  a  massive  body  of 
mass  M  and  radius  R ,  it  loses  an  energy  AE  given  by 


A  E  =  hAv  = 


GMm 
~ R " 


GMhv 

Rc 2 


(6.38) 


where  the  radiation  moves  at  the  speed  of  light  c  and  is  imagined  to  have  an 
effective  mass  m.  The  gravitational  redshift  is  given  by 


Av  _  A2  _  GM 

v  l  Rc 2 


(6.39) 


_ /r 

The  gravitational  redshift  for  the  Sun  is  ~  2  x  10  ,  which  is  a  value  that 

would  be  produced  by  the  Doppler  effect  at  a  radial  velocity  of  only  0.64  km  s-1, 
and  so  is  very  difficult  to  measure.  Nevertheless,  such  a  measurement  has  been 
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reported  (Blamont  and  Roddier  1961;  Forbes  1970;  Snider  1972).  The  effect 
increases  for  collapsed  stars,  and  it  has  been  measured  for  white  dwarf  stars  with  a 
mass  about  equal  to  that  of  the  Sun  and  a  size  about  equal  to  that  of  the  Earth 
(Adams  1925;  Popper  1954;  Greenstein  et  al.  1971;  Greenstein  and  Trimble  1972; 
Shipman  1972,  Sect.  13.3) 

The  most  accurate  tests  of  the  gravitational  redshift  are  made  in  the  Earth’s 
gravity,  which  is  much  weaker  than  that  of  the  Sun  or  white  dwarf  stars.  The 
greater  terrestrial  accuracy  in  the  measurement  of  this  effect  is  possible  because 
the  wavelengths  of  nuclear  spectral  lines  are  more  precisely  known  than  those  of 
the  spectral  lines  of  any  celestial  object.  The  discovery  of  narrow  gamma  ray 
spectral  lines  of  nuclei  (Mossbauer  1958)  led  to  the  first  detection  of  the  terrestrial 
gravitational  redshift.  Using  the  Mossbauer  effect,  Pound  and  Rebka  (1959,  1960) 
measured  a  value  of  =  (2.57  db  0.26)  x  10“ 15  as  opposed  to  the  predicted 
value  of  =  2.46  x  10“ 15.  The  agreement  between  theory  and  experiment  was 
subsequently  improved  to  about  one  percent  (Pound  and  Snider  1964,  1965) 

Example:  Measuring  the  gravitational  redshift  on  the  Earth  and  stars 

Due  to  the  Earth’s  relatively  low  gravity,  the  gravitational  redshift  mea¬ 
surement  requires  radiation  at  a  very  precise  gamma-ray  wavelength,  gen¬ 
erated  by  a  nuclear  resonance  Mossbauer  effect.  In  going  from  the  ground  to 
a  height  H ,  the  wavelength  change  due  to  the  gravitational  redshift  will  be 
Ak/k  =  GMeH/(R2ec 2)  «  10-16  //,  where  G  =  6.674  x  10“n  N  m2  kg'2, 
the  mass  of  the  Earth  ME  =  5.974  x  1024  kg,  the  speed  of  light 
c  =  2.9979  x  108  m  s-1,  and  the  radius  of  the  Earth  RE  =  6.371  x  106  m. 
Pound  and  Rebka  (1959,  1960)  measured  the  change  over  a  distance  of 
22.5  m,  requiring  a  measurement  precision  of  2  x  10-15. 

For  the  Sun,  the  gravitational  redshift  is  A 2/2  =  GM0/(R  0C2)«  2x 
10-6,  where  the  Sun’s  mass  M0  =  1.989  x  1030kg  and  the  Sun’s  radius 
Rq  =  6.955  x  108m.  By  way  of  comparison,  the  thermal  broadening 
Ak// \L  =  V thermal! c  =  (3kT / m)1  / c  =  4  x  10-5  for  a  spectral  line  of 
hydrogen,  of  mass  m  =  1.67  x  10  ^  kg,  emitted  from  the  visible  disk  of 
the  Sun  at  temperature  T  =  5780  K,  where  the  Boltzmann  constant 
k  =  1.381  x  10  ~  J  K  (also  see  next  example).  So  the  thermal  broad¬ 
ening  effect  is  about  16  times  greater  than  the  gravitational  redshift  effect, 
making  the  latter  very  difficult  to  detect. 

A  white  dwarf  star  has  about  the  same  mass  as  the  Sun,  but  a  radius  that  is 
comparable  to  that  of  the  Earth  and  100  times  smaller  than  the  Sun,  at  about 

/T 

6.37  x  10°  m,  so  the  gravitational  redshift  is  about  100  times  larger  and 
easier  to  detect  (Sect.  13.3). 


A  more  exact,  relativistic  expression  for  the  gravitational  redshift,  which  is 
applicable  to  a  very  massive,  collapsed  object  is  (Einstein  1911,  1916): 
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Fig.  6.8  Effects  that  broaden  a  spectral  line  The  motion  of  absorbing  or  emitting  atoms  can 
broaden  a  line  to  the  short-wavelength  and  long-wavelength  side  of  the  resting,  or  nonmoving, 
wavelength,  here  denoted  by  AL  (top).  The  Doppler  effect  describes  the  broadening  (see  Fig.  4.2). 
When  the  motion  is  due  to  the  heat  or  temperature  of  the  radiating  atoms,  the  effect  produces 
thermal  broadening;  when  the  average  temperature  of  a  collection  of  atoms  increases,  the  thermal 
broadening  becomes  wider.  The  Doppler  effect  broadening  can  also  be  caused  by  the  rotation  or 
expansion  of  the  source.  An  intense  magnetic  field  can  split  a  single  line  at  wavelength  2L  into 
two  components  by  the  Zeeman  effect  (bottom).  The  wavelength  difference  Aal  between  the  split 
lines  is  proportional  to  the  strength  of  the  magnetic  field.  (From  “The  Life  and  Death  of  Stars”  by 
Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


(6.40) 


Vessot  et  al.  (1980)  have  compared  the  time  of  a  hydrogen  maser  clock  in  a 
rocket  with  a  similar  clock  on  the  ground,  confirming  the  relativistic  gravitational 
redshift  within  the  uncertainties  of  the  measurement. 


6.5.3  Thermal  Motion  Broadens  Spectral  Lines 

Any  observed  spectral  line  is  the  superposition  of  the  lines  emitted  by  many 
individual  atoms  in  different  physical  conditions.  Rather  than  appearing  at  a  single 
wavelength,  the  observed  line  therefore  is  broadened  over  a  range  of  wavelengths 
(Fig.  6.8).  Van  Vleck  and  Haber  (1977)  have  reviewed  absorption  and  emission 
lines  and  line  breadths. 
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The  amount  of  wavelength  broadening  increases  with  the  temperature  of  the 
source  or,  to  be  exact,  it  varies  as  the  thermal  velocity  of  the  moving  atoms  and  the 
square  root  of  the  temperature. 

For  atoms  in  thermodynamic  equilibrium  at  temperature  T,  a  line  emitted  at 
wavelength  hL  will  be  Doppler  broadened  by  an  amount  AhD  given  by: 


A  Xv  —  hi 


^  V thermal 


C 


' 


(6.41) 


O  _  1 

where  c  =  2.9979  x  10  m  s  ,  and  the  thermal  velocity  Vthermai  of  an  atom  of 
mass  m  at  temperature  T  is  given  by: 


F thermal 


J3kT- 

3kT 

V  m 

m 

1  1/2 


_ no  _ 1 

where  the  Boltzmann  constant  k  =  1.381  x  10  J  K  . 


(6.42) 


Example:  Thermal  broadening  of  atomic  hydrogen  lines  in  the 
photosphere 

The  temperature  of  the  visible  solar  disk,  the  photosphere,  is  T  =  5,780  K,  so 
the  thermal  velocity,  Vthermai  of  a  hydrogen  atom  of  mass  mH  =  1.007825 
u  =  1.6735  x  10-27  kg,  will  be  Vthermai  =  (3kT/mH)l^2=  1.20  x  104  m  s  \ 

_ no  _ i 

where  the  Boltzmann  constant  k  —  1.381  x  10  ^  J  K  .  The  wavelength 
broadening  Ah  in  wavelength  2  is  given  by  Ah/ h  =  V thermal/ c  =  4.0  x  10-5, 

o  _ 1 

where  the  speed  of  light  c  =  2.9979  x  10“  m  s'1. 


6.5.4  Rotation  or  Expansion  of  the  Radiating  Source  can 
Broaden  Spectral  Lines 

If  a  source  is  rotating,  the  Doppler  effect  of  the  object’s  side  rotating  toward  the 
observer  produces  a  blueshift  to  shorter  wavelengths;  the  other  side,  which  is 
rotating  away,  shifts  a  line  to  longer  wavelengths.  The  combined  effect  produces  a 
line  broadening  that  increases  with  the  rotation  velocity  and  that  depends  on  the 
projected  linear  equatorial  velocity  or  the  observed  rotational  velocity  of  the  line. 

If  a  source  is  rotating  with  an  equatorial  rotation  velocity  Vrot  it  will  produce  a 
line  broadening  by  an  amount  Ahrot  given  by: 

Vrot  sin  i 


A hyot  h L 


c 


5 


(6.43) 
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where  i  is  the  inclination  of  the  source’s  equator  to  the  celestial  equator.  The 
quantity  of  Vrot  sin  i  is  the  projected  linear  equatorial  velocity  or  the  observed 
rotational  velocity  of  the  line. 

A  similar  broadening  applies  to  an  expanding  source,  which  exhibits  a  line 
broadening  that  increases  with  the  expansion  velocity,  Vexp,  producing 

A/lexp  =  4  •  (6.44) 


6.5.5  Curve  of  Growth 

The  Belgian  astronomer  Marcel  Minnaert  (1893-1970)  and  his  Dutch  colleague 
Gerard  F.W.  Mulders  (1908-1993)  introduced  the  equivalent  width,  Wf,  or  the 
area  of  an  absorption  line  profile,  as  a  measure  of  line  intensity  (Minnaert  and 
Mulders  1930).  By  plotting  the  equivalent  widths  against  the  number  of  absorbers 
(in  a  logarithmic  form),  they  showed  for  the  first  time  the  empirical  curve  of 
growth,  which  permits  the  comparisons  of  line  intensities  and  widths  with  theo¬ 
retical  expectations.  It  is  mainly  of  concern  for  advanced  studies  of  stellar 
atmospheres. 

The  two  classical  model  atmospheres  are  known  as  the  Schuster-Schwarzschild 
atmosphere  (Schuster  1905;  Schwarzschild  1906)  and  the  Milne-Eddington 
atmosphere  (Milne  1921,  1930;  Eddington  1917,  1926,  b).  In  local  thermodynamic 
equilibrium  they  both  lead  to  the  approximate  relation  for  the  equivalent  width, 
Wf,  of  a  line  of  intensity  /;  at  wavelength,  2.  If  we  define 

W,_  =  J!£jJ±dX,  (6.45) 

where  the  subscripts  C  and  2  denote  the  continuum  adjacent  to  the  line  and  the  line 
wavelength,  we  have: 


WX 

2 


4sQmec: 


■ Nf 2, 


(6.46) 


where  the  electron  charge  is  e,  the  electric  constant  is  £0>  the  electron  mass  is  me ,  the 
speed  of  light  is  c,  the  A  is  the  column  density  of  the  atoms  producing  the  spectral  line 
and  the  oscillator  strength /is  related  to  the  absorption  cross  section  a  by 


a  = 


4so  mec 


(6.47) 
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Each  spectral  line  is  characterized  by  the  oscillator  strength/,  and  the  larger  the 
/  the  stronger  the  spectral  line  that  is  seen.  We  can  determine  N  once  the  oscillator 
strength  is  known  and  the  equivalent  width  measured. 

Observations  of  the  intensities  and  widths  of  spectral  lines  can  be  compared 
with  theoretical  expectations  in  order  to  determine  the  excitation  temperature,  the 
turbulent  velocity,  the  electron  and  gas  pressures,  the  surface  gravity,  and  the 
abundance  of  the  elements  in  a  stellar  atmosphere.  Nowadays  there  are  complex 
theoretical  models  for  the  atmospheres,  and  relevant  data  is  on  the  Internet. 


6.5.6  Magnetic  Fields  Split  Spectral  Lines 

When  an  atom  is  placed  in  a  magnetic  field,  it  acts  like  a  tiny  compass,  and  it 
adjusts  the  energy  levels  of  its  electrons.  If  the  atomic  compass  is  aligned  in  the 
direction  of  the  magnetic  field,  the  electron’s  energy  increases.  If  it  is  aligned  in 
the  opposite  direction,  the  energy  decreases.  Because  each  energy  change  coin¬ 
cides  with  an  alteration  in  the  wavelength  or  frequency  of  the  radiation  emitted  by 
that  electron,  a  spectral  fine  emitted  at  a  single  wavelength  by  a  randomly  oriented 
collection  of  atoms  becomes  a  group  of  three  lines  of  slightly  different  wave¬ 
lengths  in  the  presence  of  a  magnetic  field  (Fig.  6.9).  The  size  of  an  atom’s  internal 


Fig.  6.9  The  Zeeman  effect  The  magnetic  field  in  a  sunspot  can  be  measured  using  the  Zeeman 
effect.  In  a  sunspot  {right)  the  spectral  lines  that  are  normally  at  a  single  wavelength  become  split 
into  two  or  three  components  (left),  depending  on  the  orientation  of  the  field  with  respect  to  the 
line  of  sight.  The  vertical  line  crossing  the  sunspot  denotes  the  alignment  of  the  observing 
instrument  slit.  The  separation  of  the  outermost  components  is  proportional  to  the  strength  of  the 
magnetic  field,  in  this  sunspot  about  0.4  tesla,  or  4,000  gauss.  The  components  also  have  a 
circular  polarization,  which  indicates  the  direction  of  the  longitudinal  magnetic  field.  (Courtesy 
of  NO  AO.) 
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adjustments,  and  the  extent  of  its  spectral  division,  increases  with  the  strength  of 
the  magnetic  field. 

This  magnetic  transformation  was  named  the  Zeeman  effect ,  after  Pieter 
Zeeman  (1865-1943),  who  first  noticed  it  in  a  laboratory  (Zeeman  1896,  1897a,  b). 
His  Dutch  colleague  Hendrik  A.  Lorentz  (1853-1928)  predicted  the  effect  (Lorentz 
1898).  The  pair  received  the  1902  Nobel  Prize  in  Physics  for  their  work. 

We  can  gain  some  insight  to  the  Zeeman  effect  by  considering  the  motion  of  a 
free  electron  in  the  presence  of  a  magnetic  field.  A  charged  particle  cannot  move 
straight  across  a  magnetic  field,  but  instead  gyrates  around  it.  If  the  particle 
approaches  the  magnetic  field  in  a  perpendicular  direction,  a  magnetic  force  pulls 
it  into  a  circular  motion  about  the  magnetic  field  line.  Because  the  particle  can 
move  freely  in  the  direction  of  the  magnetic  field,  it  spirals  around  it  with  a  helical 
trajectory. 

The  size  of  the  circular  motion,  called  the  radius  of  gyration  and  designated  Rg , 
depends  on  the  velocity,  Vj_,  of  the  particle  in  the  perpendicular  direction,  the 
magnetic  field  strength,  B ,  and  the  mass,  m,  and  charge,  Ze ,  of  the  particle.  That 
gyration  radius  is  described  by  the  equation  (Larmor  1897): 


(6.48) 


provided  that  the  velocity  is  not  close  to  the  speed  of  light,  c.  An  electron  will 
circle  about  the  magnetic  field  with  a  radius,  Rg ,  and  with  a  period,  P,  given  by: 

2nRg  2nme 
V±  ~  eB 

At  the  velocity  Vj_,  the  electron  goes  once  around  the  circumference  2nR  in  the 
period  P.  The  rotational  frequency,  vg,  of  this  circular  motion,  in  a  plane  per¬ 
pendicular  to  the  magnetic  field,  is 


v„  =  -  =  «  2.80  x  1010B  Hz,  (6.50) 

g  P  2nme  K  ’ 

where  the  elementary  charge  of  the  electron  is  e  —  1 .6022  x  10  19  C,  the  mass  of 

_ Q  1 

the  electron  is  me  =  9.1094  x  10  kg,  and  B  is  the  magnetic  field  strength  in 
tesla. 

When  an  atom  is  placed  in  a  magnetic  field,  a  very  similar  thing  happens  to  its 
electrons  and  the  spectral  lines  they  emit.  A  line  that  radiates  at  a  wavelength  2L 
without  a  magnetic  field  becomes  split  into  two  or  three  components  depending  on 
the  orientation  of  the  magnetic  field.  This  has  been  detected  in  sunspots  (Fig.  6.8). 

We  can  represent  the  orbital  motion  of  the  electron  using  the  Bohr  model  of  the 
atom  for  the  electron  in  the  ground  state  orbit  with  quantum  number  n  =  1.  The 
orbital  angular  momentum  in  a  circular  orbit  will  be  meVR  =  nh/ (2n)  =  hj (2n)  for 
orbital  velocity  V,  orbital  radius  R ,  and  h  denotes  the  Planck  constant  with 
h  =  6.6261  x  10  J  s.  When  the  atom  is  placed  in  a  magnetic  field  of  strength  B , 
the  magnetic  moment,  designated  //,  is  the  Bohr  magneton,  denoted  by  fiB ,  given  by: 
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HB  = - (mVR)  = - =  9.274  x  10“24  J  T_1,  (6.51) 

2  me  2nme 

where  J  denotes  the  joule  units  of  energy  and  T  designates  the  tesla  unit  of 
magnetic  field  strength. 

When  the  atom  is  placed  in  the  magnetic  field,  a  single  spectral  line  is  split  into 
three  closely  spaced  spectral  lines.  The  adjacent  energy  levels  differ  in  energy, 
AEb ,  corresponding  to  a  photon  energy  difference  h(v\  —  v2)  related  by: 

eh 

AEB  =  fiBB  =  - - B  =  h(v i  -  v2).  (6.52) 

4nme 

The  two  outer  components  of  the  three  Zeeman-split  spectral  lines  will  be  sepa¬ 
rated  by  an  energy  difference  2 AEB,  and  a  total  frequency  separation  Av  given  by: 

Av  =  —  B  «  2.80  x  10105  Hz.  (6.53) 

h 

For  the  three  component  split,  the  shift,  A  A,  in  wavelength  of  the  two  outer 
components  is  given  by: 


A2 

A  A  =  — Av  ~  93.3  A2B  m, 
c 


(6.54) 


where  the  shift  in  frequency  is  Av,  and  the  speed  of  light  is 
c  =  2.9979  x  108  m  s  ',  both  A  A  and  A  are  in  meters,  and  B  is  in  tesla.  The 
separation  is  thus  proportional  to  the  magnetic  field  strength  B. 

Example:  Zeeman  effect  in  sunspots  and  interstellar  space 

The  magnetic  field  strength  in  a  sunspot  measured  by  Hale  (1908a,  b)  was 
B  =  0.3  tesla.  For  the  red  Ha  transition  at  a  wavelength  of 
A  =  656.3  nm  =  6.56  x  10“  m,  the  Zeeman  effect  will  give  a  line  split¬ 
ting  of  AA  =  93.3  A2  B  «  1.20  x  10-11  m  or  AA/A  «  1.80  x  10-5.  The 
Doppler  broadening  of  this  line  due  to  the  thermal  motion  of  the  hydrogen 
atoms  at  a  temperature  of  5,780  K  is  AA/A  =  V thermal/ c  =  4.0  x  10-5  (see 
previous  example),  only  twice  the  expected. 

The  interstellar  spaces  are  very  cold,  about  100  K  or  less,  with  less 
thermal  broadening,  but  an  accuracy  of  A  A/ A  =  1  x  10“ 10  is  required  to 
make  the  measurement  at  a  wavelength  of  the  A  =  21  cm  transition  of 
interstellar  hydrogen. 


The  American  solar  astronomer  George  Ellery  Hale  (1868-1938)  first  made 
measurements  of  this  Zeeman  splitting  in  sunspots,  showing  that  they  have  magnetic 
field  strengths  of  about  0.3  tesla  (Hale  1908a,  b).  This  is  about  10,000  times  the 
strength  of  the  terrestrial  magnetic  field  that  orients  our  compasses.  The  split  lines 
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Table  6.6  Cosmic  magnetic 
fields 


Object 

Magnetic  field  strength,  B  (tesla)a 

Earth  (equator  to  pole) 

3  x  10“5-6  x  10“5 

Solar  wind  (at  Earth  orbit) 

2.5  x  10“9 

Sunspot 

0.3 

Sun  (global) 

3  x  icr4 

Interstellar  space 

io-i° 

White  dwarf  star 

102 

Pulsar 

108 

a  1  tesla  =  1  T  =  104  gauss  =  104  G 


are  circularly  polarized,  and  the  direction  of  polarization  indicates  the  direction  of 
the  magnetic  field,  pointing  into  or  out  of  the  Sun  (Hale  et  al.  1919). 

As  suggested  by  Bolton  and  Wild  (1957),  the  magnetic  field  strength  in 
interstellar  space,  with  about  B  =  10“ 10  tesla,  can  be  observed  at  radio  wave¬ 
lengths  (Verschuur  1968,  1971;  Troland  and  Heiles  1977). 

Ledoux  and  Rensen  (1966)  have  provided  an  early  summary  of  magnetic  stars; 
Donati  and  Landstreet  (2009)  reviewed  magnetic  fields  in  nondegenerate  stars. 
Angel  (1978)  provided  a  review  of  magnetic  white  dwarf  stars.  Crutcher  (2012) 
discusses  magnetic  fields  in  molecular  clouds;  Beck  et  al.  (1996)  review  galactic 
magnetism;  Sofue  et  al.  (1986)  reviewed  the  global  structure  of  magnetic  fields  in 
galaxies,  and  Carilli  and  Taylor  (2002)  reviewed  observations  of  magnetic  fields  in 
clusters  of  galaxies. 

The  magnetic  field  strengths,  B ,  of  various  cosmic  objects  are  listed  in 
Table  6.6. 

All  of  this  refers  only  to  the  outer  atmosphere  of  the  Sun  and  other  stars.  As  it 
turns  out,  atoms  do  not  exist  in  most  of  the  Sun,  except  in  the  cool  visible  layer  in 
which  dark  absorption  lines  are  formed.  It  is  too  hot  everywhere  else  for  whole 
atoms  to  survive.  Innumerable  collisions  fragment  the  abundant  hydrogen  atoms 
into  their  constituent  pieces.  Their  protons  and  electrons  have  been  set  free  from 
their  atomic  bonds,  wandering  throughout  the  solar  material  unattached  to  each 
other.  Some  of  these  protons  merge  together  near  the  Sun’s  center,  but  an 
understanding  of  these  nuclear  fusion  reactions  had  to  await  investigations  of  how 
nuclear  particles  occasionally  escape  the  tight  confines  of  radioactive  elements. 


Chapter  7 

Transmutation  of  the  Elements 


7.1  The  Electron,  X-rays  and  Radium 

When  voltage  is  applied  to  the  ends  of  a  glass  tube  filled  with  rarefied  gas,  an  arc 
of  light  passes  from  the  cathode  -  negative  end  -  of  the  gas  tube  to  the  other  end  - 
the  anode,  which  is  positive.  If  vacuum  pumps  are  used  to  reduce  the  gas  pressure 
in  the  glass  tubes,  they  cease  to  glow  inside,  but  the  glass  shines  where  the 
invisible  cathode  rays  apparently  are  striking  it. 

The  French  scientist  Jean  Perrin  (1870-1942)  used  a  magnet  to  deflect  the 
location  of  the  glowing  spot  on  the  glass,  suggesting  that  the  unseen  streams  of 
“rays”  were  beams  of  negatively  charged  particles  (Perrin  1895).  A  magnetic  field 
deflects  a  negative  charge  in  one  direction  and  a  positive  charge  in  the  other 
direction,  and  the  size  of  the  deflection  depends  on  the  mass  of  the  charged 
particle,  its  velocity,  and  the  strength  of  the  magnetic  field.  The  British  physicist 
Joseph  John  Thomson  (1856-1940)  subsequently  used  both  electric  and  magnetic 
fields  to  measure  the  deflection  of  the  beams  of  charged  particles,  now  called 
electrons. 

The  deflection  experiments  worked  this  way:  an  electrical  field  produces  a  force 
that  bends  a  beam  of  electrons  in  the  direction  of  the  field,  toward  positive  and 
away  from  negative  electrodes.  By  measuring  the  deviation  from  the  original 
direction  of  motion,  the  charge  to  mass  ratio  e/m  can  be  determined.  Because  the 
deflection  also  depends  on  the  velocity  of  the  electrons,  another  measurement  was 
needed,  which  was  accomplished  by  placing  a  magnet  near  the  electron  beam. 
Combining  both  the  electrical  and  magnetic  experiments  determines  the  charge  to 
mass  ratio  of  the  electron.  So,  it  was  from  this  ratio  and  the  fundamental  unit  of 
charge  that  the  mass  of  the  electron  was  determined.  Thomson  concluded  that  the 
electron  is  much  less  massive  than  any  atom.  In  fact,  it  is  roughly  1 ,000  times  less 
massive  than  the  lightest  atom,  hydrogen  (Thomson  1897a,  b,  1903). 

Meanwhile,  the  German  physics  professor  Wilhelm  Rontgen  (1845-1923) 
inadvertently  left  some  wrapped,  unexposed  photographic  plates  near  a  glowing 
gas  tube  that  he  was  studying.  Later,  he  found  that  the  plates  were  fogged  and  that 
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this  always  happened  when  other  new  plates  were  left  near  the  apparatus.  Rontgen 
concluded  that  invisible  rays  were  passing  out  of  the  tube  and  fogging  the  plates. 

In  those  days,  photographs  were  taken  with  glass  plates  covered  with  a  light- 
sensitive  emulsion  of  silver  salts.  This  preceded  the  use  of  photographic  film, 
which  has  been  replaced  by  the  charge-coupled  devices  used  in  digital  cameras. 

To  remove  any  light,  Rontgen  enclosed  the  electrical  discharge  tube  in  black 
cardboard,  and  noticed  a  glow  coming  from  a  nearby  sheet  of  paper  coated  with  a 
substance  (barium  platinocyanide)  known  to  shine  in  strong  light  -  but  there  was 
no  light  to  make  it  glow.  The  cause  seemed  to  be  the  same  invisible  rays  that 
fogged  the  photographic  plates.  Moreover,  when  Rontgen’ s  wife  placed  her  hand 
between  the  electrified  gas  tube  and  a  photographic  plate,  the  developed  photo¬ 
graph  showed  the  bones  of  her  hand  and  the  ring  she  was  wearing  (Rontgen  1896). 

The  report  of  these  previously  unknown  and  penetrating  rays  caused  a  public 
sensation,  for  they  could  see  inside  humans  and  reveal  the  invisible.  The  rays  were 
able  to  penetrate  skin  and  muscle,  detecting  human  bones,  which  revolutionized 
medicine. 

No  one  knew  what  these  mysterious,  penetrating  emissions  were,  so  Rontgen 
called  them  x-rays,  using  the  mathematical  designation  x  for  something  unknown. 
Subsequent  investigations  eventually  showed  that  the  x-rays  are  electromagnetic 
radiation  of  very  short  wavelength  and  high  photon  energy. 

In  1896,  at  the  Sorbonne  in  Paris,  Henri  Becquerel  (1852-1908)  was  investi¬ 
gating  fluorescent  substances  that  could  collect  the  energy  of  light  and  remain 
luminous  after  the  light  source  was  removed.  Uranium  salts,  for  example,  glowed 
in  the  dark,  and  Becquerel  thought  that  they  also  might  emit  x-rays  after  being 
stimulated  by  sunlight.  However,  clouds  shut  out  the  Sun,  so  he  tossed  the  packet 
of  uranium  salts  into  a  drawer  in  his  worktable. 

A  box  of  unexposed  photographic  plates  had  been  left  in  the  drawer,  wrapped  in 
thick  black  paper  and  never  opened;  this  led  to  another  accidental  discovery.  When 
Becquerel  took  out  the  plates  a  few  days  later  and  developed  photographs  taken 
with  them,  he  found  that  they  were  ruined,  as  if  they  had  previously  been  exposed 
to  light.  The  uranium  salts  were  emitting  unseen  rays  that  could  affect  the  plates, 
even  in  pitch  darkness.  Becquerel  called  the  invisible,  highly  penetrating  phe¬ 
nomenon  uranic  rays  (Becquerel  1896).  For  a  time  they  were  called  Becquerel 
rays,  and  eventually  they  became  known  as  radioactive  rays,  but  for  decades,  no 
one  knew  exactly  what  they  were. 

The  new  type  of  rays  passed  without  difficulty  through  a  covered  box  of 
photographic  plates,  but  they  might  not  penetrate  metal.  Becquerel  repeated  the 
experiment  with  an  iron  key  placed  between  new  photographic  plates  and  the 
uranium.  When  the  plates  were  developed,  they  showed  the  silhouette  of  the  key. 
This  indicted  that  the  uranium  was  emitting  rays  that  were  unable  to  pass  through 
the  iron,  even  though  they  could  penetrate  dark  paper  that  blocks  ordinary  light.  In 
this  respect,  the  uranium  rays  resembled  x-rays;  but  unlike  x-rays,  the  uranium  was 
emitting  rays  spontaneously  without  any  previous  excitation  by  sunlight  or 
electricity. 
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Hearing  of  Becquerel’s  discovery,  Pierre  Curie  (1859-1906),  also  a  professor  of 
physics  at  the  Sorbonne,  and  the  young  graduate  student  he  recently  had  married, 
Manya  (Marie)  Curie  (1867-1934),  began  to  investigate  the  new  type  of  rays. 
Madame  Curie  wanted  to  know  if  uranium  was  the  only  element  that  emitted  the 
mysterious  rays,  and  developed  methods  to  measure  the  amounts  being  released. 
To  her  surprise,  she  found  that  impure  uranium  ores  emitted  more  rays  than  could 
be  explained  in  terms  of  the  uranium  they  contained.  The  couple  began  a  laborious 
two-year  search  for  the  unknown  emitters;  from  one  ton  of  the  uranium  ore  known 
as  pitchblende  they  extracted  just  a  few  grams  of  powerful  new  elements  that  had 
not  been  known  previously  (Curie  1898;  Curie  and  Curie  1898).  One  was  called 
radium  and  the  other  polonium,  after  Marie’s  native  Poland. 

Radium  is  one  million  times  more  radioactive  than  uranium,  which  -  in  the 
terminology  of  the  time  -  meant  that  radium  is  emitting  the  penetrating  rays  more 
intensely  than  uranium,  not  that  either  substance  emits  radio  waves.  Crystals 
containing  radium  can  light  up  an  otherwise  dark  room,  and  also  burn  the  skin,  as 
Curie  discovered  to  his  dismay. 


7.2  Radioactivity 

At  the  time  of  their  discovery,  no  one  knew  exactly  what  radioactive  rays  were, 
where  their  energy  came  from,  or  why  the  radioactive  materials  kept  pouring  out 
energy.  Moreover,  the  amount  of  energy  being  released  by  radium  was  difficult  to 
explain,  for  it  far  surpassed  anything  that  had  been  achieved  by  chemical  reactions. 
And,  it  is  a  natural  process  that  happens  all  the  time  in  the  ground  on  which  we 
stand. 

It  was  the  English  physicist  Ernest  Rutherford  (1871-1937)  who  found  that  the 
source  of  radioactive  energy  must  come  from  the  interior  of  the  radioactive  atoms. 
These  very  heavy  atoms  were  unstable,  disintegrating  all  by  themselves  and  slowly 
leaking  energy  from  their  interiors  in  spontaneous  transmutation  (Rutherford  1904). 

Rutherford  previously  found  that  the  radioactive  rays  emitted  by  uranium 
included  at  least  two  distinct  types,  termed  alpha  rays  and  beta  rays  (Rutherford 
1899).  These  rays  are  not  waves  of  radiation;  they  instead  are  beams  of  energetic, 
fast-moving  particles.  By  using  electrical  and  magnetic  fields,  the  two  types  of 
particles  could  be  separated  and  their  physical  properties  examined.  The  directions 
in  which  the  beams  were  deflected  indicated  the  sign  of  their  electrical  charge,  and 
the  magnitude  of  the  deflection  provided  a  measure  of  both  the  charge  and  the 
mass. 

The  most  energetic  particles  emitted  by  radioactive  substances  are  alpha  particles 
(Ramsay  and  Soddy  1903).  Alpha  particles  carry  a  double  dose  of  positively  charged 
protons,  and  they  move  at  astonishingly  high  speeds  -  about  1.5  x  10  m  s  ,  or 
one-twentieth  the  speed  of  light  (Rutherford  1911).  This  corresponds  to  a  typical 

_  IQ 

kinetic  energy  of  about  5  MeV,  or  five  mega-electron- volts  and  8.01  x  10  J. 
Rutherford  and  his  colleagues  eventually  showed  that  an  alpha  particle  is  nothing 


194 


7  Transmutation  of  the  Elements 


Radioactive 
Atomic  Nucleus 

Fig.  7.1  Radioactive  alpha  decay  An  unstable,  heavy  nucleus  of  a  radioactive  element  can 
disintegrate  or  decay  into  a  stable,  lighter  nucleus,  with  the  emission  of  an  alpha  particle  that 
carries  mass  away  from  the  heavy  nucleus  during  its  decay.  The  subatomic  alpha  particle  consists 
of  two  protons  and  two  neutrons.  The  nucleus  of  a  helium  atom  is  an  alpha  particle.  Radioactive 
alpha  decay  of  an  individual  heavy  element  such  as  uranium  does  not  occur  very  often,  on 
average.  We  would  have  to  wait  4.47  billion  years  for  half  of  a  rock  of  uranium  to  change  into 
lead  by  emitting  alpha  particles.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang, 
published  by  Cambridge  University  Press,  2012.  Reprinted  with  permission.) 


more  than  the  nucleus  of  the  helium  atom,  containing  two  protons  and  two  neutrons. 
The  alpha  particles  were  being  emitted  by  a  heavy,  unstable  atom  that  was  turning 
spontaneously  into  a  slightly  lighter  atom,  without  any  interactions  with  another 
particle  or  radiation  from  outside  the  atom  (Fig.  7.1). 

Radioactive  decay  is  occasionally  accompanied  by  the  emission  of  charged  beta 
rays,  which  make  an  electronic  adjustment  to  an  atom’s  nucleus  without  signifi¬ 
cantly  changing  its  mass.  Beta  rays  consist  of  negatively  charged  particles  of  low 
mass,  which  eventually  were  identified  as  high-speed  electrons. 

The  ejection  of  alpha  and  beta  particles  often  is  accompanied  by  the  emission  of 
powerful  electromagnetic  radiation,  akin  to  x-rays  but  with  even  shorter  wave¬ 
length  and  greater  energy  (Villard  1900;  Rutherford  and  Andrade  1914).  Because 
the  energetic  radiation,  called  gamma  rays  and  designated  y,  is  not  charged, 
electrical  or  magnetic  fields  do  not  deflect  it. 

Working  with  the  young  chemist  Frederick  Soddy  (1877-1956),  Rutherford 
found  that  radioactive  atoms  continued  to  disintegrate  into  other  pieces  after  the 
emission  of  an  alpha  particle  (Rutherford  and  Soddy  1902,  1903).  Uranium,  for 
example,  initially  turned  into  thorium,  which  also  was  radioactive  and  released 
other  substances,  including  gaseous  radon. 

However,  the  progressive  disintegration  of  heavy,  unstable  elements  does  not 
continue  forever.  As  radioactive  decay  progresses,  the  inner  parts  of  an  atom 
rearrange  into  greater  stability,  eventually  reaching  an  equilibrium  that  does  not 
decay.  For  uranium,  this  stable  endpoint  is  lead. 

A  simplified  notation,  described  in  Focus  7.1,  clarifies  how  these  complex 
chains  of  radioactive  decay  work. 
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Focus  7.1  Nuclear  nomenclatures 

The  number  of  protons  in  a  nucleus  is  denoted  by  the  atomic  number  Z.  They 
account  for  the  charge  of  the  nucleus  but  not  for  all  of  its  mass.  A  nucleus  of 
any  element  except  hydrogen  has  about  twice  the  mass  of  the  sum  of  its 
protons.  The  extra  mass  is  due  to  neutral,  or  uncharged,  particles  called 
neutrons,  each  with  about  the  same  mass  as  a  proton.  The  mass  of  the 
nucleus  is  specified  by  the  mass  number  A  =  N  +  Z,  where  N  is  the  number 
of  neutrons;  the  mass  number  A  also  gives  the  number  of  nucleons  in  the 
nucleus,  which  is  the  sum  of  the  number  of  protons  and  the  number  of 
neutrons. 

Letters  denote  the  nuclei  and  other  subatomic  particles.  Both  a  letter  and  a 
superscript,  the  mass  number  A,  designate  a  nucleus.  An  arrow  — >  specifies 
the  reaction.  Nuclei  on  the  left  side  of  the  arrow  react  to  form  products  given 
on  the  right  side  of  the  arrow.  The  alpha  decay  of  uranium,  for  example,  is 
given  by 

238u^  234Th  +  a  (7.1) 

or 

238u  234Th  +4He>  (7.2) 

where  U  denotes  a  nucleus  of  uranium,  Th  indicates  a  nucleus  of  thorium, 
and  the  alpha  particle  a  is  the  nucleus  of  the  helium  atom,  also  designated 
4He. 

Another  way  of  describing  the  alpha  decay  of  a  nucleus  of  mass  number 
A  and  atomic  number  Z,  the  number  of  protons,  is: 

(A,  Z)  — »  (A  —  4,  Z  —  2)  +  a,  (7.3) 

where  the  helium  nucleus  a,  with  A  =  4  and  Z  =  2,  balances  out  both  the 
mass  and  charge  on  each  side  of  the  reaction.  The  total  sum  of  neutrons  and 
protons  on  the  left  side  of  the  arrow  is  equal  to  the  total  sum  on  the  right 
side;  it’s  just  that  they  have  been  redistributed. 

The  spontaneous  decay  of  the  uranium,  U,  radium,  Ra,  and  polonium,  Po, 
nuclei  can  be  represented  by: 


238u- 

■»  234Th  +  4 He  +  Energy, 

(7.4) 

226  Ra  - 

->  222 Rn  +  4 He  +  Energy, 

210Po  - 

->  206 Pb  +  4 He  + Energy, 

(7.5) 

where  the  arrow  denotes  the  decay  of  the  nucleus  on  the  left  side  to  the 
nuclei  on  the  right  side,  the  superscripts  denote  the  number  of  protons  and 


196 


7  Transmutation  of  the  Elements 


neutrons  in  the  nucleus,  Th  denotes  the  thorium  nucleus,  Rn  denotes  the 
nucleus  of  radon,  Pb  denotes  the  lead  nucleus,  and  He  is  the  helium  nucleus, 
the  alpha  particle.  Both  radium  and  polonium  are  also  natural  products  in  the 
decay  chain  of  radioactive  uranium. 


7.3  Tunneling  Out  of  the  Atomic  Nucleus 

Why  don’t  the  nuclei  of  the  radioactive  atoms  decay  completely  all  at  once?  Or  to 
ask  a  related  question,  how  have  the  nuclei  of  so  many  uranium  atoms  managed  to 
retain  their  alpha  particles?  After  all,  there  is  still  plenty  of  uranium  around  bil¬ 
lions  of  years  after  the  Earth  formed,  continuing  to  make  the  rocks  and  soil  around 
us  radioactive.  The  reason  is  that  it  takes  significant  energy  to  break  free  of  the 
strong  forces  that  bind  the  protons  and  neutrons  together  in  the  nucleus  of  an  atom. 
They  are  locked  so  firmly  within  the  nucleus  that  exceptional  force  must  be 
applied  to  dislodge  them. 

In  fact,  the  escape  of  an  alpha  particle  from  an  atomic  nucleus  seemed 
impossible  from  the  viewpoint  of  classical  physics.  The  forces  holding  the  particle 
inside  the  nucleus  are  so  strong  that  the  energy  required  to  overcome  them  is 
enormous.  It  is  as  if  the  atomic  nucleus  is  surrounded  by  tall  walls  with  an  energy 
much  higher  than  that  of  the  alpha  particles. 

The  Russian  physicist  George  Gamow  (1904-1968)  resolved  this  paradox  in 
1928,  by  using  the  uncertain,  probabilistic  nature  of  quantum  theory  to  explain  the 
mechanism  of  radioactive  decay.  Quantum  theory  indicated  that  the  location  of  a 
tiny  subatomic  particle  is  not  defined  precisely.  It  might  be  anywhere,  although 
with  decreasing  probability  at  regions  far  from  the  most  likely  location. 

This  explains  the  escape  of  alpha  particles  from  the  nuclei  of  radioactive  atoms 
such  as  uranium.  These  particles  usually  lack  the  energy  to  overcome  the  nuclear 
barrier,  but  some  have  a  small  probability  of  escaping  to  the  outside  world.  And 
because  there  is  some  probability  that  they  will  “leak  through”  the  nuclear  walls, 
eventually  some  do.  The  chance  of  escaping  the  atomic  nucleus  increases  with  the 
kinetic  energy  of  the  subatomic  particle  and  decreases  with  its  electrical  charge. 

Not  all  nuclei  are  radioactive.  For  a  nucleus  to  be  unstable  there  has  to  be 
available  a  final  state  in  which  the  sum  of  the  masses  of  the  decay  products  is  less 
than  the  mass  of  the  initial  nucleus. 

So,  how  fast  is  the  rate  of  decay?  Gamow’ s  (1928)  quantum-mechanical  for¬ 
mula  for  the  “transparency”  of  the  nuclear  walls  was  in  good  agreement  with 
Rutherford’s  suggestion  that  the  number  of  radioactive  atoms,  A,  in  a  rock  changes 
with  time,  t,  since  its  solidification  with  a  constant  rate  of  decay  according  to  the 
equation  (Rutherford  1900): 
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dN 

dt 


=  —XN 


where  the  radioactive  decay  constant  2  =  0.693/t^  =  In  2/tj_/2  and  t{/2  is  the 
half-life  of  the  radioactive  species. 

The  decay  rates  of  various  radioactive  substances  differ.  The  nuclei  of  uranium 
can  retain  their  alpha  particles  for  billions  of  years,  whereas  other  radioactive 
nuclei  eject  them  in  a  matter  of  seconds.  These  rates  are  quantified  in  terms  of  the 
nuclear  half-life,  which  is  the  time  needed  for  a  given  amount  of  a  radioactive 
substance  to  decay  to  half  of  its  initial  value. 

The  decay  is  probabilistic,  governed  by  the  rules  of  quantum  mechanics  that 
describe  a  random,  slow,  and  statistical  decay.  On  the  level  of  a  single  atom,  it  is 
impossible  to  predict  when  a  given  atom  will  decay,  and  the  probability  that  a 
given  unstable  atom  decays  is  the  same  for  all  atoms  of  that  type,  independent  of 
age.  For  numerous  identical  atoms,  the  decay  rate  is  predictable  using  quantum 
theory,  and  that  rate  depends  on  the  radioactive  element  under  consideration. 

At  almost  the  same  time  as  Gamow’s  discovery,  the  British  physicist  Ronald 
W.  Gurney  (1898-1953)  and  the  American  physicist  Edward  U.  Condon 
(1902-1974)  developed  a  similar  explanation  for  spontaneous  radioactive  decay 
(Gurney  and  Condon  1928). 

The  decay  equation  integrates  to  give  the  number  of  radioactive  atoms,  N(t)  at 
time  t : 


.  .  /— 0.693  A 

N(t)  =  No  exp  (— 2f)  =  No  exp  ^ - J  ,  (7.7) 

where  N0  is  the  number  of  atoms  at  time  t  =  0,  the  time  of  solidification,  2  is  the 
radioactive  decay  constant,  Ti/2  is  the  half-life  of  the  radioactive  atom,  and  0.693 
is  the  natural  logarithm  of  2  denoted  ln2. 

The  mean  lifetime,  t,  of  a  radioactive  atom  is  the  inverse  of  the  decay  constant,  or 

_  1  _  t1/2  _  t1/2 
X  ~J~  ha  ~  0.693  ’ 

where  the  radioactive  decay  constant  2  =  0.693/t^  =  In  (2) /tx /2,  and  Ti/2  is  the 
half-life  of  the  atom. 

The  half-life  is  the  time  required  for  a  quantity  of  radioactive  atoms  to  fall  to 
half  its  value  as  measured  at  the  beginning  of  the  time  period,  following  the 
exponential  decay  equation.  For  example,  uranium  238  decays  into  thorium  234 
with  the  emission  of  an  alpha  particle  and  a  half-life  of  4.46  x  109  years,  or  4.46 
billion  years.  The  thorium  eventually  decays  into  radium  226,  which  itself  decays 
into  radon  gas  with  the  emission  of  another  alpha  particle  and  a  half-life  of 
1,601  years.  The  radon  eventually  decays  into  polonium  210,  which  decays  into 
stable  lead  206  with  the  emission  of  another  alpha  particle  and  a  half-life  of  just 
138.4  days. 
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Table  7.1  Long-lived  radioactive  isotopes  used  for  dating 


Radioactive  parent 

Stable  daughter 

Half-life  (years) 

Rubidium  (Rb)  187 

Strontium  (Sr)  87 

48.8  billion 

Rhenium  (Re)  187 

Osmium  (Os)  187 

44  billion 

Lutetium  (Lu)  176 

Halfnium  (Hf)  176 

35.7  billion 

Thorium  (Th)  232 

Lead  (Pb)  208 

14.05  billion 

Uranium  (U)  238 

Lead  (Pb)  206 

4.47  billion 

Potassium  (K)  40 

Argon  (Ar)  40 

1.27  billion 

Samarium  (Sm)  146 

Neodymium  (Nd)  142 

0.10  billion 

Uranium  (U)  235 

Lead  (Pb)  207 

704  million 

Plutonium  (Pu)  244 

Thorium  (Th)  232 

83  million 

Iodine  (I)  129 

Xenon  (Xe)  129 

16  million 

Palladium  (Pd)  107 

Silver  (Ag)  107 

6.5  million 

Manganese  (Mn)  53 

Chromium  (Cr)  53 

3.7  million 

Aluminum  (Al)  26 

Magnesium  (Mg)  26 

0.72  million 

The  parent  radioactive  atoms  eventually  decay  into  stable  daughters,  and  the 
amount  of  the  daughter  steadily  increases  at  the  expense  of  the  parent. 

Half-lives  for  the  decay  of  long-lived  radioactive  parent  isotopes  are  listed  in 
Table  7.1  together  with  the  stable  daughter  isotopes.  As  an  example,  it  takes 
4.47  billion  years  for  half  the  atoms  in  a  lump  of  uranium  to  change  into  lead. 

This  provides  us  with  a  way  of  measuring  the  age  of  the  solar  system.  You  just 
measure  the  relative  amounts  of  radioactive  parents  and  non-radioactive  daughters. 
When  this  ratio  is  combined  with  the  known  rates  of  radioactive  decay,  the  time 
since  the  rock  solidified  and  locked  in  the  radioactive  atoms  is  found. 

The  daughters  can  escape  easily  when  the  rock  is  molten;  only  when  it  cools 
and  solidifies  do  they  start  to  accumulate.  For  this  reason,  the  age  determined  for  a 
rock  is  actually  the  time  since  the  rock  became  solid.  If  the  rock  is  remelted,  for 
example  by  the  impact  of  a  meteorite,  its  radioactive  clock  is  reset  and  the  age  will 
measure  the  time  since  the  last  solidification. 

Radioactive  dating  of  primitive  meteorites,  ancient  rocks  returned  from  the 
Moon,  and  deep  ocean  sediments  indicates  an  age  of  about  4.6  billion  years.  These 
relics  have  remained  unaffected  by  the  geological  erosion  processes  that  removed 
the  primordial  record  from  most  terrestrial  rocks.  If  the  solar  system  originated  as 
one  entity,  then  this  also  should  be  the  approximate  age  of  the  Sun  and  the  rest  of 
the  solar  system  (Focus  7.2). 


Focus  7.2  The  age  of  the  solar  system 

Radioactive  elements  can  be  used  to  clock  the  age  of  rocks  on  the  Earth’s 
surface,  meteorites,  and  lunar  rock  samples.  The  number  of  radioactive 
atoms  in  the  rock  will  be  halved  in  a  time  equal  to  the  half-life.  Radioactive 
uranium,  U“  ,  decays,  for  example,  into  lead,  Pb~  (which  is  stable),  with  a 
half-life  of  about  4.47  billion  years;  so  every  4.47  billion  years  the  amount 

oq  o 

of  uranium-238  in  a  rock  will  be  halved.  We  can  apply  the  equations  to  U“  , 
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and  express  the  abundance  in  terms  of  another  kind  of  lead,  Pb204,  which  is 
not  a  radioactive  decay  product.  If  a  terrestrial  rock,  lunar  sample,  or  a  non¬ 
terrestrial  meteorite  became  a  closed  system  at  time  t  =  0,  then  the  present 
abundances  of  lead  and  uranium  are  related  by  the  equation: 


where  the  subscripts  t  and  0  denote  the  present  and  initial  abundance, 
respectively. 

If  all  of  the  rock  samples  have  the  same  initial  Pb206/Pb204  abundance, 
and  if  all  of  them  have  the  same  age,  t ,  then  a  plot  of  (Pb206/Pb204)/  against 
(U238/Pb204)?  should  lie  in  a  straight  line  of  slope  [exp  (A238  t)  —  1].  Such  a 
plot  is  called  an  isochron.  If  a  system  formed  t  years  ago  and  initially 
contained  no  lead,  then  a  curve  of  the  ratios  207Pb/206Pb  and  238U/206Pb  also 
provides  the  age  t. 

Radioactive  dating  has  been  used  to  determine  an  age  of  carbonaceous 
chondrite  meteorites  of  t  =  4.566  ±  0.002  billion  years,  where  one  billion 
years  =  109  years  =  one  Gigayear,  abbreviated  Gyr  (Patterson  1956,  Birck 
1990).  Radioactive  dating  of  the  oldest  rocks  returned  from  the  Moon 
indicate  an  age  of  about  4.5  billion  years  (Wasserburg  et  al.  1977),  and  deep 
sediments  in  the  Earth’s  oceans  are  dated  at  4.55  billion  years.  Rounding  off 
the  numbers  and  allowing  for  possible  systematic  errors,  we  can  say  that  the 
Earth,  Moon  and  meteorites  solidified  at  the  same  time  some  4.6  billion 
years  ago,  with  an  uncertainty  of  no  more  than  0.1  billion  years.  If  the  solar 
system  originated  as  one  entity,  then  this  should  also  be  the  approximate  age 
of  the  Sun  and  the  rest  of  the  solar  system. 


Despite  Gamow’s  tunneling  discovery,  our  understanding  of  radioactivity  was 
still  incomplete;  there  was  something  wrong  with  the  way  the  beta  rays  were 
behaving.  This  resulted  in  the  discovery  of  an  entirely  new  particle  -  the  electron 
neutrino. 


7.4  The  Electron  and  the  Neutrino 

When  first  discovered,  the  electrons  emitted  from  radioactive  elements  were  called 
beta  rays,  to  distinguish  them  from  alpha  rays  (helium  nuclei)  and  gamma  rays 
(high-energy  radiation)  that  also  are  emitted  during  radioactive  decay  processes. 
From  their  measured  charge  and  mass,  it  was  discovered  that  the  betas  are  not  rays 
at  all  but  instead  ordinary  electrons  moving  at  nearly  the  speed  of  light.  The 
emission  of  high-speed  electrons  by  a  radioactive  element  is  known  as  beta  decay. 
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Detailed  measurements  of  the  high-speed  electrons,  given  off  during  radioactive 
decay,  seemed  to  violate  a  fundamental  principle  of  physics  known  as  the  con¬ 
servation  of  energy.  According  to  this  rule,  the  total  energy  of  a  system  must 
remain  unchanged  unless  acted  on  by  an  outside  force.  We  know  of  no  process  that 
disobeys  this  principle. 

Nevertheless,  the  sum  of  the  energy  of  the  beta  decay  nucleus  and  the  energy  of 
the  emitted  electrons  sometimes  turned  out  to  be  less  energy  than  the  amount  of 
energy  lost  by  the  initial  nucleus.  Careful  measurements  failed  to  turn  up  the 
missing  energy,  which  seemed  to  have  vanished  into  thin  air,  suggesting  that 
energy  might  not  be  conserved  during  beta  decay.  The  eminent  Danish  physicist 
Niels  Bohr  (1885-1962)  even  proposed  that  the  conservation  of  energy  law  was 
being  violated  on  the  atomic  scale,  suggesting  that  the  beta  decay  observations 
may  force  us  to  renounce  the  very  idea  of  energy  balance  (Bohr  1930). 

However,  it  turned  out  that  a  mysterious,  invisible  particle  was  spiriting  away 
the  missing  energy.  It  was  the  elusive  neutrino,  whose  existence  was  postulated  by 
Wolfgang  Pauli  (1900-1958),  a  brilliant  Austrian  physicist.  Pauli  proposed  a 
“desperate  way  out”  of  the  energy  crisis,  speculating  that  an  electrically  neutral 
particle,  produced  at  the  same  time  as  the  electron,  carried  off  the  remaining 
energy  (Pauli  1930,  1933).  The  sum  of  the  energies  remains  constant  during  the 
beta  decay,  so  the  energy  is  balanced  and  the  principle  of  conservation  of  energy  is 
saved. 

Pauli  thought  he  had  done  “a  terrible  thing”,  for  his  desperate  remedy  postu¬ 
lated  an  invisible  particle  that  could  not  be  detected.  Dubbed  the  neutrino ,  or 
“little  neutral  one”  by  the  Italian  physicist  Enrico  Fermi  (1901-1954),  the  new 
particle  could  not  be  observed  with  the  technology  of  the  day,  since  the  neutrino  is 
electrically  neutral,  has  almost  no  mass,  and  moves  at  nearly  the  speed  of  light. 
Therefore  the  hypothetical  neutrinos  were  removing  energy  that  would  never  be 
seen  again.  (Even  in  Pauli  and  Fermi’s  time,  the  observed  high-energy  shape  of  the 
emitted  electron’s  energy  spectrum  indicated  that  the  mass  of  the  neutrino  is  either 
zero  or  very  small  relative  to  the  mass  of  the  electron). 

Unlike  light  or  any  other  form  of  radiation,  neutrinos  can  move  nearly  unim¬ 
peded  through  any  amount  of  material,  even  the  entire  universe.  In  the  parlance  of 
modern  physics,  neutrinos  are  characterized  by  a  weak  interaction  with  anything  in 
the  material  world. 

As  beautifully  described  by  Fermi,  the  decay  process  occurs  when  the  neutron 
in  a  radioactive  nucleus  transforms  into  a  proton  with  the  simultaneous  emission  of 
an  energetic  electron  and  a  high-speed  neutrino  (Fermi  1934).  When  left  alone 
outside  a  nucleus,  a  neutron  will,  in  fact,  self-destruct  in  about  10  min  into  a 
proton,  plus  an  electron  to  balance  the  charge,  and  a  neutrino  to  help  remove  the 
energy. 

We  can  write  the  radioactive  beta  decay  reaction  as 
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where  n  denotes  a  neutron,  p  is  a  proton,  e~  is  an  electron,  and  ve  denotes  the  anti¬ 
electron  neutrino,  or  the  anti-particle  of  the  electron  neutrino.  The  proton  and 
electron  have  equal  charges  of  opposite  sign,  so  their  charges  cancel  to  match  the 
uncharged  neutron  -  provided  that  the  neutrino  has  no  charge.  The  electron  e~  is 
sometimes  denoted  beta  minus  (/F)  to  denote  the  emission  of  the  beta  particle,  the 
electron.  The  reaction  was  also  known  as  negative  beta  decay,  with  the  negative 
standing  for  the  negative  charge  of  the  negative  beta  particle,  the  electron. 

The  nuclear  reaction  for  negative  beta  decay  can  be  written: 

(Z-  1,  A)  -  (Z,  A)  +  e~  +  ve,  (7.11) 

where  (Z,  A)  denotes  a  nucleus  of  atomic  number  or  charge  Z  and  mass  number  A. 

If  a  positron,  denoted  e+,  is  emitted,  the  reaction  is  written: 

Energy  +  p  — >  n  +  e+  +  ve,  (7-12) 

where  energy  has  been  supplied  to  fuel  the  reaction  and  ve  denotes  the  electron 
neutrino.  Sometimes  the  positron  in  this  reaction  is  denoted  beta  plus  (/F).  Unlike 
/F  decay,  the  positive  beta  decay  is  not  a  spontaneous  reaction  that  occurs  in 
isolation  because  it  requires  energy,  the  mass  of  the  neutron  being  slightly  greater 
than  the  mass  of  the  proton.  The  nuclear  reaction  for  positive  beta  decay  is 

Energy  +  (Z  +  1,  A)  —>  (Z,  A)  +  e+  +  ve.  (7.13) 

As  far  as  anyone  could  tell,  an  atomic  nucleus  consists  only  of  neutrons  and 
protons,  so  the  electron  and  neutrino  seemed  to  come  out  of  nowhere.  They  do  not 
reside  within  the  nucleus  and  are  created  at  the  time  of  nuclear  transformation.  No 
one  knew  exactly  how  the  neutrinos  formed. 

How  do  you  observe  something  that  spontaneously  appears  out  of  nowhere  and 
is  so  close  to  being  nothing  at  all?  Calculations  suggested  that  the  probability  of  a 
neutrino  interacting  with  matter,  so  that  the  effect  might  be  seen,  is  so  incredibly 
small  that  no  one  could  ever  detect  it.  To  see  one  neutrino,  we  would  have  to 
produce  enormous  numbers  at  about  the  same  time,  and  build  a  massive  detector  to 
increase  the  chances  of  detecting  it.  Although  almost  all  of  the  neutrinos  still 
would  pass  through  any  amount  of  matter  unhindered  and  undetected,  a  rare 
collision  with  other  subatomic  particles  might  leave  a  trace. 

Nuclear  reactors  produce  large  numbers  of  neutrinos,  and  if  a  massive  detector 
is  placed  near  a  large  nuclear  reactor,  with  appropriate  shielding  from  extraneous 
signals,  the  telltale  sign  of  the  hypothetical  neutrino  may  be  barely  observed. 

The  existence  of  the  neutrino  was  finally  proven  by  Project  Poltergeist,  an 
experiment  designed  by  Clyde  L.  Cowan  (1919-1974)  and  Frederick  Reines 
(1918-1998)  of  the  Los  Alamos  National  Laboratory  in  New  Mexico.  They  placed 
a  10-ton  (10,000  L)  tank  of  water  next  to  a  powerful  nuclear  reactor  engaged  in 
making  plutonium  for  use  in  nuclear  weapons.  After  shielding  the  neutrino  trap 
underground  and  running  it  for  about  100  days,  Reines  and  Cowan  detected  a  few 
synchronized  flashes  of  gamma  radiation  that  signaled  the  interaction  of  a  few 
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neutrinos  with  the  nuclear  protons  in  water  (Reines  and  Cowan  1953,  Cowan  et  al. 
1956). 

The  neutrinos  themselves  were  not  observed,  and  they  never  have  been.  Their 
presence  was  inferred  by  an  exceedingly  rare  interaction.  One  of  every  billion 
billion,  or  10  ,  neutrinos  that  passed  through  the  water  tank  hit  a  proton,  pro¬ 
ducing  the  telltale  burst  of  radiation.  Nearly  four  decades  later,  Reines  received  the 
1995  Nobel  Prize  in  Physics  for  the  detection  of  the  neutrino;  but  by  that  time 
however,  Cowan  had  died  and  therefore  could  not  share  in  the  award. 

As  discussed  in  Chap.  8,  the  Sun  emits  copious  amounts  of  neutrinos.  Every 
second,  trillions  upon  trillions  of  neutrinos  that  were  produced  inside  the  Sun  pass 
right  through  the  Earth  without  even  noticing  that  it  is  there.  The  indestructible 
neutrinos  interact  so  rarely  with  the  material  world  that  almost  nothing  ever 
happens  to  them.  Billions  of  ghostly  neutrinos  from  the  Sun  are  passing  right 
through  us  every  second,  even  in  our  bedrooms  at  night,  and  they  did  not  come 
through  the  door.  The  solar  neutrinos  travel  right  through  the  Earth,  a  building,  and 
us,  without  our  body  noticing  them  or  them  noticing  our  body. 

Moreover,  when  a  minute  number  of  the  Sun’s  neutrinos  were  snared  in  mas¬ 
sive  underground  detectors,  fewer  than  expected  were  observed;  this  eventually  led 
to  a  new  understanding  of  the  neutrino.  For  the  time  being,  however,  let  us  move 
on  to  energetic  cosmic  rays  that  are  always  entering  the  atmosphere  from  outer 
space. 


7.5  Cosmic  Rays 

Subatomic  particles  are  entering  our  atmosphere  from  all  directions  in  interstellar 
space  and  moving  at  nearly  the  speed  of  light.  The  perpetual  high-energy  rain  was 
discovered  about  a  century  ago,  when  the  Austrian  physicist  Victor  Franz  Hess 
(1883-1964),  an  ardent  amateur  balloonist,  measured  the  amount  of  ionization  at 
different  heights  within  our  atmosphere  (Hess  1912). 

It  was  already  known  that  radioactive  rocks  at  the  Earth’s  surface  were  emitting 
energetic  “rays”  -  the  alpha  and  beta  particles  -  that  ionize  molecules  in  the 
atmosphere  near  the  ground.  It  was  expected  that  the  ionizing  rays  would  be 
absorbed  completely  after  passing  through  sufficient  quantities  of  the  atmosphere. 
The  measured  ionization  at  first  decreased  with  altitude,  as  expected  from  atmo¬ 
spheric  absorption  of  energetic  particles  emitted  by  radioactive  rocks.  However, 
the  ionization  rate  measured  by  Hess  increased  at  even  higher  altitudes  to  levels 
exceeding  that  at  the  ground  when  the  balloons  rose  to  above  1  km  in  altitude 
(Hess  1912).  This  meant  that  some  penetrating  source  of  ionization  came  from 
beyond  the  Earth.  By  flying  his  balloons  at  night  and  during  a  solar  eclipse,  when 
the  high-altitude  signals  persisted,  Hess  showed  that  they  could  not  come  from  the 
Sun  but  rather  from  some  other  source. 

The  American  physicist  Robert  A.  Millikan  (1868-1953)  subsequently  used 
high-altitude  balloon  measurements  to  confirm  that  the  “radiation”  comes  from 
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beyond  the  terrestrial  atmosphere,  and  he  gave  it  the  present  name  of  cosmic  rays 
(Millikan  1926;  Millikan  and  Cameron  1926).  Millikan  believed  the  cosmic  rays 
were  gamma  rays  associated  with  the  synthesis  of  heavy  elements  deep  in  space, 
the  “birth  cries”  of  new  matter.  We  now  know  that  cosmic  rays  are  energetic 
charged  particles,  not  radiation,  and  more  likely  the  “death  cries”  of  massive 
exploding  stars. 

Global  measurements  showed  that  cosmic  rays  are  electrically  charged.  During 
an  ocean  voyage  in  1927  and  1928,  the  Dutch  physicist  Jacob  Clay  (1882-1955), 
for  example,  found  lower  cosmic-ray  intensity  near  the  Earth’s  Equator  than  at 
higher  terrestrial  latitudes;  his  results  were  confirmed  and  extended  between  1930 
and  1933  by  Arthur  H.  Compton  (1892-1962)  of  the  University  of  Chicago  (Clay 
1932;  Compton  1932).  Compton  conclusively  demonstrated  an  increase  in  cosmic- 
ray  intensity  with  terrestrial  latitude,  and  also  made  measurements  at  mountain 
altitudes,  where  the  increase  with  latitude  was  even  stronger.  His  results  indicated 
that  cosmic  rays  must  be  electrically  charged  particles  deflected  by  the  Earth’s 
magnetic  field  toward  its  magnetic  poles,  which  are  close  to  the  geographic  ones. 

The  amount  of  cosmic  rays  entering  the  Earth’s  atmosphere  varies  with  the 
amount  of  solar  activity  over  an  1 1  year  cycle;  the  amount  of  cosmic  rays  increase 
when  the  solar  activity  decreases  and  vice  versa.  This  inverse  correlation  has  been 
named  the  Forbush  effect,  after  Scott  Forbush  (1904-1984)  who  discovered  it 
(Forbush  1950).  It  has  been  attributed  to  an  enhancement  in  the  interplanetary 
magnetic  field  originating  from  the  Sun  at  times  of  increased  solar  activity  (Davis 
1955;  Meyer  et  al.  1956);  these  magnetic  fields  divert  the  cosmic  rays  and  keep 
them  from  encountering  the  Earth. 

The  charged  cosmic-ray  particles  also  are  deflected  and  change  direction  during 
encounters  with  the  interstellar  magnetic  field  that  winds  its  way  among  the  stars. 
Therefore,  we  cannot  look  back  along  their  incoming  path  and  tell  where  cosmic 
rays  originate;  the  direction  of  arrival  shows  only  where  they  last  changed  course. 
The  favored  hypothesis,  proposed  by  Walter  Baade  (1893-1960)  and  Fritz  Zwicky 
(1898-1974),  is  that  cosmic  rays  are  accelerated  to  their  tremendous  energy  during 
the  supernova  explosion  of  massive  stars  that  have  run  out  of  thermonuclear  fuel 
(Baade  and  Zwicky  1934a,  b,  c,  d).  Kulsrud  et  al.  (1972)  and  Blandford  and 
Ostriker  (1980)  have  discussed  the  acceleration  of  cosmic  rays  in  supernovae. 

By  the  late  1940s,  instruments  carried  by  high-altitude  balloons  established  that 
the  most  abundant  cosmic  ray  particles  arriving  in  the  Earth’s  upper  atmosphere 
are  protons  -  the  nuclei  of  former  hydrogen  atoms  -  and  the  second  most  abundant 
particles  are  helium  nuclei  -  the  alpha  particles.  Cosmic-ray  electrons  arriving  near 
the  top  of  the  atmosphere  took  longer  to  discover  (Earl  1961),  mainly  because  they 
are  far  less  abundant  than  the  cosmic-ray  protons  at  a  given  energy. 

About  79  %  of  the  cosmic  rays  arriving  at  the  top  of  the  Earth’s  atmosphere  are 
protons,  about  14.7  %  are  nuclei  of  helium,  and  roughly  1  %  is  carbon  and  oxygen 
nuclei  (Table  7.2). 

Although  they  are  relatively  few  in  number,  cosmic  rays  contain  phenomenal 
amounts  of  energy.  That  energy  usually  is  measured  in  units  of  electron  volts, 
abbreviated  eV  -  for  conversion  use  1  eV  =  1.602  x  10“ 19  J.  The  greatest  flux  of 
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Table  7.2  Average  fluxes  of  primary  cosmic  rays  at  the  top  of  the  atmospherea 


Type  of  nucleus 

2  1 

Flux  (particles  ms) 

Hydrogen  (protons) 

640 

Helium  (alpha  particles) 

94 

Carbon,  oxygen 

6 

a  The  flux  is  in  units  of  nuclei  per  square  meter  per  second  for  particles  with  energies  greater  than 
1.5  billion  (1.5  x  109)  electron  volts  per  nucleon,  denoted  1.5  GeV  per  nucleon,  arriving  at  the 
top  of  the  atmosphere  from  directions  within  30°  of  the  vertical 
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Particle  Energy  (eV) 


Fig.  7.2  Flux  of  cosmic  rays  The  energy  spectrum  of  cosmic-ray  particles  striking  the  outer 
atmosphere  of  the  Earth.  The  particle  flux  is  plotted  as  a  function  of  the  particle  energy  in  units  of 
electron  volts,  abbreviated  eV,  where  1  eV  =  1.602  x  1(T19  J  and  1  GeV  =  109  eV,  or 
1  billion  eV.  The  most  abundant  cosmic-ray  particles  are  protons  with  energies  of  about 
1.5  x  109  eV.  Every  second  about  640  enter  every  square  meter  of  the  Earth’s  outer  atmosphere. 
They  probably  are  accelerated  to  high  energy  during  the  supernova  explosions  of  massive  stars. 
One  cosmic -ray  proton  of  1010  eV  in  energy  enters  each  square  meter  of  the  Earth’s  outer 
atmosphere  every  second.  The  more  energetic  cosmic  ray  particles  of  10 14  eV  are  less  abundant, 
with  one  per  square  meter  every  year.  Solar  flares  can  emit  protons  with  energies  of  1010  eV  or 
less,  and  these  solar  energetic  particles  can  strike  the  Earth  when  the  solar  active  region  is  on  the 
near  side  of  the  Sun.  Cosmic  rays  with  low  flux  and  very  high  energy,  greater  than  1  million 
billion  eV,  or  1015  eV,  may  be  of  extragalactic  origin.  (From  “The  Life  and  Death  of  Stars”  by 
Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2012.  Reprinted  with  permission.) 


cosmic-ray  protons  arriving  at  the  Earth  have  energies  of  1-10  GeV,  or  at  1  billion 
(109)  to  10  billion  (1010)  electron  volts  of  energy.  By  way  of  comparison,  a  helium 
nucleus,  or  alpha  particle,  emitted  during  radioactive  decay  reaches  no  more  than  a 
million  eV  in  energy,  or  a  thousand  times  less  than  that  of  a  cosmic  ray  proton. 
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Table  7.3  Particle  speeds  at  different  particle  energies,  expressed  as  fractions  of  the  speed  of 
light,  ca 


Particle  kinetic  energy  (keV) 

Electron  speed  (times  c) 

Proton  speed  (times  c ) 

1  keV 

0.063 

0.0015 

1,000  keV  =  1  MeV 

0.94 

0.046 

100,000  keV  =  100  MeV 

0.999987 

0.43 

1,000,000,000  keV  =  1  GeV 

0.99999987 

0.88 

a  An  energy  of  one  kilo-electron  volt  is  1  keV  =  1.6022  x  10  16  J,  and  the  speed  of  light 
c  =  2.99792458  x  108  m  s_1 


Cosmic  rays  do  not  all  have  the  same  energy,  and  some  reach  an  energy  of  1  0  e V, 
more  than  10  billion  (1010)  times  the  abundant  ones;  Kotera  and  Olinto  (2011)  have 
reviewed  the  astrophysics  of  ultra  high-energy  cosmic  rays.  The  flux  of  the  most 
common,  lower-energy  cosmic  rays  at  109— 1010  eV  is  greatest  (Fig.  7.2). 

Even  at  an  energy  of  109  eV,  a  cosmic-ray  proton  must  be  traveling  at  88  %  of 
the  speed  of  light  (Table  7.3). 

Cosmic  rays  enter  the  atmosphere  with  such  great  energies  that  they  act  like 
colossal  atom  destroyers,  hitting  molecules  and  their  component  atoms  in  the 
upper  atmosphere  and  producing  showers  of  subatomic  debris,  known  as  sec¬ 
ondary  cosmic-ray  particles.  This  eventually  led  to  the  first  observation  of  an 
energetic  particle  that  does  not  belong  to  the  atom.  But  first  a  method  needed  to  be 
developed  to  detect  the  then-unknown  particle. 

Subatomic  particles  coming  down  through  the  atmosphere  are  detected  near  or 
at  the  ground  by  tracks  in  a  cloud  chamber,  which  creates  a  cloudy  mist  that 
precipitates  as  long  thin  bands  of  fog,  along  the  trajectory  of  the  particles.  This  is 
somewhat  similar  to  the  white  vapor  trails  of  jet  aircraft,  which  record  an  air¬ 
plane’s  movement  in  the  sky.  Fine  water  droplets  condense  from  the  jet  exhaust 
fumes  and  create  the  elongated  clouds. 

The  first  cloud  chamber,  invented  by  Charles  Thomas  Rees  Wilson 
(1869-1859),  was  very  simple,  consisting  of  a  metallic  cylinder  with  a  glass  cover 
and  a  piston  that  could  be  moved  up  and  down  from  below,  permitting  air  filled 
with  water  vapor  to  enter  the  space  above  it  (Wilson  1911).  When  the  piston  was 
lowered  quickly,  the  sudden  expansion  cooled  the  gas  so  that  a  mist  formed  in  the 
chamber,  like  the  foggy  mist  found  high  in  the  mountains.  The  water  vapor  in  the 
chamber  condensed  or  precipitated  out  on  any  ions  present,  making  the  ionized 
tracks  of  cosmic  rays  visible  and  showing  where  they  had  moved. 

When  the  cloud  chamber  is  placed  between  the  poles  of  a  strong  magnet,  the 
magnetic  field  exerts  a  force  on  any  charged  particle  entering  the  chamber,  which 
produces  a  curved  particle  track.  When  Carl  Anderson  (1945-2004),  who  was 
Millikan’s  student  at  the  California  Institute  of  Technology,  built  such  a  device, 
using  a  powerful  electromagnet,  he  found  in  1932  that  a  few  of  the  cosmic  ray 
showers  produce  two  similar  curved  trajectories  in  opposite  directions  (Fig.  7.3). 
Further  experiments  revealed  that  an  electron,  which  has  a  negative  charge,  was 
producing  one  of  the  curved  tracks,  whereas  a  particle  with  about  the  same  mass  as 
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Fig.  7.3  The  electron  and  the  positron  An  invisible  gamma-ray  photon  (top)  produces  an 
electron  and  a  positron  (short  for  positive  electron),  seen  by  curved  tracks  in  a  bubble  chamber. 
Both  the  electron  and  the  positron  are  bent  into  circular  tracks  by  the  instrument’s  magnetic  held, 
moving  in  opposite  directions  because  of  their  opposite  electrical  charge  and  spiraling  into  a 
smaller  circular  motion  as  they  lose  energy.  In  this  upper  pair ,  some  of  the  photon’s  energy  is 
taken  up  in  displacing  an  atomic  electron,  which  shoots  off  toward  the  bottom  left.  In  the  lower 
example,  all  of  a  gamma  ray’s  energy  goes  into  the  production  of  the  electron— positron  pair.  As  a 
result,  these  particles  are  more  energetic  than  the  upper  pair ,  and  their  tracks  do  not  curve  so 
tightly  in  the  chamber’s  magnetic  held.  (Schematic  of  a  Lawrence  Berkeley  Laboratory  bubble- 
chamber  image,  reproduced  by  Lrank  Close,  Michael  Marten,  and  Christine  Sutton  in  The 
Particle  Explosion ,  New  York:  Oxford  University  Press  1987.  (Lrom  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


the  electron  and  a  positive  charge  of  the  same  amount  but  opposite  sign  as  the 
electron  was  producing  the  track  curved  in  the  opposite  direction.  Anderson  had 
discovered  the  positron,  short  for  “positive  electron”  and  the  anti-matter  coun¬ 
terpart  of  the  electron  (Anderson  1932a,  1933).  The  energetic  cosmic  rays  had 
produced  a  new  type  of  particle  that  had  never  been  seen  before! 

Anderson  received  the  Nobel  Prize  in  Physics  in  1936  for  his  discovery  of  the 
positron,  sharing  the  award  with  Hess  for  his  discovery  of  cosmic  rays.  Wilson  had 
already  received  the  recognition  9  years  earlier,  for  his  method  of  making  the 
paths  of  electrically  charged  particles,  the  cosmic  rays,  visible  by  condensation  of 
vapor. 

As  it  turned  out,  Paul  Adrien  Maurice  Dirac  (1902-1984),  then  at  Cambridge 
University,  had  predicted  the  existence  of  anti-matter  (Dirac  1928).  For  Dirac, 
mathematical  beauty  was  the  most  important  aspect  of  any  physical  law  describing 
nature.  He  noticed  that  equations  that  describe  the  electron  have  two  solutions. 
Only  one  was  needed  to  characterize  the  electron;  the  other  solution  specified  a 
sort  of  mirror  image  of  the  electron  -  that  is,  an  anti-particle,  now  called  the 
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Fig.  7.4  Cosmic  ray 
shower  When  a  primary 
cosmic  ray  enters  the  Earth’s 
atmosphere  and  collides  with 
the  nucleus  of  an  atom  in  an 
atmospheric  molecule,  it  can 
produce  a  shower  of 
secondary  subatomic 
particles.  Here  we  show  the 
most  abundant  cosmic  ray 
particle,  the  proton, 
producing  a  neutron, 
designated  n\  pions  denoted 
by  7i,  muons  denoted  by  in , 
with  the  anti-muon-neutrino, 
vM,  gamma  rays,  electrons  e~ 
and  positrons  e+ 


positron.  At  the  time  of  his  discovery,  Anderson  nevertheless  was  unaware  of 
Dirac’s  theoretical  prediction  of  the  positron. 

The  nuclear  reaction  that  describes  the  creation  of  a  positron  is  known  as  the 
inverse  beta  decay  or  positive  beta  decay,  and  it  occurs  when  the  energy  of  a 
colliding  cosmic-ray  particle  is  used  to  convert  a  proton,  p,  into  a  neutron,  n.  The 
reaction,  written 


Energy  +  p  — >  n  +  +  ve,  (7.14) 

includes  the  production  of  the  positron,  denoted  e+  or  sometimes  /T,  which  carries 
away  the  charge  of  the  proton,  leaving  a  neutron  with  a  slight  increase  in  mass,  and 
the  electron  neutrino,  designated  ve,  to  balance  the  energy  books  on  the  two  sides 
of  the  reaction. 

Once  created,  anti-matter  does  not  stay  around  for  very  long,  for  any  anti-matter 
will  promptly  self-destruct  when  it  encounters  ordinary  matter.  When  an  electron 
and  positron  meet,  they  annihilate  one  another  and  disappear  in  a  puff  of  energetic 
radiation.  The  electron-positron  annihilation  reaction  is: 

e~  +  e+  — >  y  +  y,  (7.15) 

where  e~  is  the  electron,  and  y  denotes  a  gamma-ray  photon,  each  with  energy  of 
0.511  MeV  equal  to  mec  ,  which  corresponds  to  the  complete  destruction  of  an 
electron  of  rest  mass  me  where  c  is  the  speed  of  light.  As  discussed  subsequently, 
this  reaction  helps  to  produce  radiation  in  the  core  of  the  Sun,  and  also  is  observed 
during  explosive  flares  on  the  visible  solar  disk. 

When  examining  the  cloud-chamber  tracks  of  secondary  particles  produced  by 
cosmic  rays  entering  our  atmosphere,  scientists  discovered  other  previously 
unknown  subatomic  particles  (Fig.  7.4).  There  is  the  muon,  denoted  p  (Anderson 
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Table  7.4  Nobel  Prizes  related  to  experimental  investigations  of  subatomic  matter 


Years 

Person 

Accomplishment 

1927 

C.  T.  R.  Wilson  (Scottish 
physicist) 

For  his  method  of  making  the  paths  of  electrically 
charged  particles  visible  by  condensation  of 
vapor 

1936 

Victor  Franz  Hess  (Austrian 
physicist) 

Carl  Anderson  (American 
physicist) 

For  his  discovery  of  cosmic  radiation  [now  called 
cosmic  rays] 

For  his  discovery  of  the  positron 

1938 

Enrico  Fermi  (Italian  physicist) 

For  his  demonstrations  of  the  existence  of  new 
radioactive  elements  produced  by  neutron 
irradiation,  and  for  his  related  discovery  of 
nuclear  reactions  brought  about  by  slow  neutrons 

1939 

Ernest  Lawrence  (American 
physicist) 

For  the  invention  and  development  of  the  cyclotron 
and  for  results  obtained  with  it,  especially  with 
regard  to  artificial  radioactive  elements 

1948 

Patrick  M.  S.  Blackett  (English 
physicist) 

For  his  development  of  the  Wilson  cloud  chamber 
method,  and  his  discoveries  therewith  in  the 
fields  of  nuclear  physics  and  cosmic  radiation 

1950 

Cecil  Powell  (English  physicist) 

For  his  development  of  the  photographic  method  of 
studying  nuclear  processes  and  his  discoveries 
regarding  mesons  made  with  this  method 

1951 

John  Cockcroft  (English  physicist) 
Ernest  T.  S.  Walton  (English 
physicist) 

For  their  pioneering  transmutation  of  atomic  nuclei 
by  artificially  accelerated  atomic  particles 

1959 

Emilio  Segre  (Italian-born 
American  physicist) 

Owen  Chamberlain  (American 
physicist) 

For  their  discovery  of  the  antiproton 

1995 

Frederick  Reines  (American 
physicist) 

For  the  detection  of  the  neutrino 

2002 

Raymond  Davis  Jr. 

(American  astrophysicist) 
Masatoshi  Koshiba  (Japanese 
astrophysicist) 

For  their  pioneering  contributions  to  astrophysics,  in 
particular  for  the  detection  of  cosmic  neutrinos 

and  Neddermeyer  1937;  Neddermeyer  and  Anderson  1937)  and  the  pion,  desig¬ 
nated  7i,  discovered  using  photographic  emulsions  to  detect  secondary  cosmic  ray 
particles  at  high  altitudes,  including  the  Pic  Du  Midi  astronomical  observatory 
(Lattes  et  al.  1947).  The  muon  has  a  mass  between  that  of  the  electron  and  the 
positron.  The  pion  also  has  an  intermediate  mass.  Hideki  Yukawa  (1907-1981) 
had  predicted  the  existence  of  the  pion  more  than  a  decade  before  its  discovery,  in 
his  theory  for  the  nuclear  force  (Yukawa  1935,  1937).  Altogether,  the  investiga¬ 
tions  of  x-rays,  radioactivity,  and  cosmic  rays  resulted  in  quite  a  lot  of  recognition 
by  the  Nobel  Prize  in  Physics  (Table  7.4). 

Atomic  nuclei  from  outer  space  hit  the  upper  atmosphere  and  produce  a  debris 
of  pions.  These  soon  decay  into  muons,  which  are  always  raining  down  to  the 
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Earth.  Many  of  the  muons  decay  into  electrons,  positrons,  and  neutrinos  during 
their  flight  through  the  atmosphere,  and  the  very  energetic  muon  neutrinos  pene¬ 
trate  deep  underground. 

In  1998  a  Japanese  group  reported  that  observations  of  muon  neutrinos,  gen¬ 
erated  by  cosmic  rays  interacting  with  the  atmosphere,  indicate  that  neutrinos 
change  type  or  flavor,  oscillating  between  types  as  they  travel  through  matter 
(Fukada  et  al.  1998a).  There  were  roughly  twice  as  many  muon  neutrinos  coming 
from  the  atmosphere  directly  over  their  detector  than  those  coming  from  the  other 
side  of  the  Earth.  The  muon  neutrinos  are  produced  in  the  atmosphere  above  every 
place  on  our  planet,  but  some  of  them  apparently  disappeared  while  traveling 
through  the  Earth  to  arrive  at  the  detector  from  below  (Sect.  8.4). 

In  2011,  the  Alpha  Magnetic  Spectrometer  was  carried  by  Space  Shuttle  to  the 
International  Space  Center ,  where  it  will  search  for  evidence  of  dark  matter  and 
anti-matter  by  measuring  cosmic  rays  above  the  Earth’s  atmosphere. 

During  the  first  half  of  the  20th  century,  investigations  of  cosmic  rays  revealed 
many  unexpected  aspects  of  subatomic  particles,  and  this  has  inspired  the  con¬ 
struction  of  particle  accelerators,  which  have  been  used  to  discover  many  other 
subatomic  particles. 


7.6  Nuclear  Transformation  by  Bombardment 

What  happens  if  we  turn  radioactivity  around  and  instead  of  watching  the  nucleus 
of  an  unstable  heavy  atom  decay,  we  bombard  a  perfectly  normal,  lighter  nucleus 
with  very  energetic  particles?  Perhaps  this  normally  stable  element  could  be 
transformed  artificially  on  the  Earth  through  such  a  nuclear  bombardment.  After 
all,  that  is  what  the  cosmic  rays  were  doing  in  the  atmosphere,  resulting  in  all  kinds 
of  surprises,  turning  some  atoms  into  previously  unknown,  fundamental  particles. 

Medieval  alchemists  had  been  trying  to  change  one  element  into  another,  like 
lead  into  gold,  for  centuries,  but  they  always  failed  in  their  attempts  because  the 
chemical  and  thermal  reactions  they  employed  were  nowhere  near  energetic 
enough  to  crack  open  the  nucleus  of  an  atom. 

The  first  successful  attempts  to  transform  elements  in  the  terrestrial  laboratory 
occurred  when  Patrick  M.  S.  Blackett  (1897-1974),  a  recent  graduate  of  Cam¬ 
bridge  University,  directed  a  beam  of  fast  alpha  particles,  ejected  by  radioactive 
decay,  into  Wilson’s  cloud  chamber.  The  chamber  was  filled  with  normal  atmo¬ 
spheric  air,  which  is  composed  mainly  of  nitrogen  molecules.  Blackett  improved 
the  cloud  chamber  so  that  the  air  expanded  and  cooled  automatically,  and  he  took 
automatic  photographs  of  the  alpha-particle  tracks.  Most  of  them  passed  straight 
through  the  chamber.  However,  after  more  than  23,000  photographs  of  alpha 
particles  bombarding  nitrogen  in  the  cloud  chamber,  during  a  three-year  period 
from  1921  to  1924,  Blackett  finally  succeeded  in  recording  just  eight  head-on 
collisions  of  alpha  particles  with  the  nuclei  of  nitrogen  atoms  (Blackett  1925). 
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Fig.  7.5  Nuclear  transformation  When  an  alpha  particle,  or  helium  nucleus  denoted  4He,  is 
sent  through  a  cloud  chamber,  it  usually  passes  right  through  it,  with  a  trajectory  that  marks  out  a 
straight  line.  Occasionally,  the  alpha  particle  will  strike  the  nucleus,  14N,  of  a  nitrogen  atom  in  the 
air  within  the  chamber,  transforming  it  into  the  nucleus,  O,  of  an  oxygen  atom  with  the 
emission  of  a  proton,  the  nucleus  of  a  hydrogen  atom  and  denoted  !H.  Such  a  nuclear 
transformation  was  first  observed  in  cloud  chamber  photographs  taken  in  the  early  1920s  by 
Blackett  (1897-1974).  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by 
Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


On  each  photograph,  the  track  of  an  alpha  particle  suddenly  stopped,  being 
replaced  with  the  fine,  straight  track  of  an  ejected  proton  and  the  short,  stubby 
recoil  track  of  the  struck  nucleus  (Fig.  7.5).  However,  there  was  no  sign  of  the 
recoiling  alpha  particle.  The  collision  had  brought  the  alpha  particle  into  the 
nitrogen  nucleus,  forming  a  nucleus  of  a  form  of  oxygen.  The  reaction  can  be 
written  as  follows: 


4He  +  14N— >  170  +  1H,  (7.16) 

where  the  collision  of  an  alpha  particle,  or  helium  nucleus  4He,  with  a  nitrogen 
nucleus,  14N,  gave  rise  to  the  nucleus  of  oxygen,  170,  and  a  proton,  the  nucleus  of 
hydrogen,  1H.  The  old  alchemist’s  dream  finally  had  been  realized  in  a  laboratory 
on  the  Earth,  in  which  nuclear  transformation  had  been  induced  and  recorded. 

Enthusiastic  scientists  directed  beams  of  alpha  particles  into  many  other  ele¬ 
ments,  creating  nuclear  transformations  similar  to  the  one  observed  for  nitrogen. 
When  it  came  to  heavier  elements,  with  greater  nuclear  charge,  however,  a  nuclear 
transformation  could  not  be  produced.  The  greater  electrical  repulsion  of  nuclei 
with  atomic  number  Z  greater  than  1 8  always  withstood  the  bombardment  by  alpha 
particles. 

That  is  when  Gamow’s  paper  on  the  decay  of  heavy  radioactive  nuclei  had  a 
decisive  role  (Gamow  1928).  His  calculations  indicated  that  on  rare  occasions 
alpha  particles  could  tunnel  through  the  positively  charged  wall  of  a  nucleus,  but 
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that  fast  protons  more  easily  overcome  the  barrier  than  slow,  heavier  nuclei. 
Because  it  has  a  smaller  electrical  charge,  a  proton  suffers  less  nuclear  repulsion 
when  approaching  a  given  nucleus,  and  therefore  has  a  greater  probability  of 
penetrating  it.  Moreover,  because  the  proton  is  four  times  less  massive  than  an 
alpha  particle,  it  might  be  easier  to  accelerate  it  to  high  speed. 

Rutherford’s  student  John  Cockcroft  (1897-1967)  had  studied  electrical  engi¬ 
neering,  and  used  his  background  to  build  a  machine  that  would  accelerate  the 
hydrogen  nuclei,  the  protons,  in  very  intense  electrical  fields.  When  applying 
500,000  V,  Cockcroft  was  able  to  produce  a  parallel  beam  of  protons  traveling  at 

i—i  i 

the  speed  of  10  m  s  or  1  / 30th  the  speed  of  light. 

When  bombarding  lithium  with  high-energy  protons,  Cockcroft  and  his  col¬ 
league  Ernest  T.  S.  Walton  (1903-1995)  turned  the  lithium  nucleus  into  two  alpha 
particles  by  the  following  nuclear  reaction  (Cockcroft  and  Walton  1932a): 

*H  +  7  Li  — >  4He  +  4He,  (7.17) 

where  H  denotes  the  proton,  the  nucleus  of  a  hydrogen  atom,  Li  designates  the 
lithium  nucleus,  and  4He  denotes  an  alpha  particle,  the  nucleus  of  the  helium  atom. 

At  about  the  same  time,  Ernest  Lawrence  (1901-1958),  located  at  Berkeley 
University,  invented  the  cyclotron,  which  used  magnets  to  bend  the  path  of  a 
charged  particle  into  a  circular  orbit  that  passed  across  an  alternating  and  accel¬ 
erating  voltage.  As  the  radius  of  the  spiraling  orbit  increased,  so  did  the  particle’s 
speed;  therefore,  the  time  to  complete  each  orbit  remained  constant,  and  the 
particles  were  repeatedly  accelerated  in  equal  time  intervals  before  directing  them 
into  a  collision  with  something  else.  This  is  similar  to  pushing  a  child  in  a  swing  at 
the  same  part  of  its  swinging  motion,  pumping  it  up  to  greater  and  greater  speed. 

Lawrence  built  increasingly  larger  cyclotrons  at  Berkeley’s  Radiation  Labo¬ 
ratory,  whirling  the  protons  and  other  particles  to  faster  and  faster  speeds. 
A  cyclotron  of  just  0.69  m  in  diameter  was  able  to  accelerate  protons,  denoted  'H 
to  5  MeV  in  energy  (Lawrence  and  Livingston  1932,  1934;  Lawrence  et  al.  1932), 
and  a  1.5  m  cyclotron  was  used  to  accelerate  deuterons,  the  nuclei  of  heavy 
hydrogen  designated  "H,  to  an  energy  of  16  MeV  (Lawrence  and  Cooksey  1936). 
His  final  cyclotron,  with  a  diameter  of  nearly  5  m,  could  accelerate  the  deuterium 
nuclei  of  heavy  hydrogen  to  energies  of  195  MeV. 

Deuteron  bombardment  of  various  elements,  including  beryllium,  resulted  in 
the  creation  of  numerous  neutrons  and  artificial  radioactive  isotopes.  In  1939,  the 
Nobel  Prize  in  Physics  was  awarded  to  Lawrence  for  the  invention  and  develop¬ 
ment  of  the  cyclotron  and  for  results  obtained  with  it,  especially  with  regard  to 
artificial  radioactive  elements.  It  wasn’t  until  1951  that  John  Cockcroft  and  Ernest 
Walton  received  the  prize  for  their  pioneer  work  on  the  transmutation  of  atomic 
nuclei  by  artificially  accelerated  atomic  particles. 

These  early  accomplishments  stimulated  the  construction  of  increasingly 
powerful  particle  accelerators  that  accelerated  particles  to  more  and  more  energy. 
Eventually,  accelerators  were  built  that  reached  cosmic-ray  energies,  and  this 
resulted  in  the  discovery  of  new,  previously  unknown  particles,  such  as  the  tau 
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Fig.  7.6  Proton-proton  collision  Two  beams  of  protons  have  been  whirled  in  opposite 
directions  to  nearly  the  speed  of  light,  each  with  an  energy  of  7  trillion  electron  volts,  or  7 
TeV  —  1  x  1012eV  =  1.12  x  10-6J,  and  directed  into  collision  with  one  another  at  CERN’s 
Large  Hadron  Collider  (LHC).  This  image  displays  the  tracks  of  more  than  100  charged  particles 
as  they  fly  away  from  the  point  of  proton  collision.  Experiments  with  this  instrument  have 
provided  evidence  that  is  consistent  with  a  new,  previously  unseen  particle,  named  the  Higgs 
boson ,  in  the  mass-energy  range  of  1011  eV.  By  studying  the  collision  particle  debris,  including 
correlations  among  them,  scientists  hope  to  gain  an  improved  knowledge  of  how  subatomic 
particles  interact  at  extremely  high  energies,  including  the  hot,  dense  conditions  only  a  small 
fraction  of  a  second  after  the  “big  bang”.  At  the  point  of  proton  impact,  temperatures  of  more 
than  1  million,  or  10  ,  K  are  generated,  exceeding  100,000  times  the  temperature  at  the  center  of 
the  Sun.  In  particle  physics,  a  hadron  is  a  composite  particle  made  of  quarks  held  together  by  the 
strong  force;  the  best-known  hadrons  are  the  protons  and  neutrons,  which  are  components  of 
atomic  nuclei.  CERN  is  a  French  acronym  for  the  Conseil  Europeen  pour  la  Recherche  Nucleaire 
(the  European  Organization  for  Nuclear  Research).  The  Compact  Muon  Solenoid  (CMS)  particle 
detector  created  this  image.  (Courtesy  of  CERN.) 


lepton  found  as  the  result  of  particle  collisions  using  the  Stanford  Linear  Particle 
Accelerator  (Perl  et  al.  1975,  1976). 

Eventually,  the  2  km-diameter  Tevatron,  an  accelerator  at  the  Fermi  National 
Accelerator  Laboratory,  abbreviated  Fermi  lab,  used  thousands  of  electromagnets 
to  whirl  protons  up  to  1,000  GeV,  or  1  TeV,  of  energy.  By  2009  the  Large  Hadron 
Collider  at  the  European  Organization  for  Nuclear  Research  (known  as  CERN,  an 
acronym  for  Conseil  European  pour  la  Recherche  Nucleaire)  was  using  super¬ 
conducting  magnets  in  a  circular  tunnel  27  km  in  circumference  to  produce  two 
beams  of  protons  moving  in  opposite  directions,  and  eventually  directed  into 
collision  with  one  another  (Fig.  7.6).  It  has  provided  evidence  consistent  with 
another  previously  unknown  particle,  the  Higgs  boson. 

In  the  meantime,  on  the  eve  of  World  War  II  (1939-1945),  the  German 
radiochemist  Otto  Hahn  (1879-1968),  who  had  been  working  with  Fritz 
Strassmann  (1902-1980)  and  Lise  Meitner  (1878-1968),  showed  that  when  ura¬ 
nium  is  bombarded  with  neutrons  it  could  be  split  into  two  nearly  equal  fragments 
(Hahn  and  Strassmann  1939a,  b).  The  process  is  known  as  nuclear  fission,  anal¬ 
ogous  to  binary  fission  in  the  biological  sciences.  This  was  altogether  different 
from  the  proton  bombardment  of  much  lighter  nuclei,  for  the  heavy  uranium  was 
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broken  in  two  and  released  enormous  amounts  of  energy  in  the  process.  Moreover, 
it  also  freed  additional  neutrons,  which  in  turn  can  trigger  the  fission  of  neigh¬ 
boring  nuclei,  resulting  in  a  runaway  chain  reaction  if  not  properly  controlled. 
When  controlled,  a  nuclear  chain  reaction  can  be  used  to  generate  electricity  in 
nuclear  reactors;  when  uncontrolled,  it  has  applications  in  nuclear  weapons.  Hahn 
received  the  1944  Nobel  Prize  in  Chemistry  for  the  discovery  of  nuclear  fission, 
but  some  historians  believe  that  Meitner  should  have  shared  the  award. 

Meitner,  who  was  in  exile  in  Copenhagen,  confirmed  the  fission  of  uranium  by 
neutron  bombardment  (Meitner  and  Frish  1939),  and  the  Danish  physicist  Niels 
Bohr  described  it  to  Albert  Einstein  (1875-1955),  who  had  immigrated  to  the 
United  States.  By  the  time  Einstein  heard  about  uranium  fission,  World  War  II 
(1939-1945)  had  begun,  and  scientists  feared  that  Nazi  Germany  would  use  the 
discovery  to  build  an  atomic  bomb  to  conquer  the  world.  In  1939  and  1940 
Einstein  wrote  to  the  President  of  the  United  States,  Franklin  Roosevelt 
(1882-1945),  encouraging  a  program  that  would  achieve  a  nuclear  chain  reaction 
and  to  consider  the  development  of  “extremely  powerful  bombs”  that  the  Germans 
also  might  be  constructing.  The  concern  was  real,  for  it  was  later  discovered  that 
prominent  German  physicists,  including  Werner  Heisenberg  (1901-1976)  and  Carl 
von  Weizsacker  (1912-2007),  helped  the  Germans  investigate  the  feasibility  of 
constructing  nuclear  weapons  during  World  War  II. 

In  1942,  the  famed  American  physicist  J.  Robert  Oppenheimer  (1904-1967) 
invited  a  small  group  of  theoretical  physicists  to  the  University  of  California  at 
Berkeley  to  discuss  how  an  atomic  bomb  might  be  assembled.  Within  a  year,  they 
had  all  moved  to  Los  Alamos,  the  secret  laboratory  in  New  Mexico  where  thou¬ 
sands  of  scientists,  technicians,  and  military  personnel  worked  under  Oppenhei¬ 
mer’ s  enthusiastic  direction  to  create  the  first  atomic  bomb. 

Many  of  the  best  scientific  minds  in  the  country  were  involved,  including  Hans 
Bethe  (1906-2005),  head  of  the  Los  Alamos  theoretical  division;  Richard 
Feynman  (1918-1988),  who  worked  on  numerical  calculations  of  bomb  perfor¬ 
mance;  and  the  Italian  immigrant  Enrico  Fermi  (1901-1954),  who  helped  produce 
the  first  self-sustaining  nuclear  chain  reaction.  Another  famous  physicist,  Philip 
Morrison  (1915-2005),  accompanied  the  first  bombs  all  the  way  to  their  final 
flights,  caring  for  them  before  they  were  dropped  on  Japan  in  1945. 

Some  of  the  same  scientists  who  developed  the  atomic  bomb  also  showed  how 
nuclear  reactions  deep  inside  the  Sun  and  other  stars  makes  them  shine,  while  also 
producing  most  of  the  elements  heavier  than  helium  that  are  now  found  in  the 
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Chapter  8 

What  Makes  the  Sun  Shine? 


8.1  Can  Gravitational  Contraction  Supply  the  Sun’s 
Luminosity? 


When  we  measure  the  total  amount  of  sunlight  that  illuminates  and  warms  our 
globe,  and  extrapolate  back  to  the  Sun,  we  find  that  it  is  emitting  an  enormous 
power  of  385.4  million,  million,  million,  million,  or  3.828  x  1026,  watts,  where 
one  watt  =  1  J  s_1.  This  brilliance  is  far  too  great  to  be  perpetually  sustained,  and 
we  therefore  wonder  what  heats  the  Sun,  and  how  long  that  heat  will  last. 

In  the  mid-nineteenth  century,  the  German  physicist  Hermann  von  Helmholtz 
(1821-1894)  proposed  that  the  Sun’s  luminous  energy  is  due  to  its  gravitational 
contraction  (Helmholtz  1856,  1908).  If  the  Sun  were  gradually  shrinking,  the 
compressed  matter  would  become  hotter  and  the  solar  gases  would  be  heated  to 
incandescence;  in  more  scientific  terms,  the  Sun’s  gravitational  energy  would  be 
converted  slowly  into  the  kinetic  energy  of  motion  and  heat  up  the  Sun,  so  that  it 
would  continue  to  radiate.  This  follows  from  the  principle  of  conservation  of 
energy,  which  Helmholtz  was  one  of  the  first  to  propose  (Helmholtz  1847). 
It  states  that  energy  can  be  neither  created  nor  destroyed;  it  can  only  change  form. 

The  Irish  physicist  William  Thomson  (1824-1907),  later  Lord  Kelvin,  subse¬ 
quently  showed  that  the  Sun  could  have  illuminated  the  Earth  at  its  present  rate  for 
about  100  million  years  by  slowly  contracting  (Kelvin  1862,  1899;  Burchfield 
1975).  We  can  follow  his  reasoning  by  noting  that  the  gravitational  potential 
energy,  Q,  of  a  star  of  mass  Ms  and  radius  Rs  is  given  by  (Sect.  3.2): 


3  GMj 
5  Rs  ’ 


_  11  r\  _ ^ 

where  the  gravitational  constant  G  =  6.674  x  10  N  m^kg  .  The  change,  zJQ, 
in  gravitational  potential  energy  created  by  a  decrease  in  radius,  ARS,  is: 


AQ  = 
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where  the  radius  decrease  is  in  meters,  and  we  have  normalized  the  mass  and 

an 

radius  in  terms  of  the  Sun’s  mass  MQ  =  1.989  x  10  kg  and  the  Sun’s  radius 
rq  =  6.955  x  108  m. 

If  the  energy  change  provides  an  absolute  luminosity,  Ls ,  in  a  time  interval,  At, 
then  Ls  =  zlQ/ At,  and  the  rate  of  change  in  radius  is: 


A  Rs 
At 


5  LsR2s 
3GMj 


1.17  x  10“6 


where  the  Sun’s  absolute  luminosity  is  L0  =  3.828  x  10“  J  s  .  Since  one  year  is 
3.156  x  10  s,  this  shows  that  a  contraction  of  only  36.9  m  per  year  will  power 
the  Sun  at  the  present  rate.  That  is  a  very  small  change  considering  the  much  larger 
radius  of  the  Sun. 

The  problem  with  this  mechanism  is  the  long  duration  of  the  Sun  and  other 
stars.  If  the  source  of  a  star’s  present  luminosity  were  gravitational  potential 
energy,  then  the  current  radius  would  shrink  to  zero  in  the  Kelvin-Helmholtz  time, 
denoted  by  the  symbol  tk_h,  given  by  Kelvin  (1863): 


'Zk-h 
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=  5.95  x  1014 


Using  1  year  =  3.156  x  10  s,  the  Kelvin-Helmholtz  time  for  the  Sun  is  about 

,~J 

1.89  x  10  years.  A  contraction  of  36.9  m  per  year  will  power  the  Sun  at  its 
present  rate  by  converting  gravitational  potential  energy  into  heat.  However,  if  the 
Sun  continues  to  shine  this  way,  it  will  shrink  down  to  practically  nothing  and 
vanish  from  sight  in  18.9  million  years.  The  astonishing  thing,  which  was  not 
realized  at  the  time  Lord  Kelvin  wrote  his  articles,  was  the  Sun’s  durability.  It  has 
lasted  much  longer  than  he  envisioned. 

The  problem  is  much  worse  for  a  giant  star  that  has  both  a  larger  radius  and  a 
greater  luminosity.  As  shown  by  the  British  astronomer  Arthur  Stanley  Eddington 
(1882-1944),  gravitational  contraction  can  only  keep  a  giant  star  shining  for  no 
more  than  100,000  years,  and  he  therefore  proposed  that  energy  that  is  locked  up 
inside  the  atom  was  a  likely  alternative  candidate  for  making  the  stars  shine 
(Eddington  1920). 

Kelvin  had  assumed  the  Sun  would  be  about  as  old  as  the  Earth,  and  calculated 
the  age  of  the  Earth  under  the  assumption  that  it  began  in  an  initially  molten  state 
and  cooled  from  the  outside  in.  Using  the  equation  of  heat  conduction  with  the 
known  conductivity  of  rock,  he  calculated  that  it  would  take  about  100  million 
years  to  reach  the  then  observed  temperature  gradient  between  the  hot  lower  levels 
of  mines  and  the  cold  surface  of  the  Earth  (Kelvin  1863,  1899). 

The  discovery  of  radioactivity  provided  an  entirely  new  perspective  on  our 
planet’s  internal  heat  and  age.  Radioactive  elements  could  heat  the  planet  from 
inside,  emitting  energetic  particles  that  produced  a  rise  in  internal  temperature 
(Rutherford  1905),  so  the  Earth’s  hot  interior  is  not  a  result  of  cooling  from  an 
earlier,  hotter  state.  Radioactivity  also  clocked  the  Earth’s  age,  by  establishing  the 
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relative  amounts  of  radioactive  parent  elements  and  stable,  non-radioactive 
daughters.  When  this  ratio  is  combined  with  the  known  rates  of  radioactive  decay, 
they  indicated  that  the  Earth  is  at  least  2-3  billion  years  old  (Boltwood  1907; 
Rutherford  1929).  Modem  measurements  using  this  method  establish  an  age  of 
about  4.6  billion  years  for  the  Earth,  and  presumably  for  the  Sun. 

Moreover,  in  looking  back  at  the  Earth’s  history,  we  find  that  the  Sun  has  been 
shining  steadily  and  relentlessly  for  eons,  with  a  brilliance  that  could  not  be 
substantially  less  than  it  is  now.  The  radioactive  clocks  in  rock  fossils  indicate,  for 
example,  that  the  Sun  was  hot  enough  to  sustain  primitive  creatures  on  the  Earth 
3.4  billion  years  ago  (Tice  and  Lowe  2004). 

Even  in  the  early  20th  century,  no  one  had  any  clue  as  to  why  the  Sun,  or  any 
other  star,  could  shine  so  brightly  for  billions  of  years.  That  understanding  had  to 
await  the  discovery  of  subatomic  particles,  and  the  realization  that  the  Sun  is 
composed  mainly  of  hydrogen.  Of  equal  importance  was  the  fact  that  the  center  of 
the  Sun  is  much  hotter  than  an  ordinary  fire,  enabling  it  to  consume  atomic  nuclei. 


8.2  How  Hot  is  the  Center  of  the  Sun? 


The  most  abundant  atom  in  the  Sun  is  hydrogen,  with  a  single  proton  at  its  nuclear 
center  and  one  electron  outside  of  the  nucleus.  It  is  so  hot  within  most  of  the  Sun, 
except  its  cool  outer  atmosphere,  that  all  of  the  protons  and  electrons  have  been 
liberated  from  their  atomic  bonds  and  move  about  unattached  to  one  another. 

Protons  are  1,836  times  more  massive  than  electrons;  therefore  they  dominate 
the  gravitational  effects  inside  the  star.  The  temperature,  Tco,  at  the  center  of  the 
Sun  can  be  estimated  by  assuming  that  each  proton  down  there  is  hot  enough  and 
moving  fast  enough  to  counteract  the  gravitational  compression  it  experiences 
from  the  rest  of  the  star.  That  is  we  can  equate  the  thermal  energy  of  the  proton  to 
the  gravitational  energy,  expressed  by  the  relation: 
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where  the  Boltzmann  constant  k  =  1.38065  x  10  ^  J  K  ,  the  Newtonian 
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gravitational  constant  G  =  6.614  x  10  N  m^  kg  ,  the  mass  of  the  proton  is 
mP  =  1.6726  x  10-27  kg,  the  mass  of  the  Sun  M0  =  1.989  x  1030  kg  and  the 

o 

radius  of  the  Sun  RQ  =  6.955  x  10°  m. 

Solving  for  the  central  temperature  of  the  Sun  we  obtain: 


2  GttlpMq 

_ _ 

3  kRQ 


1.5  x  107K. 


So  deep  down  inside  the  Sun,  within  its  dense  central  core,  the  protons  have  a 
temperature  of  15  million  K.  This  and  other  physical  parameters  of  the  Sun  are 
given  in  Table  8.1. 
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Example:  Gas  pressure  and  mass  density  at  the  center  of  the  Sun 

The  gas  pressure  Pc®  at  the  center  of  the  Sun,  will  be  the  force  per  unit  area 
due  to  the  gravitational  force  per  unit  area  of  the  material  above  it.  To  a 
rough  approximation: 


Mq  CM  .  ,  gm 


© 


4 


4 


r ^ 


-0 


4 


r ^ 


1015  pascal, 


_  11  o  _ o 

where  the  gravitational  constant  G  =  6.61  A  x  10  N  m  kg  ,  the  Sun’s 
mass  Mq  =  1.989  x  1030  kg,  and  the  Sun’s  radius  R&  =  6.955  x  108  m. 
A  more  exact  calculation  gives  a  value  about  ten  times  as  large  or 
PCq  «  10 16  pascal. 

From  the  ideal  gas  law  (Sect.  5.4): 


Pce  =  NkTce  =  (8.8) 

mp 

where  N  is  the  number  density  of  protons,  the  Boltzmann  constant 

OQ  1 

k  =  1.38065  x  10  ^  J  K  ,  the  pc®  is  the  central  mass  density  of  the  Sun, 

_ 0^7 

and  the  mass  of  the  proton  mP  =  1.6726  x  10  kg.  Solving  for  the  central 
mass  density  we  obtain: 


Pc © 


PcQtnp 

kTC® 


105  kg  m  3, 


n 

using  a  central  temperature  of  TCQ  «  1.5  x  10  K. 


Such  extreme  central  conditions  were  recognized  more  than  a  century  ago, 
when  Jonathan  Homer  Lane  (1819-1880),  an  American  astronomer  and  inventor 
working  at  the  U.S.  Patent  Office,  assumed  that  gas  pressure  supports  the  weight  of 
the  Sun  (Lane  1870;  Ritter  1898).  Although  no  one  knew  about  nuclear  protons  at 
the  time,  Lane’s  basic  reasoning  still  applies.  The  hot  protons  move  about  with 
high  speeds,  frequently  colliding  with  one  another  and  creating  the  gas  pressure 
that  holds  up  the  Sun.  For  the  Sun,  a  central  temperature  of  15  million  degrees  K 
establishes  equilibrium  between  the  outward  pressure  of  the  moving  protons  and 
the  inward  gravitational  pull  at  the  Sun’s  center. 

In  any  layer  within  the  Sun,  the  weight  of  the  overlying  gas  must  be  equal  to  the 
outward-pushing  pressure;  otherwise  the  Sun  would  expand  or  contract,  which  is 
not  observed.  At  greater  distances  from  the  center,  there  is  less  overlying  material 
to  support  and  the  compression,  pressure,  and  temperatures  are  less,  so  the  solar 
material  becomes  progressively  thinner  and  cooler. 

As  one  might  suspect,  a  more  massive  star  produces  greater  compression  at  its 
center,  and  a  higher  central  temperature  is  required  to  hold  it  up.  The  central 
temperature,  Tcs ,  of  a  star  of  mass,  Ms.  and  radius,  Rs ,  is  given  by: 


8.2  How  Hot  is  the  Center  of  the  Sun? 
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Table  8.1  Physical  properties  of  the  Sun 

Mq  =  mass  of  Sun  =  1.989  x  1030  kg 
Rq  =  radius  of  Sun  =  6.955  x  108  m 

pQ  =  mean  mass  density  of  Sun  =  3Mq/(4uRq )  «  1,409  kg  m_ 

_ Q 

pcQ  =  central  mass  density  of  Sun  =  151,300  kg  m 

Tcq  =  central  temperature  of  Sun  =  2Gm7>M0/(3/:7?0)  «  1.5  x  107  K 

VesCQ  =  escape  velocity  from  photosphere  of  Sun  =  (2GM .  / R  :  )'  2  6.177  x  105  m  s 

D&  =  1  AU  =  mean  Earth-Sun  distance  =  1.4959787  x  10u  m  «  1.496  x  1011  m 

0Q  =  Rq/D&  =  angular  radius  of  Sun  =  959.63"  where  1"  =  1  s  of  arc 

(At  the  Sun  1  s  of  arc  =  1"  =  7.253  x  105  m) 

PrQ  =  sidereal  rotation  period  of  visible  solar  disk  at  the  equator  =  25.67  days 
VQ  =  rotation  velocity  of  visible  solar  disk  at  the  equator  =  1,971  m  s_1 

_ i  _ 9 

/©  =  solar  constant  =  1,361  J  s  m 

L0  =  absolute  luminosity  of  Sun  =  47c/0Z)0  %  3.828  x  1026  J  s-1 

T ©  =  effective  temperature  of  visible  solar  disk  =  [Lq/^tlgR^)]1^  ^ j  5,780  K 

mv0  =  apparent  visual  magnitude  of  the  Sun  =  —26.74  mag 

mboi>(D  =  apparent  bolometric  magnitude  of  the  Sun  =  —26.83  mag 

Mv0  =  absolute  visual  magnitude  of  the  Sun  =  +4.83  mag 

Mboio  =  absolute  bolometric  magnitude  of  Sun  =  +4.74  mag 

Bq  =  magnetic  held  strength  at  visible  solar  disk  =  100-1,000  G  =  0.01-0.1  T 

X  =  mass  fraction  of  hydrogen  =  0.7154 

Y  =  mass  fraction  of  helium  =  0.2703 

Z  =  mass  fraction  of  all  other  atoms  =  0.0142 

Age  =  4.6  x  109  year 


Tcs  —  Tcq 


(MsRq\ 

\MqRs) 


(8.10) 


It  is  the  moving  particles  inside  any  star,  including  the  Sun,  which  holds  up  the 
star.  This  motion,  pushing,  and  pressure  of  the  particles  prevent  a  star  from  col¬ 
lapsing  under  its  enormous  weight.  What  keeps  the  particles  down  there  hot,  to 
sustain  their  rapid  motion?  It  is  nuclear- fusion  reactions  in  the  compact,  dense  core 
of  a  star  that  energizes  the  particles  there,  sustaining  their  heat  and  making  them 
move  rapidly.  Once  the  nuclear  reactions  begin,  the  subatomic  energy  that  is 
liberated  keeps  the  nuclei  sufficiently  hot  to  ensure  continuation  of  the  reactions. 


8.3  Nuclear  Fusion  Reactions  in  the  Sun’s  Core 
8.3.1  Mass  Lost  is  Energy  Gained 

The  only  known  method  for  keeping  the  Sun  shining  with  its  present  luminosity 
for  billions  of  years  involves  nuclear  fusion  reactions  under  the  intense  pressures 
and  exceptionally  high  temperatures  at  great  depths  within  the  Sun.  They  are 
termed  “nuclear”  because  it  is  the  interaction  of  atomic  nuclei  that  powers  the 
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Sun.  In  nuclear-fusion  reactions,  two  or  more  atomic  nuclei  fuse  together  to 
produce  a  heavier  nucleus,  releasing  energy,  subatomic  particles,  and  radiation. 
For  the  Sun,  it  is  protons,  the  nuclei  of  abundant  hydrogen  atoms  that  fuse  together 
to  make  the  nuclei  of  helium  atoms,  the  next  most  abundant  element  in  the  Sun. 

Energy  can  be  derived  only  from  energy,  and  the  source  of  energy  in  nuclear 
fusion  is  mass  loss.  The  basic  idea  was  provided  by  Albert  Einstein  (1879-1955)  in 
his  Special  Theory  of  Relativity ,  which  included  the  famous  formula  E  =  me  for 
the  equivalence  of  mass,  m,  and  energy,  E  (Einstein  1905a,  b,  1906,  1907). 

o  _ 1 

Because  the  speed  of  light  c  =  2.9979  x  10  m  s  ,  or  about  300  million  meters 
per  second,  is  a  very  large  number,  only  a  small  amount  of  mass  is  needed  to 
produce  a  huge  amount  of  energy. 

Important  evidence  for  the  then  unknown  source  of  the  Sun’s  energy  was  being 
obtained  at  the  Cavendish  Laboratory  at  Cambridge  University  in  England  about  a 
decade  after  Einstein’s  seminal  work.  Here  Ernest  Rutherford  (1871-1937) 
showed  that  the  massive  nuclei  of  all  atoms  are  composed  of  hydrogen  nuclei, 
which  he  named  protons.  At  about  the  same  time,  the  chemist  Francis  Aston 
(1877-1945),  also  working  at  the  Cavendish,  invented  the  mass  spectrograph  and 
used  it  to  show  that  the  mass  of  the  helium  nucleus  is  slightly  less  massive,  by  a 
mere  0.7  %,  than  the  sum  of  the  masses  of  the  four  hydrogen  nuclei,  or  protons, 
that  enter  into  it  (Aston  1919,  1920). 

While  Rutherford  and  Aston  were  discovering  the  inner  secrets  of  the  atoms, 
the  astronomer  Arthur  Stanley  Eddington  (1882-1944),  director  of  the  nearby 
Cambridge  Observatory,  was  examining  the  internal  workings  of  the  Sun  and  other 
stars,  and  reasoned  that  the  stars  are  the  crucibles  in  which  the  heavier  elements 
are  made  from  lighter  ones. 

Realizing  that  such  stellar  alchemy  would  release  energy,  Eddington  (1920) 
proposed  that  hydrogen  is  transformed  into  helium  inside  stars,  with  the  resultant 
mass  difference  released  as  energy  to  power  the  Sun.  The  mass  that  is  lost  goes 
into  energizing  the  Sun  and  other  stars.  The  mass  difference,  Am ,  is  converted  into 
energy,  AE,  to  power  the  Sun,  all  in  accordance  with  Einstein’s  equation 
AE  =  Amc  .  Eddington  rightly  concluded  that  this  could  supply  the  Sun’s  current 
luminous  output  for  an  estimated  15  billion  years. 

During  the  ensuing  decade  it  was  realized  that  the  lightest  known  element, 
hydrogen,  is  the  most  abundant  element  in  the  Sun,  so  hydrogen  nuclei,  or  protons, 
must  play  the  dominant  role  in  nuclear  reactions  within  our  daytime  star.  Physi¬ 
cists  were  nevertheless  convinced  that  protons  could  not  react  with  each  other 
inside  the  Sun.  Even  in  a  high-speed,  head-on  collision  at  the  enormous  central 
temperature  of  the  Sun,  two  protons  did  not  have  sufficient  energy  to  overcome 
their  mutual  electrical  repulsion  and  merge  together.  The  thermal  velocity  of  the 
average  proton  at  a  temperature  of  15  million  K  was  far  too  slow,  and  the  central 
temperature  had  to  be  at  least  a  thousand  times  hotter  to  raise  the  average  kinetic 
energy  of  the  protons  above  the  electrical  barrier  (Focus  8.1). 
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Focus  8.1  The  temperatures  necessary  for  thermonuclear  reactions 

Protons  are  positively  charged,  and  like  charges  repel  one  another  with  an 
electrostatic  force  given  by  Coulomb’s  law  (Coulomb  1785),  discovered  by 
the  French  physicist  Charles  Augustin  de  Coulomb  (1736-1806).  This  means 
that  there  is  an  electrified  barrier  that  prevents  protons  from  becoming  too 
close  to  each  other. 

The  electrical  force,  F,  on  a  charge  qi  due  to  the  presence  of  another 
charge  q2  separated  from  it  by  a  distance,  D,  is  given  by  Coulomb’s  law: 

F  =  1  qiqi  »  8.9875  x  109^r,  (8.11) 

4m0  D2  D2  y  ' 

_  1  O  _  1 

where  n  «  3.14159  and  the  electric  constant  s0  =  8.8542  x  10  F  m  . 
A  positive  force  implies  an  electrical  repulsion  and  a  negative  force  implies 
an  electrical  attraction. 

The  repelling  force  is  proportional  to  the  square  of  the  electrical  charge  of 
the  protons,  and  it  is  inversely  proportional  to  their  separation.  Therefore,  the 
force  of  repulsion  between  two  protons  becomes  increasingly  larger  as  they 
are  brought  closer  together.  Stated  another  way,  the  protons  do  not  move  fast 
enough,  with  enough  kinetic  energy,  to  overcome  the  electrical  barrier  and 
merge  together. 

The  potential  energy  U2\  at  charge  1  by  charge  2  is  given  by: 


1  qiq2 

4nso  D 


(8.12) 


where  for  two  protons  q\  =  q2  =  e,  the  elementary  charge.  To  determine  the 
velocity  that  is  just  fast  enough  for  one  proton  to  move  into  another,  we 
equate  the  kinetic  energy  of  motion  to  the  electrical  potential  energy,  or: 


1 

-mpV 


4t18qD  ’ 


and  solve  for  the  velocity,  V, 


(8.13) 


r  2e2  1 1/2 

-  =  1.66  x  107  m  s  1 , 

47T£o  mpD 


(8.14) 


where  the  charge  of  a  proton  is  e  =  1.6022  x  10“ 19  C,  the  mass  of  the 

_ 0^7 

proton  is  mP  =  1.6726  x  10  “  kg,  and  we  have  assumed  that  the  separation 
of  two  protons  is  comparable  to  the  size  of  an  atomic  nucleus,  or 
D  =  10“ 15  m,  when  they  touch  and  merge  together. 

The  mean  speed  of  a  proton  inside  the  Sun  is  determined  by  equating  the 
kinetic  energy  of  the  moving  proton  to  the  thermal  energy  that  keeps  it  hot 
(Sect.  5.2),  or 
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\^pV]h=\kT,  (8.15) 

where  T  is  the  temperature  and  the  Boltzmann  constant 
k  =  1.38066  x  10-23  J  K_1.  Solving  for  the  thermal  velocity,  Vth\ 

1/2 

«  157  Tl/2  m  s-1. 

For  the  center  of  the  Sun,  where  the  temperature  T  =  15  million  K,  the 
speed  is  only  Vth  «  6.1  x  105  m  s”1,  more  than  twenty  times  lower  than 
that  required  to  overcome  the  electrical  repulsion  between  two  protons. 
A  proton  would  have  to  be  at  a  temperature  of  T  «  1010,  or  10  billion,  K  for 

'-j  _ i 

the  nuclear  fusion  to  occur  at  the  speed  of  1.6  x  10'  m  s  . 

Even  at  the  enormous  central  temperature  of  the  Sun,  two  protons  do  not 
seem  to  have  enough  energy  to  overcome  their  electrical  repulsion  and  move 
into  each  other.  As  subsequently  described  in  the  text,  when  the  quantum- 
mechanical  penetration  probability  of  a  nucleus  is  combined  with  the 
Maxwellian  distribution  of  particle  speeds,  a  few  protons  at  the  higher  speeds 
can  fuse  together  in  the  center  of  the  Sun. 


(8.16) 


As  it  turned  out  Eddington  was  correct,  for  the  Russian  physicist  George 
Gamow  (1904-1968)  already  had  provided  an  explanation  for  this  paradox.  While 
at  the  University  of  Gottingen,  in  what  is  now  Western  Germany,  Gamow  showed 
how  a  subatomic  particle  could  escape  from  the  nucleus  of  a  radioactive  atom.  He 
used  the  quantum  theory  of  the  very  small,  in  which  a  subatomic  particle  can  act 
like  a  spread-out  wave  with  no  precisely  defined  position,  to  determine  the 
penetrability  of  the  barrier  surrounding  a  nucleus  during  radioactive  decay 
(Gamow  1928;  Gurney  and  Condon  1928).  This  tunnel  effect  works  in  the  opposite 
way  when  nuclear  particles  merge  rather  than  separate,  and  it  explains  why  nuclei 
can  fuse  together  inside  a  star. 

With  Gamow’ s  encouragement,  two  young  students,  the  English  astronomer 
Robert  d’Escourt  Atkinson  (1898-1982)  and  the  Austrian  physicist  Fritz  Houter- 
mans  (1903-1966),  applied  and  extended  his  quantum-tunneling  theory  to  the 
process  of  nuclear  fusion  in  stars  (Atkinson  and  Houtermans  1929).  Using 
Gamow ’s  penetration  probability  with  the  Maxwellian  distribution  of  particle 
speeds,  they  showed  that  the  fusion  of  light  nuclei  could  create  stellar  energy  in 
accordance  with  Einstein’s  formula  connecting  mass  loss  to  energy  gained. 

It  was  immediately  clear  that  the  most  effective  nuclear  interactions  were  those 
involving  light  nuclei  with  low  charge  and  that  only  a  few  particles  in  the  high¬ 
speed  tail  of  the  Maxwellian  speed  distribution  would  be  able  to  penetrate  nuclei. 
For  this  reason,  nuclear  reactions  proceed  slowly  in  the  Sun  and  other  stars. 
Atkinson  and  Houtermans  also  demonstrated  that  the  rate  of  nuclear  reactions 
substantially  increases  with  the  increasing  central  temperature  of  stars. 


8.3  Nuclear  Fusion  Reactions 
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Atkinson  then  addressed  this  problem  in  far  greater  detail,  arguing  that  the 
observed  relative  abundances  of  the  elements  might  be  explained  by  the  synthesis 
of  heavy  nuclei  from  lighter  ones  within  stars  (Atkinson  1931).  By  this  time  the 
great  stellar  abundance  of  hydrogen  had  been  established,  and  Atkinson  subse¬ 
quently  demonstrated  that  the  most  likely  nuclear  reaction  within  stars  is  the 
collision  of  two  protons  to  form  a  deuteron  and  a  positron  (Atkinson  1936). 

The  probability  of  penetration  depends  on  the  kinetic  energy,  or  speed,  and  the 
electrical  charges  of  the  colliding  particles.  They  have  a  greater  impact  when 
moving  at  faster  speeds,  but  the  electrical  repulsion  increases  with  the  charge.  For 
this  reason,  the  lightest  nuclei,  the  protons,  are  more  likely  to  fuse  together  than 
the  heavier  ones,  which  have  greater  nuclear  charge  and  mass.  That  is,  the  lightest 
elements  carry  the  smallest  charge,  with  less  electrical  repulsion  between  them, 
and  they  also  move  faster  than  heavier  nuclei  at  a  given  temperature. 

Even  with  this  enhanced  penetration  probability,  the  average  kinetic  energy  of 
two  colliding  protons  is  not  enough  for  fusion  to  occur  at  the  center  of  the  Sun. 
However,  the  particles  in  a  hot  gas  do  not  all  move  at  the  same  average  velocity. 
There  is  a  relatively  small  number,  in  the  high-speed  tail  of  the  Maxwellian  speed 
distribution,  which  moves  at  much  faster  speeds,  permitting  fusion  once  the 
penetration  probability  also  is  considered. 

The  number  of  high-speed  protons  decreases  exponentially  with  increasing 
speed  and  kinetic  energy,  whereas  the  tunneling  probability  increases  exponen¬ 
tially  with  the  energy  (Fig.  8.1).  In  the  overlap  region,  where  the  exponential 
decline  meets  the  exponential  rise,  there  are  protons  that  can  participate  in  the 
nuclear  fusion  reactions  that  make  the  Sun  shine.  Thus,  protons  do  sometimes  get 
close  enough  to  move  into  each  other  and  fuse  together,  even  though  their  average 
energy  is  well  below  that  required  to  overcome  their  electrical  repulsion. 

If  two  nuclei,  designated  by  1  and  2,  undergo  fusion,  the  nuclear  reaction  is 
written 


1+2-^3+4  +  e,  (8.17) 

where  Q  is  the  amount  of  energy  released  during  this  reaction,  often  given  in  units 
of  MeV  with  1  MeV  =  1.602  x  10“ 13  J.  The  reaction  rate,  r12,  is  given  by 

rn  =  N1N2<(tV>  mV1,  (8.18) 

where  N]  and  N2  are  the  number  densities  of  the  two  nuclei  and  <aV>  denotes  an 
averaged  product  of  the  interaction  cross-section  o  and  the  relative  velocity  V  of 
the  two  nuclei.  It  takes  into  account  both  the  tunneling  cross  section  and  the 
Maxwellian  speed  distribution,  which  depends  on  the  temperature. 

As  an  example,  the  fusion  of  two  protons  at  the  center  of  the  Sun,  where  the 
temperature  is  15.6  million  K,  has  <cfV>  =  1.19  x  10-49  m3  s-1,  which  is 
determined  from  both  the  theory  of  thermonuclear  reactions  and  laboratory  mea¬ 
surements  of  the  proton-proton  reaction. 
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Fig.  8.1  Nuclear  tunneling  in  the  Sun’s  core  The  high-speed  tail  (left)  of  the  Maxwellian 
distribution  of  nuclear  particle  speeds  plotted  as  a  function  of  kinetic  energy,  E,  for  protons  near 
the  center  of  the  Sun.  The  P(E)  function  (right)  describes  the  quantum  mechanical  probability  of 
two  protons  overcoming  the  electrical  repulsion  between  them;  it  depends  on  the  Gamow  energy 
Eg  and  the  relative  energy  E  of  the  two  colliding  protons.  The  shaded  area  (center)  illustrates  the 
function  f(E),  which  is  the  product  of  the  speed  and  penetration  function;  it  determines  the 
nuclear  reaction  rate  in  the  core  of  the  Sun 


In  stellar  model  calculations  one  uses  the  mass  density,  p,  and  the  mass  fraction 
Xt  of  the  nuclei  i,  which  are  related  to  the  number  density  Nt  by: 

Ni  =  pNA J  (8.19) 

A 

OO  _  1 

where  the  Avogadro  constant  NA  =  6.022141  x  10~  mol  ,  and  At  is  the  atomic 
mass  of  nuclear  species  i  in  atomic  mass  units. 

The  mean  lifetime  of  nucleus  1  until  destruction  by  fusion  with  nucleus  2  is: 


N2<gV>  ' 

where  N2  is  the  number  density  of  nucleus  2  and  <aV>  is  the  average  product  of 
the  interaction  cross  section  a  and  the  relative  velocity  V  of  the  two  nuclei. 

c  _ Q 

At  the  center  of  the  Sun  the  mass  density  is  p  =  1.513  x  10  kgm  ,  and  since 
the  mass  is  established  by  the  protons  with  a  mass  mP  =  1.6726  x  10  ~  kg,  we 
have  Ni  =  N2  =  p/mP  «  10  m_  .  The  mean  lifetime  for  destruction  of  a  proton 
by  fusion  with  another  proton  at  the  center  of  the  Sun  is  therefore  t  ~  10  s  « 
3  billion  years,  and  since  the  density  decreases  with  distance  from  the  very  center  of 
the  Sun  there  is  plenty  of  time  to  keep  the  Sun  shining  by  this  reaction. 

The  power  generated  per  unit  mass,  s12,  by  this  reaction  is  given  by: 
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S12  =  ^  J  S_1  kg-1.  (8.21) 

The  total  energy  released  by  the  proton-proton  chain  of  reactions  that  make  the 
Sun  shine  is  Q  «  26  MeV  %  4  x  10-12  J,  and  a12  ~  10-3  J  s-1  kg-1.  If  the 
core  of  the  Sun  has  a  mass  of  0.2  M©  «  0.4  x  10  kg,  the  product  of 

a/:  _ i 

Ei 2  x  0.2  M0  ^  4  x  10"  J  s  ,  which  is  the  present  luminosity  of  the  Sun,  L©. 


Example:  How  frequent  are  proton  collisions  at  the  center  of  the  Sun? 

The  number  density,  NP,  of  protons  at  the  center  of  the  Sun  is  NP  = 
Pco/mp  «  10  m_  ,  where  the  central  density  pco  =  1.5  x  10  kg  m_ 

_ 0^7 

and  the  proton  mass  mP  =  1.6726  x  10  "  kg.  That  is  100  million  trillion 
trillion  protons  per  cubic  meter  at  the  center  of  the  Sun.  The  mean  free  path  / 
between  collisions  is  /  =  1/ (NPnR  )  (Sect.  5.2),  the  reciprocal  of  the  product 
of  the  proton’s  area,  nR  and  the  number  density  of  protons.  With  a  proton 

Q  O  _ Q  _  1C 

number  density  of  10  “  m  a  proton  with  a  radius  of  10  m  will  move 
about  0.003  m  before  striking  another  proton.  The  thermal  velocity, 
Vth  =  (3kT/mP)1^2  ^  6.1  x  105  m  s-1  for  a  proton  at  a  temperature  of 
Tcq  =15  million  degrees,  where  the  Boltzmann  constant 

OQ  1 

k  =  1.38065  x  10  J  K  ,  so  the  time  between  collisions  is 
l /Vth  ~  5  x  10-9  s  and  there  are  about  200  million  collisions  occur  every 
second. 


Despite  the  exceptionally  large  number  of  collisions,  a  fusion  reaction  between 
two  colliding  protons  does  not  happen  very  often.  The  protons  nearly  always 
bounce  off  one  another  without  triggering  a  nuclear  reaction  during  a  collision. 
Even  with  the  help  of  tunneling,  the  average  proton  must  make  about  10  trillion 
trillion,  or  10  ,  collisions  before  nuclear  fusion  can  happen.  It  only  occurs  when 
the  collision  is  almost  exactly  head  on,  and  between  exceptionally  fast  protons. 
This  explains  why  the  Sun  does  not  expend  all  of  its  nuclear  energy  at  once,  like  an 
immense  hydrogen  bomb. 


8.3.2  Understanding  Thermonuclear  Reactions 

What  is  of  primary  interest  in  fueling  stars  is  the  rate  at  which  the  nuclear  reactions 
occur  and  the  power  they  generate.  But  before  considering  these  details,  it  is  useful 
to  know  the  units  that  nuclear  astrophysicists  commonly  use  when  considering 
thermonuclear  reactions.  The  include: 
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Size  =  1  Fermi  =  1  fm  =  10  15  m 
Cross  section  =  1  barn  =  10  28  nr  =  100  fm2 

Energy  =  1  MeV  =  1.60217646x  10“13  J  =  1000  keV  (8.22) 

Mass  =  1  atomic  mass  unit  =  u  =  1.66053886  x  10  27  kg 
Rest  mass  energy  =  Mass  x  c2 


Proton 
Neutron 
Helium  nucleus 
Electron 
Atomic  mass  unit 


MP  =  938.2720  MeV 
mn  =  939.5654  MeV 
3727.379  MeV  =  Alpha  particle 
me  =  0.511  MeV 
u  =  93 1 .494  MeV 


Boltzmann  constant  =  k  =  1.38065  x  10  23  J  K  1 

=  8.61733  x  10-11  MeV  KT1 


(8.23) 


The  mass  of  an  atom’s  nucleus  is,  for  example,  always  less  than  the  sum  of  the 
individual  masses  of  its  constituent  protons  and  neutron,  or  nucleons.  Energy  is 
removed  to  bind  the  subatomic  nucleons  together  and  form  a  nucleus,  and  this 
energy  has  mass.  This  mass  is  removed  from  the  total  mass  of  the  original  par¬ 
ticles,  and  it  is  missing  in  the  resulting  nucleus.  The  missing  mass  is  known  as  the 
nuclear  mass  defect ,  and  represents  the  energy  released  when  the  nucleus  formed. 
The  mass  defect,  AM ,  for  a  nucleus  containing  A  nucleons,  Z  protons,  and  A- 
Z  neutrons  is: 


AM  \Zi7ip  -\~  {A  Z)  mn  winuc.  i 


(8.24) 


where  A  is  the  mass  number  of  the  nucleus,  Z  is  the  atomic  number,  mP  is  the  mass 
of  the  proton,  mn  is  the  mass  of  the  neutron,  and  mnuc  is  the  mass  of  the  nucleus. 

The  binding  energy ,  EB ,  used  to  assemble  the  nucleus  from  its  constituent 
nucleons  is 


Eb  =  AM  c2.  (8.25) 

The  binding  energy  measures  how  tightly  bound  a  nucleus  is.  The  binding 
energy  is  the  energy  required  to  separate  the  nucleus  into  its  constituent  nucleons, 
and  it  also  is  a  measure  of  the  energy  released  during  nuclear  fusion  of  light  nuclei 
into  heavier  ones.  The  binding  energy  released  during  nuclear  fusion  is  responsible 
for  energy  production  in  the  interior  of  stars. 

The  binding  energy  per  nucleon ,  f  is  given  by  /=  EB/A  and  illustrated  in 
Fig.  8.2.  It  shows  that  nuclei  near  iron  have  the  largest  nuclear  binding,  while 
lighter  and  heavier  nuclei  are  less  tightly  bound.  The  graph  indicates  that  energy 
can  be  released  by  combining  lighter  nuclei  into  heavier  ones,  known  as  nuclear 
fusion ,  provided  they  are  less  massive  than  iron.  For  example,  it  shows  that  about 
7  MeV  x  4  =  28  MeV  in  energy  would  be  released  during  the  fusion  of  four 
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Fig.  8.2  Binding  energy  The  binding  energy  per  nucleon,  denoted  /,  is  shown  as  a  function  of 
atomic  mass  number,  A.  The  highest  nuclear  binding  energy  is  found  near  iron,  denoted  56Fe. 
Energy  can  be  released  by  the  nuclear  fusion  of  lighter  nuclei  into  heavier  ones,  provided  they  are 
less  massive  than  iron.  For  example,  the  Sun  shines  by  fusing  four  protons,  denoted  lH,  into  one 
helium  nucleus,  designated  4He.  About  7  MeV  is  released  for  each  nucleon,  and  since  four 
nucleons  are  involved  the  total  energy  release  in  the  synthesis  of  one  helium  nucleus  is  about 
28  MeV.  Further  energy  is  released  in  giant  stars  by  the  nuclear  fusion  of  helium  into  carbon, 
denoted  C,  but  note  that  energy  would  not  be  released  by  fusing  helium  into  lithium,  denoted 

/T  f-j 

°Li  or  Li,  due  to  its  lower  binding  energy  per  nucleon 


protons,  denoted  1H,  into  one  helium  nucleus,  designated  4He;  the  exact  value  of 
the  energy  release  is  Q  =  26.73  MeV. 

The  amount  of  energy  released  during  nuclear  reactions,  or  the  Q  value,  results 
from  the  mass  difference  between  the  initial  and  final  nuclei.  The  assumption  that 
nuclear  mass  values  are  equal  to  the  measured  mass  values  usually  makes  little 
difference,  for  the  binding  energy  and  mass  defect  contribution  of  electrons  are 
negligible  compared  to  those  of  the  nucleons.  (As  discussed  subsequently,  there  is 
a  small  difference  owing  to  the  production  of  positrons.) 

The  helium  is  synthesized  by  fusing  four  protons  together,  so  the  fraction  of 
mass  converted  during  each  one  of  these  transmutations  is  Q/(4mPc  )  =  0.007, 
since  Q  =  26.73  MeV  and  the  energy  equivalent  of  the  mass  of  the  proton  is 
mPc  =  938  MeV.  Under  the  assumption  that  the  Sun  continues  to  emit  its  current 
luminosity  L0  by  this  process,  it  could  last  a  time  of  0.007  MQc  / 

i  o  11 

L0  =  3. 2x10  s  ^  10  years,  but  since  the  nuclear  reactions  are  limited  to  the 
core  of  the  Sun,  the  nuclear  lifetime  is  closer  to  1010  years. 
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Example:  Binding  energy  of  a  deuteron 

A  deuteron,  denoted  H  or  ~D,  is  the  nucleus  of  the  deuterium  atom,  and 
consists  of  one  proton  and  one  neutron.  The  mass  of  the  deuterium  atom, 
which  is  the  mass  of  the  deuteron  for  the  accuracy  needed,  is 
mD  =  2.013553  u,  and  the  mass  of  the  proton  and  neutron  are,  respectively, 
mP  =  1.007276  u  and  mn  =  1.008665  u,  where  the  atomic  mass  unit 
u  =  1.66053886  x  10-27  kg.  The  mass  defect  AM  =  mP  +  mn—mD  = 
0.002388  u  =  3.9654  x  10-30  kg,  and  the  binding  energy  EB  =  AMc2  = 
3.56  x  10-13  J  =  2.224  MeV,  where  the  speed  of  light  c  =  2.9979  x 
108  m  s'1  and  1  J  =  6.24150974  x  1012  MeV. 


The  binding  energy  and  binding  energy  per  nucleon  for  several  nuclei  are  given 
in  Table  8.2. 

As  suggested  by  Table  8.2  and  Fig.  8.2,  the  binding  energy  per  nucleon 
/  =  Eb/A  for  cosmically  abundant  elements  exhibits  a  nearly  smooth  increase  with 
increasing  A  up  until  iron,  with  a  steady  decrease  beyond  that.  This  means  that 
binding  energy  is  released  in  the  fusion  of  very  light  nuclei  into  somewhat  heavier 
nuclei,  and  that  iron  is  the  most  tightly  bound,  abundant  nucleus.  The  reason  the 
trend  reverses  after  iron  is  the  growing  positive  charge  of  the  nuclei.  It  means  that 
you  cannot  gain  energy  by  synthesizing  elements  heavier  than  iron  inside  stars. 
Energy  can  only  be  released  from  these  very  heavy  nuclei  by  nuclear  fission  into 
intermediate-mass  nuclei. 

The  calculation  of  thermonuclear  reaction  rates  is  enormously  complex,  for  it 
depends  upon  the  specific  reaction,  whether  or  not  there  is  a  resonance  in  the 
reaction  cross  section,  and  on  accelerator  measurements  of  the  reaction,  carried  out 
for  decades  by  William  A.  “Willy”  Fowler  (1911-1995)  and  his  colleagues; 
Fowler  was  awarded  the  1983  Nobel  Prize  in  Physics  for  his  theoretical  and 
experimental  studies  of  nuclear  reactions  of  importance  in  the  formation  of  the 


Table  8.2  Binding  energy,  EB,  and  binding  energy  per  nucleon,  /=  EB/A,  for  some  nuclei  of 
atomic  mass  number  Aa 


Nucleus 

Symbol 

A 

Eb  (MeV) 

/  =  Eb/A  (MeV) 

Proton 

'H 

1 

0.0000136 

0.0000136 

Deuteron 

2H  or  2D 

2 

2.22452 

1.11226 

Tralphium 

3He 

3 

7.7181 

2.5727 

Helium 

4He 

4 

28.3007 

7.0752 

Carbon 

12C 

12 

92.1617 

7.6801 

Oxygen 

16o 

16 

127.619 

7.9762 

Iron 

56Fe 

56 

492.258 

8.7903 

Uranium 

238U 

238 

1801.7688 

7.5704 

a  The  number  preceding  the  letter  symbol  of  the  nucleus  is  the  atomic  mass  number  A,  the 
number  of  nucleons 
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chemical  elements  in  the  universe.  The  fascinating  details  of  the  calculations  can 
be  found  in  Clayton  (1968,  1984)  and  Rolfs  and  Rodney  (1988),  with  summaries 
of  the  various  reactions  in  Lang  (1999).  The  details  for  non-resonant  reactions  are 
discussed  in  Focus  8.2. 


Focus  8.2  Non-resonant  thermonuclear  reaction  rates 

All  the  nuclei  in  a  star  are  positively  charged,  and  they  must  tunnel  through 
each  other’ s  Coulomb  barrier  of  electrical  repulsion  for  fusion  to  occur,  even 
at  the  high  temperatures  within  stars.  Quantum  mechanical  considerations 
show  that  in  the  absence  of  resonances  the  cross  section  for  the  tunneling, 
c j(E ),  can  be  written  (Gamow  1928;  Gurney  and  Condon  1928): 


(8.26) 


where  the  kinetic  energy,  E ,  of  two  nuclei  of  masses  Mx  and  M2  in  their 
center-of-mass  system  is: 


[iV2  _  M\M2  fv2\ 
-  mx  +m2  vT )  ’ 


(8.27) 


for  a  reduced  mass  /u  =  M1M2/(M1  +  M2)  and  relative  velocity  V  of  the  two 
nuclei.  The  Gamow  energy  EG  is  given  by: 


Eg  =  ( 7uxZiZ2)22fic 2 


0.98948 Z{Z2A112 


MeV. 


(8.28) 


The  fine  structure  constant  a  =  e2/(2s0hc )  =  7.29735  x  10-3  = 
[137.0356]-1,  for  elementary  charge  e ,  electric  constant  Planck  constant 
h  and  speed  of  light  c,  the  charges  of  the  two  nuclei,  in  units  of  the  proton 
charge,  are  Zx  and  Z2,  and  the  reduced  nuclear  mass  number 
A  =  A\A2/ {A i  +  A2),  where  the  mass  number  Ax  denotes  the  number  of 
nucleons,  or  the  number  of  protons  plus  the  number  of  neutrons,  in  nucleus  1 . 
For  the  fusion  reaction  of  two  protons,  Zx  =  Z2  =  1,  the  /u  =  mP/ 2  for  proton 
mass  mP,  the  A  =  0.5,  the  proton  rest  energy  mPc  =  938.2723  MeV,  and  the 
Gamow  energy  is  EG  =  (71  / 137)2  x  938.2723  =  0.494  MeV  «  0.5  MeV  = 
500  keV. 

For  most  nuclear  reactions  in  stars,  the  strength  factor  S(E)  is  between 
10  MeV-barns  and  1  keV-barns,  but  it  usually  cannot  be  calculated  from 
theoretical  considerations.  Far  from  a  nuclear  resonance,  the  factor  S(E)  is  a 
slowly  varying  function  of  E  and  may  conveniently  be  expressed  as  a  power 
series  expansion: 


S(E)  =  S0 


(8.29) 
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References  to  the  experimental  measurements  of  the  constants  in  this 
expression  are  provided  in  Lang  (1999). 

For  a  number  density  N\  and  N2  of  the  reacting  nuclei,  the  reaction  rate  r12 
is  given  by: 


ri2  =  N\N2  <  oV  >  m  3s  1 


(8.30) 


where  the  Gamow  tunneling  cross  section  is  combined  with  the  Maxwellian 
speed  distribution  at  temperature,  T,  to  obtain: 


oV) 


8  \ 


1/2 


oo 


7 1/.1 ) 


>0 


(kT) 


3/2 


exp 


/ -E ' 


o 


i 


kT 


exp 


a) 


1/2' 


The  integration  is  over  the  function: 


f(E)  =  exp 


(#) 


exp 


'tf 


(8.31) 


(8.32) 


where  the  first  term  is  due  to  the  fall  in  the  number  of  particles  with 
increasing  energy,  found  in  the  Maxwellian  distribution,  and  the  second  term 
expresses  the  exponential  rise  in  tunneling  probability  with  increasing 
energy.  The  overlap  of  these  two  terms,  previously  shown  in  Fig.  8.1,  defines 
the  region  of  energy  in  which  the  nuclear  reactions  occur. 

Most  of  the  reactions  take  place  at  the  effective  thermal  energy,  E0 ,  or 
Gamow  peak,  given  by  Fowler  and  Hoyle  (1964): 

E0=Epj  Eq3  =  0.1220(ZjZ|y4)1/37’g/3  MeV  (8.33) 

where  T9  is  the  temperature  in  billions  of  K,  or  T9  =  Tj  109.  For  light  nuclei 
and  temperatures  of  some  tens  of  millions  of  degrees,  the  most  effective 
energy  E0  is  usually  10  to  30  keV.  This  energy  is  greater  than  kT  =  86  T9 
keV,  reflecting  the  fact  that  the  particles  in  the  high-speed  tail  of  the  Max¬ 
wellian  distribution  contribute  to  the  reactions.  For  the  proton-proton  reac¬ 
tion  at  the  center  of  the  Sun,  Z x  =  Z2  =  1,  the  A  =  0.5,  and  T9  =  0.0156  for 
a  central  temperature  of  15.6  million  K,  with  kT  =  1.3  keV,  so  the 
Eq  «  0.006  MeV  =  6  keV  for  this  solar  reaction. 

Nuclei  with  energies  close  to  E0  and  spread  over  the  energy  range  AEq 
contribute  mainly  to  the  total  rate  of  the  thermonuclear  reactions,  where  AE0 
is  given  by  full  width  to  half  maximum  of  the  function  f  (E),  or 

AEq  =  4  f- yf)  «  0.65£'//6(^7’)5/6«  0.237 (z\z\A)  l/6T, 59/6  MeV. 

(8.34) 
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The  spread  AE0  is  usually  between  4  and  10  keV,  just  a  bit  smaller  than 
Eq.  For  the  proton-proton  fusion  reaction,  the  two  are  about  equal. 

The  reaction  rate  is  therefore  approximated  by  (Burbidge  et  al.  1957; 
Fowler  et  al.  1975;  Clayton  1968,  1984;  Rolfs  and  Rodney  1988) 


Y\2  —  N\  N2 


1/2  SqAEq 
(fer)3/2 


exp 


3Eo 

kT 


or  equivalently 


r  12 


«  0.65 NxN2 


1/2 


C  Z7I/6 

(1 kT )2/3 


exp 


(8.35) 


(8.36) 


For  the  fusion  of  two  protons,  S0  =  3.89  x  10-25  MeV 
barns  =  3.89  x  10-53  MeV  m2  (Kamionkowski  and  Bahcall  1994)  and  for 
the  Sun  center  where  T  =  15.6  million  K,  this  reaction  has  a  rate  of 
r12  ~  10-49  Ni  N2  m3  s_1  «  1015  m3  s_l,  where  N\  =  N2  ~  1032  m-3. 

The  energy  generation,  e12,  or  the  power  generated  per  unit  mass,  for  a 
reaction  that  generates  an  energy  Q  is 


£12 


r_nQ 

P 


Js  'kg 


(8.37) 


where  p  is  the  mass  density  at  the  reaction  site.  In  the  core  of  the  Sun,  the 
complete  chain  of  thermonuclear  reactions  releases  a  Q  «  26  MeV  «  4.2 
x  10-12  J  and  s12  ~  10-3  J  s_1  kg-1. 


8.3.3  Hydrogen  Burning 

The  detailed  nuclear  reactions  inside  stars  could  not  be  understood  until  the  1930s 
when  several  subatomic  particles  were  known,  including  the  neutron,  discovered 
in  1932,  the  positron,  detected  in  cosmic  ray  showers  in  1932,  and  the  neutrino, 
hypothesized  in  1933. 

The  German  physicist  Carl  Friedrich  von  Weizsacker  (1912-2007)  proposed 
that  the  solution  to  the  solar-energy  problem  lay  in  the  fusion  of  protons,  which 
Atkinson  previously  suggested  (Weizsacker  1937;  Atkinson  1936).  Then  Gamow, 
who  had  immigrated  to  the  United  States,  suggested  to  one  of  his  graduate  stu¬ 
dents,  Charles  Critchfield  (1910-1994),  that  he  calculate  the  details  of  the  reaction. 
The  results  were  sent  to  the  German-born  American  physicist  Hans  A.  Bethe 
(1906-2005)  at  Cornell  University,  who  found  them  to  be  correct,  and  the  two 


232 


8  What  Makes  the  Sun  Shine? 


published  a  joint  paper  titled  “The  Formation  of  Deuterons  by  Proton  Combina¬ 
tion”  (Bethe  and  Critchheld  1938). 

In  April  1939,  Gamow,  who  was  teaching  physics  at  George  Washington 
University  in  Washington,  DC,  organized  a  conference  to  bring  astronomers  and 
physicists  together  to  discuss  the  problem  of  stellar  energy  generation,  under  the 
sponsorship  of  the  Department  of  Terrestrial  Magnetism  of  the  Carnegie  Institu¬ 
tion.  At  this  conference,  the  astronomers  told  the  physicists  what  they  knew  about 
the  internal  constitution  of  stars,  which  was  quite  a  bit. 

By  then,  it  was  known  that  the  lightest  element,  hydrogen,  is  by  far  the  most 
abundant  element  in  the  outer  atmosphere  of  the  Sun  (Unsold  1928;  McCrea  1929; 
Russell  1929),  as  well  as  its  interior  (Stromgren  1931,  1932).  At  Gamow’s  con¬ 
ference,  the  Danish  astronomer  Bengt  Stromgren  (1908-1987)  additionally 
reported  that  because  the  Sun  was  predominantly  hydrogen,  it  would  have  a  central 
temperature  of  about  15  million  K  rather  than  40  million  K,  as  estimated  by 
Eddington  under  the  assumption  that  the  Sun  had  approximately  the  same 
chemical  composition  as  the  Earth,  with  a  preponderance  of  heavy  elements  rather 
than  hydrogen.  The  lower  temperature  meant  that  the  calculations  of  Bethe  and 
Critchheld  correctly  predicted  the  Sun’s  luminosity.  Bethe,  who  attended  the 
conference,  was  so  stimulated  by  the  meeting  that  within  six  months  he  had 
published  a  paper  titled  “Energy  Production  in  Stars,”  which  explains  how  the  Sun 
fuses  hydrogen  into  helium,  releasing  energy  to  heat  the  Sun’s  core  and  generate 
the  radiation  that  makes  it  shine  (Bethe  1939,  1967). 

At  about  the  same  time,  Weizsacker  showed  how  other  nuclear  reactions  could 
fuel  stars  using  carbon  as  a  catalyst  in  the  synthesis  of  helium  from  hydrogen,  but 
he  did  not  investigate  the  rate  of  energy  production  or  its  temperature  dependence, 
which  Bethe  subsequently  did  (Bethe  1939;  Weizsacker  1938).  He  eventually 
received  the  Nobel  Prize  in  Physics,  in  1967,  for  his  contributions  to  the  theory  of 
nuclear  reactions,  especially  his  discoveries  concerning  the  energy  production  in 
stars. 

The  sequence  of  nuclear  reactions  that  make  the  Sun  shine  is  called  the  proton- 
proton  chain  because  it  begins  with  the  fusion  of  two  protons.  The  complete  chain 
of  nuclear  reactions  also  is  known  as  the  hydrogen-burning  reaction  -  for  it  is 
hydrogen  nuclei,  protons,  which  are  being  consumed  to  make  helium.  However,  it 
is  not  combustion  in  the  ordinary  chemical  sense;  in  the  proton-proton  chain 
(Fig.  8.3),  four  protons  are  fused  together  to  form  a  helium  nucleus  that  contains 
two  protons  and  two  neutrons. 

Nuclear  reactions  often  are  written  in  shorthand  notation  using  letters  to  denote 
the  nuclei  and  other  subatomic  particles.  An  arrow  — ►  specifies  the  reaction. 
Nuclei  on  the  left  side  of  the  arrow  react  to  form  the  products  given  on  the  right 
side.  The  amount  of  energy  released  during  the  reaction  can  also  be  given  on  the 
far  right  side  of  the  arrow,  and  is  often  specified  in  units  of  MeV  where 

1  3 

1  MeV  =  1.692  x  10  J.  A  letter  and  a  preceding  superscript  designate  a 
nucleus.  For  historical  reasons,  the  nuclei  of  the  hydrogen  isotopes  H,  “H,  and  H, 
also  are  named  protons,  deuterons  and  tritons,  and  the  nucleus  of  4He  is  called  an 
alpha  particle.  A  Greek  letter  y  denotes  energetic  gamma-ray  radiation.  A  positron 
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Fig.  8.3  Proton-proton  chain  Hydrogen  nuclei,  or  protons,  are  fused  together  to  form  helium 
nuclei  within  the  solar  core,  providing  the  Sun’s  energy.  In  1939,  the  German-bom  American 
physicist  Hans  Bethe  (1906-2005)  described  the  detailed  sequence  of  nuclear-fusion  reactions, 
called  the  proton-proton  chain.  It  begins  when  two  protons,  here  designated  by  the  letter  *H, 
combine  to  form  the  nucleus  of  a  deuterium  atom,  the  deuteron  that  is  denoted  by  D,  together 
with  the  emission  of  a  positron,  denoted  by  e+,  and  an  electron  neutrino,  designated  by  ve. 
Another  proton  collides  with  the  deuteron  to  make  a  nuclear  isotope  of  helium,  denoted  by  He, 

a  #  Q 

and  then  a  nucleus  of  helium,  designated  by  He,  is  formed  by  the  fusion  of  two  He  nuclei, 
returning  two  protons  to  the  gas.  Overall,  this  chain  fuses  four  protons  together  to  make  one 
helium  nucleus.  Even  in  the  hot,  dense  core  of  the  Sun,  only  rare,  fast-moving  particles  can  take 
advantage  of  the  tunnel  effect  and  fuse  in  this  way 

is  denoted  by  e+  and  an  electron  is  designated  by  e~ .  The  symbol  denotes  an 
electron  neutrino. 

The  superscript  that  precedes  a  nucleus  letter  is  the  mass  number,  A,  which  is 
the  sum  of  the  neutrons  and  the  protons  and  the  total  number  of  nucleons  in  the 
nucleus.  Different  isotopes  of  an  element  have  the  same  number  of  protons  and  the 
same  letter  symbol  but  a  different  number  of  neutrons  and  a  different  superscript. 
For  instance,  a  rare  isotope  of  helium,  designated  He,  has  two  protons  and  one 
neutron  in  its  nucleus,  whereas  the  nucleus  of  the  common  form  of  helium,  4He, 
has  two  protons  and  two  neutrons. 

In  the  first  step  of  the  proton-proton  chain,  two  protons,  each  designated  by 
either  ]H  or  p,  meet  head  on  and  merge  into  each  other,  tunneling  through  the 
electrical  barrier  separating  them.  The  two  protons  combine  to  make  a  deuteron, 
D,  the  nucleus  of  a  heavy  form  of  hydrogen  known  as  deuterium. 

Because  a  deuteron  consists  of  one  proton  and  one  neutron,  one  of  the  protons 
must  be  neutralized.  It  is  turned  into  a  neutron,  n ,  with  the  ejection  of  a  positron, 
e+ ,  to  carry  away  the  proton’s  charge,  and  an  electron  neutrino,  ve,  to  balance  the 
energy  in  the  reaction.  This  is  the  positive  beta  decay  reaction,  denoted  by: 

p  — >  n  +  e+  H-  ve,  (8.38) 

which  applies  to  only  one  of  the  two  protons  making  the  deuteron.  The  initiating 
proton-proton  reaction  that  involves  both  protons  therefore  is  written: 

p  H-  p  — *  +  e+  +  Vg,  (8.39) 

which  releases  0.425  MeV  in  energy. 
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Each  proton  inside  the  Sun  is  involved  in  a  collision  with  other  protons  millions 
of  times  every  second,  but  only  exceptionally  hot  ones  can  tunnel  through  their 
electrical  repulsion  and  fuse  together.  Only  one  collision  in  every  ten  trillion 
trillion  initiates  the  proton-proton  chain. 

The  electron  neutrinos  produced  in  the  first  step  of  the  proton-proton  chain 
escape  from  the  Sun  without  reacting  with  matter,  carrying  energy  away.  How¬ 
ever,  the  positron  or  positive  electron  -  the  anti-matter  particle  of  the  electron  -  is 
consumed  immediately.  Anti-matter  and  matter  cannot  coexist.  As  soon  as  any 
anti-matter  is  produced,  it  is  immediately  wiped  out  of  existence  by  colliding  with 
an  electron,  and  the  reason  why  we  live  in  a  material  world  is  simply  because  there 
is  more  matter  than  anti-matter. 

The  positron,  e+,  created  in  the  first  step  of  the  proton-proton  chain  almost 
instantly  encounters  a  free  electron,  e~ ,  and  both  become  pure  energy.  The  two 
subatomic  particles  collide  and  annihilate  one  another  in  a  flash  of  radiation  at 
gamma  ray  wavelengths,  denoted  y.  This  energy-producing  pair-annihilation 
reaction  can  be  written  symbolically  as: 

e~  +  e+  — >  2 y,  (8.40) 

where  each  gamma-ray  photon  has  an  energy  of  0.511  MeV,  corresponding  to  the 
rest  mass  energy  of  an  electron,  for  a  total  reaction  energy  release  of  1 .022  MeV. 

The  next  step  follows  with  little  delay.  In  less  than  1  s,  the  deuteron  collides 

o 

with  another  proton  to  form  a  nucleus  of  light  helium,  He ,  and  releases  yet 
another  gamma-ray  photon,  with  about  5.49  MeV  in  energy.  In  symbolic  terms, 
the  second  step  of  the  proton-proton  chain  is  written: 

2D  +  p  -►  3He  +  y.  (8.41) 

This  reaction  occurs  so  easily  that  deuterium  cannot  be  synthesized  inside  stars; 
it  is  consumed  quickly  to  make  heavier  elements. 

In  the  final  part  of  the  proton-proton  chain,  two  such  light  helium  nuclei  meet 
and  fuse  together  to  form  a  nucleus  of  normal  heavy  helium,  4 He,  and  return  two 
protons  to  the  solar  gas.  This  step  takes  about  1  million  years  on  average  and  is 
written: 


3He  +  3 He  ->  4He  +  2 p,  (8.42) 

generating  another  12.86  MeV  of  energy.  This  normal  helium  nucleus  contains 
two  protons  and  two  neutrons.  So,  two  of  the  protons  that  contributed  to  the 
formation  of  helium  were  converted  into  neutrons  by  the  positive  beta-decay 
reaction.  The  total  energy  released  in  the  proton-proton  chain  is  the  sum  of  that 
released  by  all  of  the  contributing  reactions, 


Q  =  2  x  1.022  +  2  x  0.425  +  2  x  5.49  +  12.86  =  26.73  MeV.  (8.43) 
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This  last  reaction  in  the  proton-proton  chain  happens  86  %  of  the  time.  Less 
frequent  terminations  involve  the  interaction  of  light  helium  with  heavy  helium  to 
form  beryllium  (Focus  8.3). 


Focus  8.3  Secondary  nuclear  fusion  reactions  in  the  Sun 

The  proton-proton  chain  will  end  by  making  beryllium  about  14  %  of  the 
time.  A  nucleus  of  light  helium,  He ,  will  fuse  with  a  nucleus  of  heavier 
helium,  He ,  to  form  a  nucleus  of  light  beryllium,  Be ,  according  to  the 
nuclear  fusion  reaction: 


3  He  +  4He  -► 1  Be  +  y,  (8.44) 

where  y  denotes  energetic  gamma-ray  radiation. 

Most  of  the  time,  the  light  beryllium  will  combine  with  an  electron,  e~ ,  to 

q  i 

make  a  nucleus  of  lithium,  Li ,  which  then  joins  a  proton,  //,  to  make  two 
nuclei  of  heavy  helium,  4 He.  The  reactions  are  (Parker  et  al.  1964): 

^Be  T  e  — *  ^Li  vej 

and 

\i  +  XH  4He  +  4He ,  (8.46) 

where  ve  denotes  an  electron  neutrino. 

About  0.02  %  of  the  time,  the  light  beryllium  combines  with  a  proton  to 

o 

make  boron,  B.  The  boron  is  a  radioactive  nucleus  that  decays  in  just  one 

o  , 

second  into  beryllium,  Be ,  together  with  the  emission  of  a  positron,  e  ,  and 
an  electron  neutrino.  The  heavy  beryllium  then  decays  to  make  two  nuclei  of 
heavy  helium,  completing  the  conversion  of  protons  into  helium.  These 
secondary  nuclear  fusion  reactions  are: 

7Be  +  lH  -►  8B  +  y 

and 

8fi  ->  sBe  +  e+  +  ve  (8.48) 

where  the  positron  annihilates  with  an  electron  to  create  a  gamma  ray  y,  and 


(8.47) 


(8.45) 


8S  —  4He  +  AHe 


(8.49) 


A  total  of  six  protons  are  required  to  produce  the  two  He  nuclei  that  go  into 
this  last  reaction,  but  two  protons  are  returned  to  the  solar  interior  to  be  reused 
later.  Because  two  protons  and  a  helium  nucleus  are  produced,  the  net  result  of  the 
proton-proton  chain  is: 


4 p  — »  4He  +  gamma  ray  radiation  +  neutrinos, 


(8.50) 
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which  releases  a  net  energy  of  26.73  MeV  or  4. 28  x  10  J  each  time  it  occurs, 

_  IQ  Q 

using  1  MeV  =  1.602  x  10  J.  That  corresponds  to  the  energy  AE  =  Amc  , 
where  Am  is  the  mass  loss  in  converting  four  protons  into  one  helium  nucleus  and 

o  _ 1 

the  speed  of  light  c  =  2.9979  x  10  m  s  .  The  energy  released  in  each  reaction 

Q  O 

is  very  small,  but  there  are  a  lot  of  them,  about  10  every  second. 

Deducting  the  2  x  0.511  MeV  =  1.022  MeV  from  the  annihilation  of  the  two 
pre-existing  electrons  by  interaction  with  protons,  we  have  an  energy  of 
25.71  MeV,  which  corresponds  to  the  rest  mass  difference  between  four  protons 
and  a  4He  nucleus.  That  is: 


25.71  Mev  =  (4 mP  -  mHe)c 2  =  0.007  (4mPc2) ,  (8.51) 


where  mP  denotes  the  proton  mass,  mHe  designates  the  mass  of  a  helium  nucleus, 
the  speed  of  light  c  =  2.9979  x  108  m  s_1,  the  mPc 2  =  938.2720  MeV  and 

Q 

mHec  =  3727.379  MeV.  Thus,  the  rest-mass-to-energy  conversion  of  the  proton- 
proton  chain  is  0.007  or  0.7  %. 

This  energy  leaves  the  Sun  as  radiation,  and  the  part  of  this  radiation  that 
constitutes  visible  light  is  what  makes  the  Sun  shine.  The  subatomic  energy  that  is 
liberated  also  keeps  the  core  of  the  Sun  hot,  assuring  continuation  of  the  nuclear 
reactions.  The  time,  t,  to  radiate  away  just  10  %  of  the  energy  available  from  this 
source  is: 


T  = 


O.l(O.OO7)M0c: 


L 


3.26  x  1017s  ~  10iU  years 


10 


■© 


(8.52) 


QQ 

where  the  Sun’s  mass  M0  =  1.989  x  10  kg,  the  Sun’s  luminosity 
L0  =  3.828  x  1026  J  s"1,  and  1  year  =  3.1557  x  107  s. 


Example:  The  Sun  is  losing  mass 

The  Sun  shines  by  the  energy  released,  AE,  from  the  mass  loss,  Am,  every 
time  four  protons  are  converted  into  a  helium  nucleus.  That  is,  the  helium 
nucleus  is  slightly  less  massive,  by  a  mere  0.007,  or  0.7  %,  than  the  four 
protons  that  combine  to  make  it,  so  there  is  an  energy,  AE,  released  given  by: 


A E  =  Amc2  =  (4 mp  —  mHe)c 2  =  0.007 (4mp)c2  ~  4.2  x  10  12  J  (8.53) 
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where  the  mass  of  the  proton  is  mp  =  1.6726  x  10  "  kg,  the  mass  of  the 

_ Q  rJ  O  _  1 

helium  nucleus  is  mHe  =  6.644  x  10  kg,  and  c  =  2.9979  x  10°  m  s  is 
the  speed  of  light. 

The  number  of  reactions,  N,  that  must  occur  every  second  to  make  the  Sun 

qz:  _ i 

shine  with  its  present  luminosity  L0  =  3.828  x  10"  J  s  ,  is: 


N  = 


Lr. 


© 


AE 
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10  reactions  per  second, 


and  the  mass  loss,  AM,  from  the  Sun  in  just  one  second  is 


(8.54) 
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Since  1  ton  is  equal  to  1,000  kg,  about  4  million  tons  of  matter  disappears 
from  the  Sun  every  second,  vanishing  to  provide  the  Sun’s  energy.  Provided 
the  Sun  has  been  shining  at  the  same  rate  ever  since  it  formed  4.6  billion 

O  ZT  '~J 

years  ago,  it  has  lost  about  6.2  x  10"  kg,  using  1  year  =  3.1557  xlO  s. 
However,  this  mass  loss  is  trivial  compared  to  the  Sun’s  total  mass  of 
MQ  =  1.989  x  1030  kg.  The  Sun  has  lost  only  0.0003  of  its  original  mass  in 
all  that  time. 

The  time,  t ,  required  to  consume  the  entire  solar  mass  this  way  would  be: 

t  —  =  0.47  x  1021  s  «  1013  years,  (8.56) 

but  since  the  nuclear  reactions  are  confined  to  the  hot,  central  core,  the 
hydrogen  is  depleted  in  a  lifetime  of  about  one-tenth  this  value. 


8.3.4  Why  Doesn’t  the  Sun  Blow  Up? 

Unlike  a  nuclear  bomb  on  the  Earth,  the  temperature- sensitive  reactions  inside  the 
Sun  act  like  a  thermostat,  releasing  energy  in  a  steady,  controlled  manner  at 
exactly  the  rate  needed  to  keep  the  Sun  in  equilibrium  between  the  inward  pull  of 
gravity  and  the  outward  pressure  of  the  moving  subatomic  particles.  If  the  rate  of 
the  thermonuclear  reactions  in  the  central  regions  of  the  Sun  rises  as  the  result  of  a 
temperature  increase,  nuclei  move  faster  and  create  more  pressure.  The  entire  body 
of  the  Sun  then  would  expand  and  thereby  bring  down  its  central  temperature. 
If  the  rate  of  core  nuclear  reactions  were  to  drop,  gravity  would  pull  the  Sun 
inwards,  and  the  resulting  increase  in  central  temperature  would  bring  the  rate  of 
energy  production  back  into  equilibrium.  So  the  pendulum  continues  to  swing 
between  gravity  and  fusion,  with  no  winner.  That  is  how  the  Sun  harnesses  its 
nuclear  energy,  which  it  has  been  doing  for  4.6  billion  years. 


8.4  The  Mystery  of  Solar  Neutrinos 
8.4.1  The  Elusive  Neutrino 

Neutrinos,  or  “little  neutral  ones,”  are  tiny,  invisible  packets  of  energy  with  no 
electrical  charge  and  almost  no  mass,  traveling  at  nearly  the  speed  of  light  (see 
Sect.  7.4).  They  are  produced  in  great  profusion  by  thermonuclear  reactions  in  the 
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Sun’s  core,  removing  substantial  amounts  of  energy  that  is  never  seen  again.  Every 
second,  trillions  upon  trillions  of  the  solar  neutrinos  pass  right  through  the  Earth 
without  even  noticing  that  it  is  there  (Focus  8.4).  At  night,  the  solar  neutrinos 
travel  through  the  Earth  before  passing  through  the  walls  of  our  houses  and  even 
through  our  bodies,  without  us  ever  noticing  them. 


Focus  8.4  Trillions  upon  trillions  of  neutrinos 

The  aggregate  number  of  solar  neutrinos  is  staggering.  Every  time  the 
proton-proton  chain  creates  one  helium  nucleus,  it  releases  an  energy, 
AE  =  Amc2  ~  0.007  ( 4mp )  c 2  =  4.2  x  10“ 12  J,  where  Am  is  the  mass 
difference  between  the  helium  nucleus  and  the  four  protons  that  went  into 
making  it.  The  mass  of  the  proton  is  mp  =  1.6726  x  10  ^  kg,  and  the 
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speed  of  light  c  =  2.9979  x  10  m  s  .  The  energy  released  by  the  Sun’s 
ongoing  nuclear  fusion  reactions  works  its  way  out  of  the  Sun  to  provide  its 
present  luminosity  L0  =  3.828  x  10"  J  s  .  The  total  number  of  helium- 
producing  proton-proton  chains  required  to  fuel  the  Sun’s  energy  every 
second  is  L0/ AE  «  10  ,  and  since  two  neutrinos  are  emitted  every  time 
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one  helium  nucleus  is  made,  we  conclude  that  the  Sun  emits  2  x  10 
neutrinos  every  second.  If  the  Sun  has  been  shining  at  the  same  rate  for  the 
past  4.6  billion  years,  it  has  emitted  an  astonishing  3  x  1055  neutrinos. 

The  number  of  neutrinos  passing  through  the  Earth  each  second  is  less 
than  those  emitted  by  the  Sun,  diminished  by  the  ratio  of  the  Earth’s  cross 
sectional  area  to  the  area  of  a  sphere  with  a  radius  equal  to  the  mean  distance 
from  the  Sun  to  Earth.  Thus,  the  number  of  neutrinos  passing  through  Earth 
per  second  is: 


2 Le\  (  nR\  \ 
AEJ  \4n(De)2) 


(8.57) 


/r 

where  the  radius  of  the  Earth  is  RE  =  6.378  x  10  m  and  the  mean  distance 
between  the  Earth  and  the  Sun  is  D0  =  1  AU  =  1.496  x  10u  m. 

So,  there  are  0.1  million  trillion  trillion  neutrinos  passing  through  the 
Earth  every  second.  To  obtain  the  number  of  neutrinos  passing  through  every 
square  meter  of  the  side  of  the  Earth  facing  the  Sun,  just  divide  by  the 
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Earth’s  area  tzRe  to  get  about  7  x  10  ,  or  700  thousand  billion,  neutrinos 
per  square  meter. 


The  flux  of  solar  neutrinos  expected  at  the  Earth  is  calculated  using  super¬ 
computers  that  culminate  in  the  Standard  Solar  Model  that  describes  the  Sun’s 
luminous  output,  size,  and  mass  at  its  present  age.  Such  calculations  have 
been  developed  and  refined  by  John  N.  Bahcall  (1934-2005)  of  the  Institute 
for  Advanced  Study  at  Princeton  (Bahcall  1964,  1978;  Bahcall  and 
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Pinsonneault  2004),  and  other  astrophysicists  throughout  the  world,  such  as 
Sylvaine  Turck-Chieze  (1951-  )  at  Saclay,  France  (Turck-Chieze  et  al.  1988). 

The  computer  models  always  include  three  basic  assumptions: 

( 1 )  Energy  is  generated  by  hydrogen-burning  reactions  in  the  central  core  of  the  Sun, 
and  there  is  no  mixing  of  material  between  the  core  and  overlying  regions.  The 
nuclear  reaction  rates  depend  on  the  density,  temperature,  and  composition  of  the 
core,  as  well  as  coefficients  extrapolated  from  laboratory  experiments. 

(2)  The  outward  thermal  pressure,  due  to  the  energy-producing  reactions,  just 
balances  the  inward  pressure  due  to  gravity,  thereby  keeping  the  Sun  from 
either  collapsing  or  blowing  up. 

(3)  Energy  is  transported  from  the  deep  interior  to  the  visible  solar  disk  via 
radiation  and  convection  (see  the  next  Sect.  8.5).  The  great  bulk  of  energy  is 
carried  by  radiative  transport  with  an  opacity  determined  from  atomic  physics 
calculations. 

One  begins  with  a  newly  formed  Sun  having  a  uniform  composition,  and  an 
element  abundance  that  is  observed  in  the  visible  solar  disk  today.  The  model  then 
imitates  the  evolution  of  the  Sun  to  its  present  age  of  4.6  billion  years  by  slowly 
converting  hydrogen  into  helium  within  the  model  core.  The  central  nuclear 
reactions  supply  both  the  radiated  luminosity  and  the  local  heat  or  pressure,  while 
also  creating  neutrinos  and  producing  composition  changes  in  the  core.  The  Sun’s 
current  luminosity,  size,  and  neutrino  flux  are  obtained  after  4.6  billion  years  when 
about  37  %  of  the  hydrogen  in  the  core  has  now  been  transformed  into  helium. 
Once  the  Standard  Solar  Model  has  specified  the  neutrino  flux  the  predictions  are 
extended  to  specific  experiments  that  detect  neutrinos  of  different  energies. 


8.4.2  Solar  Neutrino  Detectors  Buried  Deep  Underground 

Unlike  a  conventional  optical  telescope,  which  is  placed  as  high  as  possible  to 
minimize  distortion  by  the  Earth’s  obscuring  atmosphere,  a  solar  neutrino  detector 
is  buried  beneath  a  mountain  or  deep  within  the  Earth’s  rocks  inside  mines.  This 
shields  the  instrument  from  deceptive  signals  caused  by  cosmic  rays.  There, 
beneath  tons  of  rock  that  only  a  neutrino  can  penetrate,  detectors  unambiguously 
measure  neutrinos  from  the  Sun.  If  neutrino  detectors  were  placed  on  the  Earth’s 
surface,  they  would  detect  high-energy  particles  and  radiation  produced  by  cosmic 
rays  interacting  with  the  Earth’s  atmosphere. 

Thus,  solar-neutrino  astronomy  involves  massive  subterranean  detectors  that 
look  right  through  the  Earth  and  observe  the  Sun  at  night  or  day.  The  first  such 
neutrino  telescope,  constructed  in  1967  by  Raymond  Davis,  Jr.  (1914-2006),  was  a 
615-ton  tank  located  1.5  km  underground  in  the  Homestake  Gold  Mine  near  Lead, 
South  Dakota.  The  huge  cylindrical  tank  was  filled  with  378,000  liters  of  cleaning 
fluid,  technically  called  perchloroethylene  or  “perc”  in  the  dry-cleaning  trade; 
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each  molecule  of  the  stain  remover  consists  of  two  carbon  atoms  and  four  chlorine 
atoms  (Davis  1964;  Davis  et  al.  1968). 

Most  solar  neutrinos  passed  through  the  tank  unimpeded.  Occasionally,  how¬ 
ever,  a  neutrino  scored  a  direct  hit  with  the  nucleus  of  a  chlorine  atom,  turning  one 
of  its  neutrons  into  a  proton,  emitting  an  electron  to  conserve  charge,  and  trans¬ 
forming  the  chlorine  atom  into  an  atom  of  radioactive  argon.  The  new  argon  atom 
rebounded  from  the  encounter  with  sufficient  energy  to  break  free  of  the  perc 
molecule  and  enter  the  surrounding  liquid.  Because  argon  is  chemically  inert,  it 
can  be  culled  from  the  liquid  by  bubbling  helium  gas  through  the  tank.  The  number 
of  argon  atoms  recovered  in  this  way  measured  the  incident  flux  of  solar  neutrinos. 

Every  few  months,  Davis  and  his  colleagues  flushed  the  tank  with  helium, 
extracting  about  15  argon  atoms  from  a  tank  the  size  of  an  Olympic  swimming 
pool.  That  was  a  remarkable  achievement  considering  that  the  tank  contained  more 

on 

than  1  million  trillion  trillion,  or  10  ,  chlorine  atoms.  The  scientists  persisted  for 
nearly  thirty  years,  capturing  signs  of  only  2,000  neutrinos  in  all  that  time. 
However,  the  consequences  were  enormous.  The  measurements  implied  not  only 
that  nuclear- fusion  reactions  indeed  were  providing  the  Sun’s  energy,  making  it 
shine,  but  there  also  was  a  small  unexpected  problem  with  the  result  that  led  to  a 
new  understanding  of  the  physics  of  neutrinos. 

The  neutrino  reaction  rate  with  atoms  in  neutrino  detectors  is  so  slow  that  a 
special  unit  was  invented  to  specify  the  experiment- specific  flux.  The  Solar 
Neutrino  Unit,  abbreviated  SNU  and  pronounced  “snew,”  is  equal  to  one  neutrino 
interaction  per  second  for  every  trillion  trillion  trillion,  or  10  ,  atoms.  Even  then, 
the  predictions  were  only  a  few  SNU  per  month  for  even  the  largest,  most-massive, 
detectors  first  constructed. 

The  Homestake  detector  always  yielded  results  in  conflict  with  the  most 
accurate  theoretical  calculations.  The  final  experiment  value  was  2.55  ±  0.25 
SNU,  where  the  zb  value  denotes  an  uncertainty  of  one  standard  deviation 
(Cleveland  et  al.  1998).  (A  standard  deviation  is  a  statistical  measurement  of  the 
uncertainty  of  a  measurement;  a  definite  detection  must  be  above  three  standard 
deviations  and  preferably  above  five.)  In  contrast,  theoretical  results  using  the 
Standard  Solar  Model  predicted  that  the  Homestake  detector  should  have  observed 
a  flux  of  8.5  ±1.8  SNU.  So  the  tank  full  of  cleaning  fluid  captured  almost  one 
third  of  the  expected  number  of  neutrinos  (Fig.  8.4). 

The  discrepancy  between  the  observed  and  calculated  values  is  known  as  the 
Solar  Neutrino  Problem.  Its  significance  was  confirmed  in  2002,  when  Davis 
received  the  Nobel  Prize  in  Physics;  sharing  it  with  Japanese  scientist  Masatoshi 
Koshiba  (1926-  ),  whose  group  used  another  giant,  underground  detector,  named 
Kamiokande,  to  detect  both  solar  and  supernova  neutrinos. 

In  1987,  the  Kamiokande  neutrino  detector  began  to  monitor  solar  neutrinos, 
confirming  the  neutrino  deficit  observed  by  Davis.  This  second  experiment, 
located  in  a  mine  at  Kamioka,  Japan,  consisted  of  a  4,500-ton,  or  4.5  million-liter, 
tank  of  pure  water.  Nearly  1,000  light  detectors  were  placed  in  the  tank  walls  to 
measure  signals  emitted  by  electrons  knocked  free  from  water  molecules  by 
passing  neutrinos. 
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Fig.  8.4  Solar  neutrino  fluxes  Calculated  and  measured  solar  neutrino  fluxes  have  disagreed  for 
several  decades.  The  fluxes  are  measured  in  solar  neutrino  units  (SNU)  which  are  defined  as  1 

o /: 

neutrino  interaction  per  trillion  trillion  trillion,  or  10  ,  atoms  per  second.  Measurements  from  the 
chlorine  neutrino  detector  ( small  dots)  give  an  average  solar  neutrino  flux  of  2.6  d=  0.2  SNU 
(lower  broken  line),  well  below  theoretical  calculations  ( large  dots)  that  predict  a  flux  of  8.5  d= 
1.8  SNU  ( upper  broken  line)  for  the  Standard  Solar  Model.  Other  experiments  also  have  observed 
a  deficit  of  solar  neutrinos,  suggesting  that  either  some  process  prevents  neutrinos  from  being 
detected  or  there  is  an  incomplete  understanding  of  the  nuclear  processes  that  make  the  Sun  shine 

When  an  energetic  solar  neutrino  collides  with  an  electron  in  the  water,  the 
neutrino  knocks  the  electron  out  of  its  atomic  orbit,  pushes  it  forward  in  the  direction 
of  the  incident  neutrino,  and  accelerates  the  electron  to  nearly  the  speed  of  light.  In 
water,  the  electron  moves  faster  than  the  light  it  radiates,  and  as  a  result  the  electron 
produces  a  cone-shaped  pulse  of  light  about  its  path.  The  faint  blue  glow  is  tech¬ 
nically  known  as  Cherenkov  radiation,  named  after  the  Soviet  physicist  Pavel  A. 
Cherenkov  (1904-1958)  who  discovered  the  effect  (Cherenkov  1937). 

The  axis  of  the  light  cone  gives  the  electron’s  direction,  which  is  the  direction 
from  which  the  neutrino  arrived.  Because  the  observed  electrons  were  preferen¬ 
tially  scattered  along  the  direction  of  an  imaginary  line  joining  the  Earth  to  the 
Sun,  the  Kamiokande  water  experiment  also  confirmed  that  the  neutrinos  indeed 
are  produced  by  nuclear  reactions  in  the  Sun’s  core.  After  1000  days  of  obser¬ 
vation,  Yoji  Totsuka  (1942-2008),  speaking  for  the  Kamiokande  collaboration  led 
by  Masatoshi  Koshiba  (1926-  ),  could  therefore  report  that  neutrinos  really  are 
coming  for  the  Sun,  where  nuclear  fusion  reactions  are  taking  place,  while  also 
confirming  the  neutrino  deficit  observed  by  Davis  (Totsuka  1991). 

The  deficit  of  solar  neutrinos  was  subsequently  confirmed  in  the  1990s  by  two 
massive  underground  detectors  using  gallium,  a  rare  and  expensive  metal  used  in 
the  red  lights  of  hand  calculators  and  other  pieces  of  electronic  equipment.  When 
combined  with  the  chlorine  and  the  Kamiokande  results,  all  four  experiments 
seemed  to  have  confirmed  that  solar  neutrinos  are  missing,  and  that  the  Solar 
Neutrino  Problem  is  real. 
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8.4.3  Solving  the  Solar  Neutrino  Problem 

After  almost  40  years  of  meticulous  measurements  and  calculations,  the  neutrino 
count  still  came  up  short!  Massive  underground  detectors  always  observed  fewer 
neutrinos  than  theory  states  they  should  detect,  and  many  scientists  thought  that 
the  discrepancy  was  due  to  an  incomplete  understanding  of  neutrinos  (Bahcall  and 
Bethe  1990). 

As  it  turned  out,  neutrinos  are  transforming  into  a  different  form  during  their 
journey  from  the  center  of  the  Sun,  escaping  detection  by  changing  character.  We 
haven’t  mentioned  it  yet,  but  scientists  have  learned  that  there  are  three  separate 
types,  or  flavors,  of  neutrinos,  each  named  after  the  fundamental,  subatomic 
particle  with  which  it  is  most  likely  to  interact  (Focus  8.5).  All  of  the  neutrinos 
generated  inside  the  Sun  are  electron  neutrinos,  designated  ve;  this  is  the  type  that 
interacts  with  electrons,  denoted  e~.  The  other  two  types,  the  muon  neutrino,  vM, 
and  the  tau  neutrino,  vT,  interact  with  muons,  p,  and  tau  particles,  t,  respectively. 


Focus  8.5  Leptons 

A  lepton  is  an  elementary,  subatomic  particle,  whose  name  comes  from  the 
Greek  word  lepton  meaning  “fine,  small,  thin,  subatomic  or  slender.” 
Altogether  there  are  six  types  of  leptons,  which  are  divided  into  two  classes: 
the  three  charged  leptons  and  the  three  neutral,  or  uncharged  leptons,  known 
as  neutrinos. 

The  electron,  denoted  by  e,  is  the  best-known  lepton  and  the  first  to  be 
discovered.  Unlike  the  stable  electron,  the  other  two  charged  leptons,  the 
muon  denoted  by  p  and  the  tau  particle  designated  t,  are  unstable  subatomic 
particles.  They  are  unfamiliar  to  most  of  us  because  they  die  shortly  after 
birth.  The  muon  decays  into  an  electron,  a  muon  neutrino  and  an  electron 
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anti-neutrino  in  just  2  millionths,  or  2  x  10  ,  of  a  second,  and  the  tau 

_  1  Q 

particle  disappears  just  three-tenths  of  a  million-millionth,  or  3  x  KT1",  of 
a  second  after  it  is  made. 

Every  charged  lepton  has  a  corresponding  antiparticle  of  opposite  charge 
but  equal  mass.  The  antiparticle  of  the  electron  is  thus  known  as  the  positron, 
or  positive  electron.  The  lepton  of  negative  charge  is  denoted  with  a  - 
superscript,  as  e~ ,  p~,  and  t-,  and  the  corresponding  antiparticle  with  the 
positive  charge  is  designated  by  a  +  subscript,  with  e+ ,  p+,  and  t+. 

The  three  electrically  neutral  leptons,  or  neutrinos,  have  a  small  but  non¬ 
zero  mass,  and  rarely  interact  with  anything.  Only  the  weak  subatomic  force 
affects  them,  and  this  weak  interaction  enables  them  to  travel  great  distances 
through  matter  without  being  affected  by  it.  The  three  types,  or  flavors,  of 
neutrinos  are  the  electron  neutrino,  designated  ve,  the  muon  neutrino, 
denoted  by  vM,  and  the  tau  neutrino,  denoted  vT.  Each  type  also  has  a  cor¬ 
responding  antiparticle,  called  an  antineutrino  and  denoted  by  a  bar  above 
the  symbol,  or  by  ve,v^vT. 
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Neutrinos  apparently  have  an  identity  crisis!  Each  type  of  neutrino  is  not 
completely  distinct,  and  the  different  types  can  be  transformed  into  one  another. 
In  the  language  of  quantum  mechanics,  neutrinos  do  not  occupy  a  well-defined 
state;  they  instead  consist  of  a  combination  or  mixture  of  states.  As  neutrinos  move 
through  space,  the  states  come  in  and  out  of  phase  with  one  another,  so  the 
neutrinos  change  form  with  time. 

The  effect  is  called  neutrino  oscillation  because  the  probability  of  metamor¬ 
phosis  between  neutrino  types  has  a  sinusoidal,  in  and  out,  oscillating  dependence 
on  path  length.  The  change  in  identity  is  not  one  way,  for  a  neutrino  of  one  type 
can  change  into  another  kind  of  neutrino  and  back  again  as  it  moves  along.  The 
three  possible  types  of  neutrinos  are  called  electron  neutrinos,  muon  neutrinos,  and 
tau  neutrinos,  each  named  for  the  type  of  particle  it  interacts  with. 

In  1967,  the  Italian  atomic  physicist  Bruno  Pontecorvo  (1913-1993)  proposed 
that  one  type  of  neutrino  might  transform,  or  oscillate,  into  another  type  in  the 
vacuum  of  space,  and  two  years  later,  Pontecorvo  and  Vladimir  Gribov 
(1930-1997),  proposed  that  the  Solar  Neutrino  Problem  could  be  explained  if  solar 
neutrinos  switch  from  electron  neutrinos  to  another  type  as  they  travel  in  the  near 
vacuum  of  space  from  the  Sun  to  Earth,  thereby  escaping  detection  (Gribov  and 
Pontecorvo  1969).  Almost  a  decade  later,  the  American  physicist  Lincoln 
Wolfenstein  (1923-  )  showed  that  the  neutrinos  could  oscillate,  or  change  type, 
more  vigorously  by  interacting  with  matter,  rather  than  in  a  vacuum  (Wolfenstein 
1978),  and  the  Russian  physicists,  Stanislav  P.  Mikheyev  (1940-  )  and  Alexei  Y. 
Smirnov  (1951-  )  subsequently  explained  how  the  matter  oscillations  might 
explain  the  Solar  Neutrino  Problem  (Mikheyev  and  Smirnov  1985). 

The  theory,  named  the  MSW  effect  after  the  first  letters  of  the  last  names  of  the 
scientists  who  developed  it,  proposed  that  the  electron  neutrinos  generated  in  the 
solar  core  could  change  type  on  their  way  out  of  the  Sun,  and  therefore  remain 
invisible  to  the  first  solar  neutrino  detectors. 

Such  a  transformation  was  suggested  first  by  observations  of  nonsolar  neutrinos 
using  the  Super-Kamiokande  detector,  which  replaced  the  older,  nearby  Kami- 
okande  instrument  in  1996.  Super-Kamiokande  can  observe  both  solar  electron 
neutrinos  and  atmospheric  muon  neutrinos.  The  former  are  created  by  nuclear 
fusion  at  the  center  of  the  Sun,  whereas  the  latter  are  created  when  fast-moving 
cosmic  rays  enter  the  Earth’s  atmosphere  from  outer  space.  Solar  electron  neu¬ 
trinos  are  distinguished  by  their  relatively  low  energy,  near  the  5  MeV  lower 
threshold  of  the  detector.  A  high  energy  of  1 ,000  MeV  is  typical  of  an  atmospheric 
muon  neutrino.  Neutrinos  of  higher  energy  produce  a  tighter  cone  of  light,  so  a 
solar  electron  neutrino  makes  a  fuzzy,  blurred  and  ragged  light  pattern,  while  an 
atmospheric  muon  neutrino  produces  a  neat,  sharp-edged  ring  of  light. 

After  monitoring  light  patterns  for  more  than  500  days,  the  Super-Kamiokande 
scientists  reported  that  there  were  roughly  twice  as  many  muon  neutrinos  coming 
from  the  atmosphere  directly  over  the  Super-Kamiokande  detector  than  those 
coming  from  the  other  side  of  the  Earth  (Fukada  et  al.  1998a,  b).  The  muon 
neutrinos  are  produced  in  the  atmosphere  above  every  place  on  our  planet,  but 
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some  of  them  apparently  disappeared  while  traveling  through  the  Earth  to  arrive  at 
the  detector  from  below. 

Subsequent  experiments  using  neutrinos  generated  by  particle  accelerators  on 
the  Earth  confirmed  the  effect  (Eguci  et  al.  2003).  They  suggest  that  although  all 
the  neutrinos  produced  by  nuclear  reactions  in  the  Sun  are  electron  neutrinos,  they 
do  not  stay  that  way.  Nevertheless,  the  terrestrial  neutrinos  did  not  come  from  the 
Sun  and  are  not  directly  related  to  nuclear  fusion  reactions  there.  So  the  solution  to 
the  Solar  Neutrino  Problem  was  not  known  definitely  until  2001,  when  a  new 
underground  solar  neutrino  detector  in  Canada,  the  Sudbury  Neutrino  Observatory, 
demonstrated  that  solar  neutrinos  are  changing  type  when  traveling  to  the  Earth. 

The  Sudbury  Neutrino  Observatory,  abbreviated  SNO  and  pronounced  “snow”, 
is  located  2  km  underground  in  a  working  nickel  mine  near  Sudbury  Ontario.  It  is 
a  water  detector,  but  unlike  Kamiokande  or  Super-Kamiokande,  the  SNO  detector 
contains  heavy  water. 

Heavy  water  is  chemically  similar  to  ordinary  water,  and  it  does  not  appear  or 
taste  any  different.  In  fact,  heavy  water  exists  naturally  as  a  constituent  of  ordinary 
tap  or  lake  water  in  a  ratio  of  about  1  part  in  7,000;  expensive  chemical  and 
physical  processes  can  separate  it. 

The  hydrogen  in  heavy  water  has  a  nucleus,  called  a  deuteron,  which  consists  of 
a  proton  and  a  neutron.  For  ordinary  water,  the  hydrogen  is  about  half  as  light,  with 
a  nucleus  that  contains  only  a  proton  and  no  neutron.  It  is  the  heavier  deuteron  that 
makes  SNO  sensitive  to  not  just  one  type  of  neutrino  but  instead  to  all  three  known 
varieties. 

One  thousand  tons,  or  1  million  liters,  of  heavy  water,  was  placed  in  a  central 
spherical  cistern  with  transparent  acrylic  walls.  A  geodesic  array  of  about  10,000 
photo-multiplier  tubes  surrounds  the  vessel  to  detect  the  flash  of  light  given  off  by 
heavy  water  when  it  is  hit  by  a  neutrino.  Both  the  light  sensors  and  the  central  tank 
are  enveloped  by  a  7,800-ton  jacket  of  ordinary  water  (Fig.  8.5),  to  shield  the 
heavy  water  from  emissions  of  the  underground  rocks.  As  with  other  neutrino 
detectors,  the  overlying  rock  blocks  energetic  particles  generated  by  cosmic  rays. 

The  Sudbury  Neutrino  Observatory  can  be  operated  in  two  modes:  one  sensitive 
only  to  electron  neutrinos  and  the  other  equally  sensitive  to  all  three  types  of 
neutrinos.  Observations  with  both  modes  have  confirmed  that  the  Solar  Neutrino 
Problem  is  caused  by  changes  in  the  neutrinos  as  they  travel  from  the  solar  core. 
When  this  is  taken  into  account,  the  total  number  of  electron  neutrinos  produced  in 
the  Sun  is  as  predicted  (McDonald  2005).  Haxton  et  al.  (2013)  have  provided  a 
recent  review  of  solar  neutrinos. 


8.5  How  the  Energy  Gets  Out 

All  of  the  Sun’s  nuclear  energy  is  created  deep  down  inside  its  high-temperature 
core,  and  no  energy  is  created  in  the  cooler  regions  outside  of  it.  The  energy¬ 
generating  core  extends  to  about  one  quarter  of  the  distance  from  the  center  of  the 
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Fig.  8.5  How  Sudbury  works  Neutrinos  from  the  Sun  travel  through  more  than  2  km  of  rock, 
entering  the  acrylic  tank  of  the  Sudbury  Neutrino  Observatory,  which  contains  1,000  tons 
(1  million  liters)  of  heavy  water.  When  one  of  these  neutrinos  interacts  with  a  water  molecule,  it 
produces  a  flash  of  light  that  is  detected  by  a  geodesic  array  of  photo-multiplier  tubes.  Some 
7,800  tons  (7.8  million  liters)  of  ordinary  water  surrounding  the  acrylic  tank  blocks  radiation 
from  the  rock,  and  the  overlying  rock  blocks  energetic  particles  generated  by  cosmic  rays  in  our 
atmosphere.  The  heavy  water  is  sensitive  to  all  three  types  of  neutrinos 

Sun  to  the  visible  solar  disk,  accounting  for  only  1.6  %  of  the  Sun’s  volume. 
However,  about  half  of  the  Sun’s  mass  is  packed  into  its  dense  core. 

Because  we  cannot  see  inside  the  Sun,  astronomers  combine  basic  theoretical 
equations,  such  as  those  for  equilibrium  and  energy  generation  or  transport,  with 
observed  boundary  conditions,  such  as  the  Sun’s  mass  and  luminous  output,  to 
create  models  of  the  Sun’s  internal  structure.  These  models  consist  of  two  nested 
spherical  shells  that  surround  the  hot,  dense  core  (Fig.  8.6). 

The  innermost  shell,  called  the  radiative  zone,  extends  from  the  core  to  71.3  % 
of  the  Sun’s  radius.  As  the  name  implies,  energy  moves  through  this  region  by 
radiation.  The  outermost  layer  is  known  as  the  convective  zone,  where  energy  is 
transported  in  a  churning,  wheeling  motion  called  convection. 
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Fig.  8.6  Anatomy  of  the  Sun  The  Sun  is  an  incandescent  ball  of  ionized  gas  powered  by  the 
fusion  of  hydrogen  in  its  core.  As  shown  in  this  interior  cross-section,  energy  produced  by  nuclear 
fusion  is  transported  outward,  first  by  countless  absorptions  and  emissions  within  the  radiative 
zone  and  then  by  convection.  The  visible  disk  of  the  Sun,  called  the  photosphere ,  contains  dark 
sunspots,  which  are  Earth-sized  regions  of  intense  magnetic  fields.  A  transparent  atmosphere 
envelops  the  photosphere,  including  the  low-lying  chromosphere  with  its  jet-like  spicules  and  the 
1  million-degree  corona  that  contains  holes  with  open  magnetic  fields,  the  source  of  the  high¬ 
speed  solar  wind.  Loops  of  closed  magnetic  fields  constrain  and  suspend  the  hot  million-degree 
gas  within  coronal  loops  and  cooler  material  in  prominences 


Radiation  does  not  move  quickly  through  the  solar  interior.  A  single  gamma  ray 
produced  by  nuclear  fusion  in  the  core  of  the  Sun  cannot  move  even  a  fraction  of  a 
millimeter  before  encountering  a  subatomic  particle,  where  the  radiation  is  scat¬ 
tered  or  absorbed  and  reemitted  with  less  energy.  This  radiation  quickly  interacts 
with  another  particle  in  the  radiative  zone  and  is  eventually  reradiated  at  yet  lower 
energy.  The  process  continues  again  countless  times  as  the  radiation  moves  out¬ 
ward  on  a  haphazard,  zigzag  path,  steadily  losing  photon  energy  at  each  encounter. 


Example:  Scattering  of  radiation  inside  the  Sun 

Free  electrons,  which  are  not  attached  to  atoms,  scatter  radiation  with  a 
Thomson  scattering  cross  section,  aT,  given  by  Thomson  (1903)  (Sect.  2.7): 

2 

=  6.6525  x  10-29  m2,  (8.58) 

where  re  =  2.8179  x  1 0  1 5  m  is  the  classical  electron  radius,  e  =  1.6022  x 
10“ 19  C  is  the  fundamental  unit  of  charge,  s0  =  8.854  x  10  12  F  m  1  is  the 

o  _ 1 

electric  constant,  and  c  =  2.9989  x  10°  m  s  is  the  speed  of  light. 
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A  radiation  photon  moving  into  plasma  with  an  electron  number  density 
of  Ne  will  travel  a  mean  free  path,  /,  before  encountering  an  electron,  where 
(Clausius  1858)  (Sect.  5.2) 


1 

Neoj 


(8.59) 


Hydrogen  is  by  far  the  most  abundant  element  in  the  Sun,  and  throughout 
the  solar  core  and  radiation  zone  it  is  completely  ionized  into  free  protons 
and  free  electrons.  In  the  solar  core,  the  mass  density 

_ Q 

p  «  1.5  x  10  kg  m  ,  and  since  it  is  the  protons  that  contribute  the  vast 
majority  of  this  mass  the  proton  number  density  is  NP  =  p/mP  k,  10  m_  , 

_ 0^7 

where  the  proton  mass  mP  =  1.6726  x  10  “  kg.  (The  proton  is  1836  times 
more  massive  than  the  electron,  so  we  can  ignore  the  electrons’  contribution 
to  the  mass.)  That  is,  there  are  100  million  trillion  trillion  protons  per  cubic 
meter  at  the  center  of  the  Sun,  so  it  is  not  surprising  that  they  don’t  move 
very  far  before  colliding  with  one  another.  Since  each  former  hydrogen  atom 
contained  one  proton  and  one  electron  NP  =  Ne.  Within  the  core,  a  photon 
will  only  travel  a  length  /  =  1  /(NeaT)  ^  1.5  x  10-4  m  before  encountering 
an  electron.  Further  out,  the  mass  density  is  lower,  but  even  at  the  mean  solar 

_ Q 

mass  density  of  1,409  kg  m  ,  we  still  obtain  average  values  of 

9Q  O 

Ne  *  8.4  x  10"  m  and  /  =  0.018  m.  If  the  radiation  was  always  headed 
straight  out  of  the  Sun,  it  would  make  about  2  x  1010,  or  20  billion,  colli¬ 
sions  in  working  its  way  across  the  radiation  zone,  whose  thickness  is  about 

o 

half  the  solar  radius,  or  about  3.5  x  10°  m. 

Each  photon  will,  however,  be  scattered  in  a  random  direction,  which  will 
generally  be  different  from  the  outward  direction.  The  decrease  in  temper¬ 
ature  with  increasing  distance  from  the  Sun  center  still  assures  that  more 
radiation  moves  outward  than  inward,  just  as  heat  normally  flows  from  a 
hotter  region  to  a  colder  one.  The  radiation  therefore  follows  the  path  of  least 
resistance,  heading  for  regions  of  lower  density  and  temperature. 

The  total  time  for  the  radiation  to  diffuse  through  the  radiation  zone  is  a 
random  walk  problem,  with  a  lengthening  step,  or  path  length,  at  larger 
distances  from  the  solar  center.  The  photon  spends  most  of  its  time  close  to 
the  core  where  the  mean  free  path  is  shortest,  and  the  average  step  length 
required  to  reach  the  inner  edge  of  the  convective  zone  is  /  =  9.0  x  10  4  m. 
The  diffusion  time  for  the  radiation  to  move  from  the  bottom  to  the  top  of  the 
radiative  zone  is  about  170,000  years  (Mitalas  and  Sills  1992). 


As  a  result  of  this  continued  ricocheting  and  innumerable  collisions  in  the 
radiative  zone,  it  takes  about  170,000  years,  on  average,  for  radiation  to  work  its 
way  out  from  the  Sun’s  core  to  the  bottom  of  the  convective  zone  (Mitalas  and 
Sills  1992),  where  the  temperature  has  become  cool  enough  for  heavy  nuclei  to 
capture  electrons  and  form  atoms  that  absorb  radiation.  These  atoms  block  the 
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outward  flow  of  radiation  like  dirt  on  a  window,  and  the  radiation  heats  the  bottom 
of  the  convective  zone. 

This  material  becomes  hotter  than  it  otherwise  would  be,  and  it  must  find  a  way 
to  release  the  pent-up  energy.  In  response  to  heating  from  below,  gases  in  the 
bottom  layer  of  the  convective  zone  expand,  thereby  becoming  less  dense  than  the 
gas  in  the  overlying  layers.  Due  to  its  low  density,  the  heated  material  rises  to  the 
visible  solar  disk  in  about  10  days  and  then  cools  by  radiation.  The  cooled  gas  then 
sinks  because  it  is  denser  than  the  hotter  gas,  only  to  be  reheated  and  rise  again 
(Focus  8.6).  Such  convective  motions  can  occur  whenever  a  layer  of  fluid  is  heated 
from  below  (Jeffreys  1926),  as  in  a  kettle  of  boiling  water  or  a  simmering  pot  of 
oatmeal,  with  hot  rising  bubbles  and  cooler  sinking  material. 


Focus  8.6  Convection 

Convection  is  a  method  of  transferring  heat  from  hotter  to  cooler  regions 
within  a  gas  or  liquid,  and  when  it  occurs  in  a  star,  energy  is  transported 
within  the  stellar  interior  by  a  wheeling  gas  motion.  When  a  region  of  high 
density  is  displaced  upward  into  a  region  of  lower  density  and  pressure, 
convection  will  take  place  if  the  displaced  volume  expands  and  becomes  less 
dense  than  its  surroundings.  It  will  then  continue  to  be  buoyed  up  like  a 
balloon  or  bubbles  in  a  boiling  pot  of  water. 

For  an  adiabatic  expansion  in  which  no  heat  is  exchanged  with  the  new 
surroundings,  convection  will  occur  if  the  structural  temperature  gradient  of 
the  star  is  greater  than  the  adiabatic  gradient,  or  when  (Schwarzschild  1906): 


> 


y  -  1  T(r)  dP 
y  P(r )  dr  ’ 


(8.60) 


where  the  adiabatic  index  y  =  5/3  for  ionized  hydrogen,  T(r)  is  the  gas 
temperature  at  radius  r,  the  dT / dr  is  the  gas  temperature  gradient  in  the 
radial,  r,  direction,  and  P(r)  is  the  gas  pressure  at  radius  r  given  by  the  ideal 
gas  law,  in  which  P  =  constant  x  pT,  for  a  mass  density  p. 

The  structural  temperature  gradient  in  a  star  is  given  by  Eddington  (1917) 


dT{r)  _  3  KpLjr) 

dr  4ac  r3  4nr2  ’  1  ’ 

where  the  radiation  constant  a  =  7.5657  x  10  16  J  m  3  K  4,  the  Rosseland 
mean  opacity  is  k,  and  L(r)  is  the  radiation  luminosity  at  radius  r. 

If  the  temperature  in  a  star  falls  fast  enough  with  increasing  radius, 
convection  sets  in.  This  happens  in  the  outer,  cooler  layers  of  stars  like  the 
Sun,  where  the  opacity  from  heavy  elements  becomes  high  enough  to  pro¬ 
duce  a  steep  temperature  gradient. 

In  stars  that  are  more  massive  than  the  Sun,  the  nuclear  fusion  of 
hydrogen  into  helium  occurs  by  the  CNO  cycle  rather  than  the  proton-proton 
chain  (see  Sect.  10.3),  and  the  CNO  process  is  a  very  sensitive  function  of 
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temperature.  As  a  result,  the  centers  of  these  stars  are  very  hot,  but  the 
temperature  falls  off  rapidly  with  distance  from  the  center  and  convection 
occurs  in  the  stellar  core. 

Chandrasekhar  (1961)  has  written  a  comprehensive  book  that  includes 
convective  instability  of  a  layer  of  gas  heated  from  below.  The  conditions  for 
the  instability  can  be  expressed  in  terms  of  a  Rayleigh  number  (Rayleigh 
1916).  Convective  energy  transport  in  stars  is  described  by  the  mixing-length 
theory  of  convection  (Bohm-Vitense  1953,  1958),  Schwarzschild  (1958), 
and  standard  textbooks  of  stellar  astrophysics).  Galloway  and  Weiss  (1981), 
Wilson  (1966,  1978),  and  Spiegel  (1971)  have  also  discussed  convection  in 
stars. 


The  convective  zone  is  capped  by  the  photosphere,  the  place  where  the  gaseous 
material  changes  from  being  completely  opaque  to  being  transparent  to  radiation. 
In  the  photosphere,  a  process  of  absorption  and  reemission  of  radiation  carries  the 
Sun’s  energy  out.  Rupert  Wildt  (1905-1976)  explained  the  detailed  observations 
of  sunlight  by  showing  that  both  hydrogen  atoms  and  negative  hydrogen  ions 
absorb  radiation  in  the  photosphere  (Wildt  1939).  Collisions  between  unionized,  or 
neutral,  hydrogen  atoms  and  free  electrons  lead  to  the  formation  of  the  negative 
hydrogen  ions.  Despite  their  low  concentration,  they  provide  the  absorption  and 
extra  opacity  needed  to  account  for  the  sunlight  that  escapes  from  the  photosphere. 

As  first  noticed  by  William  Herschel  (1738-1822),  the  photosphere  contains  a 
fine  granular  pattern  (Herschel  1801).  These  closely  packed  granulation  cells  now 
can  be  examined  using  high-resolution  images  taken  from  ground-based  telescopes 
under  conditions  of  excellent  observation  (Fig.  8.7)  or  from  spacecraft  located 
outside  of  the  Earth’s  obscuring  atmosphere.  The  images  reveal  a  host  of  granules 
with  bright  centers  surrounded  by  dark  lanes,  exhibiting  a  non-stationary,  over¬ 
turning  motion  caused  by  the  underlying  convection. 

The  bright  center  of  each  granule,  or  convection  cell,  is  the  highest  point  of  a 
rising  column  of  hot  gas.  The  dark  edges  of  each  granule  are  the  cooled  gas,  which 
sinks  because  it  is  denser  than  the  hotter  gas.  Each  individual  granule  lasts  only 
about  15  min  before  it  is  replaced  by  another  one,  never  reappearing  in  precisely 
the  same  location. 

The  mean  angular  distance  between  the  bright  centers  of  adjacent  granules  is 
about  2.0  s  of  arc,  corresponding  to  about  1,500  km  at  the  Sun.  That  seems  very 
large,  but  an  individual  granule  is  about  the  smallest  thing  you  can  see  on  the  Sun 
when  peering  through  our  turbulent  atmosphere. 

There  are  at  least  a  million  granules  on  the  visible  solar  disk  at  any  moment. 
They  are  constantly  evolving  and  changing,  producing  a  honeycomb  pattern  of 
rising  and  falling  gas  that  is  in  constant  turmoil,  bubbling  away  and  completely 
changing  on  time-scales  of  minutes. 

The  granules  are  superimposed  on  a  larger  cellular  pattern,  called  the  super¬ 
granulation,  studied  at  the  California  Institute  of  Technology  by  Robert  B. 
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Fig.  8.7  The  solar  granulation  Underlying  convection  shapes  the  photosphere,  producing  tiny, 
varying  regions  called  granules.  They  are  places  where  hot  and  therefore  bright  material  reaches 
the  visible  solar  disk.  The  largest  granules  are  approximately  1,400  km  across.  They  are  not 
circular  but  rather  angular  in  shape.  This  honeycomb  pattern  of  rising  ( bright )  and  falling  (dark) 
gas  is  in  constant  turmoil,  completely  changing  on  time-scales  of  minutes  and  never  exactly 
repeating  itself.  This  image  was  taken  with  exceptional  angular  resolution  of  0.2  s  of  arc,  or 
150  km,  at  the  Sun  using  the  National  Solar  Observatory’s  Vacuum  Tower  Telescope  at  the 
Sacramento  Peak  Observatory.  (Courtesy  of  Thomas  R.  Rimmele/ AURA/NO AO/NSF.) 


Leighton  (1919-1997)  and  his  collaborators  in  the  early  1960s.  They  subtracted  a 
long-wavelength  image  of  the  Sun  from  a  short-wavelength  one,  revealing  a 
supergranulation  pattern  of  horizontal  flow,  each  supergranulae  is  an  estimated 
16,000  km  across,  or  almost  three  times  as  large  as  the  Earth,  with  a  lifetime  of 
roughly  24  h.  Because  the  motion  is  predominantly  horizontal,  the  supergranules 
were  not  detected  when  looking  directly  at  the  center  of  the  solar  disk,  but  further 
out  toward  the  sides  of  the  round  solar  disk,  where  the  horizontal  motion  is 
partially  directed  along  the  line  of  sight.  Leighton  (1963)  has  provided  a  review  of 
the  solar  granulation. 

Roughly  3,000  supergranules  are  seen  on  the  visible  solar  disk  at  any  moment. 
And  like  the  ordinary  granulation,  the  changing  pattern  of  supergranulation  is 
caused  by  convection.  But  unlike  the  granules,  whose  gases  move  up  and  down, 
the  material  in  each  supergranule  cell  rises  in  the  center,  and  exhibits  a  sideways 
motion  as  it  moves  away  from  the  center  with  a  typical  velocity  of  about 
0.4  km  s_1.  Only  after  this  prolonged  horizontal  motion  does  the  material  even¬ 
tually  sink  down  again  at  the  cell  boundary.  The  supergranular  flow  carries  the 
magnetic  field  across  the  photosphere,  sweeping  the  magnetism  to  the  edges  of  the 
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supergranulation  cells  where  it  collects,  strengthens  and  forms  a  network  of 
concentrated  magnetic  field. 

When  studying  the  supergranulation  in  the  1960s,  Leighton  and  his  co-workers 
unexpectedly  discovered  vertical  up  and  down  motions  in  the  subtracted  difference 
between  long- wavelength  and  short- wavelength  solar  images.  They  exhibited  a 
periodic  oscillation  with  a  period  of  about  five  minutes  (Leighton  1961;  Leighton 
et  al.  1962;  Noyes  and  Leighton  1963).  These  oscillations  have  subsequently  been 
used  to  investigate  the  unseen  depths  of  the  Sun  (Focus  8.7). 


Focus  8.7  Helioseismology 

Vigorous  turbulent  motion  in  the  convective  zone  produces  sound  waves 
(Goldreich  and  Kumar  1990),  which  drive  five-minute  oscillations  in  the 
overlying  photosphere  (Ulrich  1970;  Leibacher  and  Stein  1971;  Deubner 
1975).  Each  five-minute  period  is  the  time  it  takes  for  the  localized  motion  to 
change  from  moving  outward  to  moving  inward  and  back  outward  again. 
Such  five-minute  oscillations  are  imperceptible  to  the  unaided  eye,  for  the 
photosphere  moves  a  mere  hundred-thousandth  (0.00001)  times  the  solar 
radius,  but  they  can  be  detected  using  the  Doppler  effect  of  a  single 
absorption  line  formed  in  the  photosphere.  Deubner  and  Gough  (1984) 
provided  a  review  of  helioseismology  at  that  stage  of  its  development.  More 
recent  accomplishments  of  helioseismology  are  included  with  references  to 
the  relevant  research  papers  in  Lang  (2009). 

The  information  obtained  from  oscillations  produced  by  sound  waves  that 
traveled  to  various  levels  within  the  Sun  can  be  combined  to  create  a  picture 
of  the  Sun’s  large-scale  internal  structure  (Fig.  8.8).  The  technique  is  known 
as  helioseismology,  a  hybrid  name  combining  the  Greek  words  Helios  for 
the  “Sun”  and  seismos  for  “earthquake”  or  “tremor.” 

Observations  from  space,  where  night  never  falls,  provide  the  best  data 
for  helioseismology.  The  SOlar  and  Heliospheric  Observatory ,  abbreviated 
SOHO ,  has  provided  them.  Instruments  aboard  this  spacecraft  have  observed 
the  solar  oscillations  24  h  a  day,  every  day  for  more  than  ten  years.  By 
considering  a  sequence  of  waves  with  longer  and  longer  wavelengths,  that 
penetrate  deeper  and  deeper,  the  radial  profile  of  the  sound  speed  has  been 
determined  and  used  to  establish  the  lower  boundary  of  the  convective  zone, 
at  a  radius  of  71.3  %  of  the  radius  of  the  Sun. 

Rotation  imparts  a  clear  signature  to  the  oscillation  periods,  lengthening 
them  in  one  direction  and  shortening  them  in  the  other.  These  opposite 
effects  make  the  oscillation  periods  divide,  and  such  rotational  splitting 
depends  on  both  depth  and  latitude  within  the  Sun.  The  helioseismological 
observations  indicate  that  differential  rotation,  in  which  the  equator  spins 
faster  than  the  poles,  is  preserved  throughout  the  convective  zone,  but  it 
disappears  in  the  radiative  zone  that  rotates  at  one  speed  (Sect.  4.3). 
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Fig.  8.8  Sound  paths  in  the  Sun  The  trajectories  of  sound  waves  are  shown  in  a  cross  section  of 
the  solar  interior.  The  rays  are  bent  inside  the  Sun,  like  light  within  the  lens  of  an  eye.  They  circle 
the  solar  interior  in  spherical  shells  called  resonant  cavities.  Each  shell  is  bounded  at  the  top  by  a 
large,  rapid  density  drop  near  the  photosphere  and  bounded  at  the  bottom  at  an  inner  turning  point 
where  the  bending  rays  undergo  total  internal  refraction  due  to  the  increase  in  sound  speed  with 
depth  inside  the  Sun.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by 
Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


8.6  The  Faint- Young-Sun  Paradox 

The  Sun  has  grown  slowly  in  luminous  intensity  since  it  formed;  with  a  steady, 
inexorable  brightening  that  is  a  consequence  of  the  increasing  amount  of  helium 
accumulating  in  the  Sun’s  core.  As  the  hydrogen  in  the  Sun’s  center  slowly 
depletes,  and  is  steadily  replaced  by  heavier  helium,  the  core  must  continue  pro¬ 
ducing  enough  pressure  to  prevent  the  Sun  from  collapsing.  The  only  way  to 
maintain  the  pressure  and  keep  supporting  the  weight  of  a  heavier  material  is  to 
increase  the  central  temperature.  As  a  result  of  the  slow  rise  in  temperature,  the 
rate  of  nuclear  fusion  gradually  increases  and  so  does  the  Sun’s  luminosity.  The 
Sun  is,  for  example,  now  30  %  more  luminous  than  it  was  4.6  billion  years  ago. 

The  Sun’s  luminosity  increases  as  time  goes  on,  so  of  course  the  Sun  was 
significantly  dimmer  in  the  remote  past.  Therefore,  the  Earth  should  have  been 
noticeably  colder  then.  However,  this  does  not  agree  with  geological  evidence. 
Assuming  an  unchanging  terrestrial  atmosphere,  with  the  same  composition  and 
reflecting  properties  as  today,  the  lower  solar  luminosity  in  the  past  would  have 
caused  the  Earth’s  global  surface  temperature  to  be  below  the  freezing  point  of 
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water  during  the  planet’s  first  2.6  billion  years.  The  oceans  would  have  been 
frozen  solid,  there  would  have  been  no  liquid  water,  and  the  entire  planet  would 
have  been  locked  into  a  global  ice  age. 

Yet,  sedimentary  rocks,  which  must  have  been  deposited  in  liquid  water,  date 
back  to  a  time  when  the  Earth  was  less  than  800  million  years  old.  There  is  fossil 
evidence  in  those  rocks  of  living  things  at  about  that  time.  Thus,  for  billions  of 
years,  the  Earth’s  surface  temperature  was  not  very  different  from  today;  condi¬ 
tions  have  remained  hospitable  for  life  on  the  Earth  throughout  most  of  the  pla¬ 
net’s  history. 

There  are  several  possible  explanations  for  the  discrepancy  between  the  Earth’s 
warm  climatic  record  and  an  initially  dimmer  Sun,  which  is  known  as  the  faint- 
young-Sun  paradox.  It  can  be  resolved  if  the  Earth’s  primitive  atmosphere  con¬ 
tained  about  a  thousand  times  more  carbon  dioxide  than  it  does  now  (Sagan  and 
Chyba  1997).  Greater  amounts  of  carbon  dioxide  would  enable  the  early  atmo¬ 
sphere  to  trap  more  solar  heat  near  the  Earth’s  surface,  warming  it  by  the  green¬ 
house  effect,  which  would  prevent  the  oceans  from  freezing.  Another  possibility  is 
that  the  Sun  was  more  magnetically  active  in  its  youth,  expelling  strong  winds, 
energetic  particles,  and  radiation  that  might  have  kept  the  Earth  warm  (Schilling 
2001;  Sackmann  and  Boothroyd  2003;  Minton  and  Malhotra  2007). 

What  about  the  future?  In  only  1  billion  years  the  Sun  will  have  brightened  by 
another  10  %.  Calculations  suggest  that  the  Earth’s  oceans  could  evaporate  then  at 
a  rapid  rate,  resulting  in  a  hot,  dry,  uninhabitable  Earth.  In  about  3  billion  years 
from  now,  the  Sun  will  then  be  hot  enough  to  boil  the  Earth’s  oceans  away,  leaving 
the  planet  a  burned-out  cinder,  a  dead  and  sterile  place. 


8.7  The  Sun’s  Destiny 

The  Sun  cannot  shine  forever,  because  eventually  it  will  deplete  the  hydrogen  fuel 
in  its  core.  Although  it  has  converted  only  a  trivial  part  of  its  original  mass  into 
energy,  the  Sun  has  processed  a  substantial  37  %  of  its  core  hydrogen  into  helium 
in  the  past  4.6  billion  years.  There  will  be  no  hydrogen  left  in  the  solar  core  about 
7  billion  years  from  now.  When  that  hydrogen  is  exhausted,  the  central  part  of  the 
Sun  will  undergo  a  slow  collapse,  and  the  gradually  increasing  core  temperature 
will  cause  the  outer  layers  of  the  Sun  to  expand  into  a  red  giant  star,  with  a 
dramatic  increase  in  size  and  a  powerful  rise  in  luminosity  (Sackmann  et  al.  1993). 
Eventually  the  Sun  will  become  170  times  larger  and  2,300  times  more  luminous 
than  it  is  now  (Fig.  8.9).  This  will  result  in  a  substantial  rise  in  temperatures 
throughout  the  solar  system,  becoming  hot  enough  to  melt  the  Earth’s  surface. 

Meanwhile,  the  core  of  the  Sun  will  continue  to  contract  until  the  central 
temperature  is  hot  enough  to  ignite  helium  -  which  is  at  about  100  million  K. 
However,  this  conversion  of  helium  into  carbon  will  not  last  long  compared  to  the 
Sun’s  12  billion  years  of  hydrogen  burning.  In  about  35  million  years,  the  core 
helium  will  have  been  used  up  and  there  will  be  no  heat  left  to  hold  up  the  Sun.  In 
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Fig.  8.9  The  Sun’s  fate  In 

about  7  billion  years,  the  Sun 
will  become  much  brighter 
(top)  and  larger  (bottom).  The 
time-scale  is  expanded  near 
the  end  of  the  Sun’s  life  to 
show  relatively  rapid 
changes.  (Courtesy  of 
I- Juliana  Sackmann  and 
Arnold  I.  Boothroyd.) 


Age  (billions  of  years) 


a  last  spurt  of  activity,  the  Sun  will  shed  the  outer  layers  of  gas  to  produce  an 
expanding  “planetary”  nebula  around  the  star,  and  the  core  will  collapse  into  a 
white  dwarf  star  (see  Sects.  13.1,  13.2). 

By  this  time,  the  intense  winds  will  have  stripped  the  Sun  down  to  about  half  of 
its  original  mass,  and  gravitational  collapse  will  squeeze  the  remaining  part  to 
about  the  size  of  the  present-day  Earth.  Nuclear  reactions  then  will  be  a  thing  of 
the  past,  and  there  will  be  nothing  left  to  warm  the  Sun  or  planets.  The  former  Sun 
will  gradually  cool  down  and  fade  away,  plunging  all  of  the  planets  into  a  deep 
freeze. 

Such  events  are  in  the  very  distant  future,  of  course;  but  even  now,  the  Sun 
threatens  the  Earth  with  its  perpetually  expanding  atmosphere  that  envelops  our 
planet  and  with  explosive  outbursts  that  can  send  energetic  particles,  intense 
radiation,  and  huge  magnetic  bubbles  toward  the  Earth. 


Chapter  9 

The  Extended  Solar  Atmosphere 


This  chapter  discusses  the  Sun’s  outer  atmosphere,  the  million-degree  corona, 
which  expands  away  from  the  Sun  in  fast,  uniform  winds  and  slow,  gusty  ones. 
Solar  flares  and  coronal  mass  ejections  are  also  reviewed  together  with  the  space- 
weather  effects  of  these  solar  explosions  on  the  Earth  and  nearby  space.  A  com¬ 
plete,  in-depth  treatment  of  all  of  these  topics,  with  numerous  references,  can  be 
found  in  Lang,  Kenneth  R.,  The  Sun  From  Space ,  Second  Edition ,  Heidelberg: 
Springer- Verlag  2009. 


9.1  Hot,  Volatile,  Magnetized  Gas 

9.1.1  The  Million-Degree  Solar  Corona 

The  apparent  edge  of  the  visible  solar  disk,  the  photosphere,  is  illusory,  for  a  hot, 
transparent  atmosphere  envelops  it,  extending  all  the  way  to  the  Earth  and  beyond. 
This  unseen  atmosphere  is  more  rarefied  than  the  best  vacuum  on  Earth,  and  so 
tenuous  that  we  see  right  through  it. 

The  diaphanous  outer  atmosphere  of  the  Sun  includes  -  from  its  deepest  part 
outward  -  the  underlying  photosphere,  from  the  Greek  word  photos  for  ‘flight”; 
the  thin  chromosphere,  from  the  Greek  word  chromos  for  “color”;  and  the 
extended  corona  from  the  Latin  word  for  “crown.”  We  can  observe  the  chro¬ 
mosphere  and  corona  during  a  total  solar  eclipse,  when  the  Moon  blocks  out  the 
intense  light  of  the  underlying  photosphere  (Fig.  9.1). 

Because  of  their  very  low  densities  and  high  temperatures,  the  chromosphere 
and  corona  produce  bright  spectral  features  called  emission  lines.  Atoms  and  ions 
in  a  hot  tenuous  gas  produce  such  emission  features,  heated  to  incandescence  and 
shining  at  precisely  the  same  wavelengths  as  the  dark  absorption  lines  produced  by 
the  same  substance  in  the  cooler  photosphere.  The  corona’s  emission  lines  pro¬ 
vided  the  initial  evidence  that  it  is  hundreds  of  times  hotter  than  the  underlying 
photosphere. 


K.  R.  Lang,  Essential  Astrophysics,  Undergraduate  Lecture  Notes  in  Physics, 
DOI:  10.1007/978-3-642-35963-7_9,  ©  Springer- Verlag  Berlin  Heidelberg  2013 
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Fig.  9.1  Eclipse  corona  The 

million-degree  solar 
atmosphere,  known  as  the 
corona,  was  seen  around  the 
shadowed  disk  of  the  Moon 
during  the  solar  eclipse  on  1 1 
July  1991.  The  electrically 
charged  gas  was  concentrated 
by  magnetic  fields  into 
numerous  fine  rays  as  well  as 
larger  helmet  streamers.  The 
expanding  corona  envelops 
the  Earth  and  all  the  other 
planets.  (Courtesy  of  HAO/ 
NCAR.) 


The  corona’ s  emission  lines  were  first  observed  during  total  eclipses  of  the  Sun 
(Table  9.1).  The  intense  green  line,  at  a  wavelength  of  530.3  nm,  for  example,  was 
first  observed  during  the  solar  eclipse  of  August  7,  1869,  and  for  decades  attributed 
to  a  previously  unknown  substance  dubbed  coronium  (Young  1869).  About 
70  years  later,  it  was  attributed  to  emission  of  iron  ions,  denoted  by  Fe  XIV,  by 
Walter  Grotrian  (1890-1954)  of  Potsdam  and  Bengt  Edlen  (1906-1933), 
a  Swedish  astronomer  who  specialized  in  spectroscopy  (Grotrian  1934,  1939; 
Edlen  1941,  1945).  These  ions  are  iron  atoms  missing  13  of  their  26  electrons. 

The  reason  it  took  so  long  to  identify  the  coronal  emission  lines  is  that  no  one 
realized  the  corona  was  so  hot  and  also  because  such  spectral  features  can  arise 
only  in  the  very  tenuous  corona.  They  are  “forbidden  transitions”  that  do  not  occur 
in  terrestrial  circumstances  where  collisions  between  atoms  keep  them  from 
happening  even  in  the  best  vacuum. 

Iron  must  be  at  a  temperature  of  a  few  million  K  for  atomic  collisions  to  remove 
so  many  electrons  from  the  atoms.  Edlen  provided  additional  evidence  for  this  hot 
temperature  from  the  observed  widths  of  the  emission  lines  (Edlen  1941).  Ele¬ 
ments  move  at  a  faster  speed  in  a  hotter  gas,  broadening  the  observed  spectral 
features  as  well  as  producing  them.  The  million-degree  temperature  of  the  corona 
was  subsequently  confirmed  by  observations  of  the  Sun’s  radio  radiation  (Pawsey 
1946)  and  intense  x-ray  radiation. 
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Table  9.1  Strong  forbidden  emission  lines  in  the  visible  light  of  the  Sun’s  low  corona 


Wavelength  (nm) 

Ion 

Name 

Wavelength  (nm) 

Ion 

338.8 

Fe  XIII 

670.2 

Ni  XV 

423.2 

Ni  XII 

789.2 

Fe  XI 

530.3 

Fe  XIV 

Green  line 

802.4 

Ni  XV 

569.4 

Ca  XV 

Yellow  line 

1074.7 

Fe  XIII 

637.4 

Fe  X 

Red  line 

1079.8 

Fe  XIII 

a  Adapted  from  Edlen  (1941)  and  Swings  (1943).  The  symbols  Ca,  Fe  and  Ni  denote,  respectively, 
Calcium,  Iron  and  Nickel.  Subtract  one  from  the  Roman  numeral  to  obtain  the  number  of  missing 
electrons.  Thus,  the  ion  Fe  XIII  is  an  iron  atom  missing  12  electrons.  The  wavelength  is  in  units  of 
nanometers,  or  1  nm  =10  m.  Astronomers  have  often  used  the  Angstrom  unit  of  wavelength, 

o  o  _ i 

where  1  Angstrom  =  1  A  =  0.1  nm  x  10  m 


Example:  The  million-degree  corona 

The  identification  of  an  emission  line  of  the  solar  corona  with  Fe  XIV 
indicated  that  the  coronal  gas  would  have  to  be  very  hot.  We  can  estimate 
how  hot  that  would  be  by  equating  the  thermal  energy  of  the  gas,  3kT/2,  at 
temperature  T,  to  the  ionization  potential  of  Fe  XIV,  which  is  235.04  eV; 
this  is  the  amount  of  energy  needed  to  remove  so  many  electrons  from  the 

_ no  _ 1 

iron  atom.  Using  k  =  1.38065  x  10  ^  J  K  for  the  Boltzmann  constant 
and  the  conversion  of  1  eV  =  1.602  x  10“19  J,  we  obtain  T  —  2  x  ioni¬ 
zation  potential /(3k)  «  1.82  x  106  K,  or  about  2  million  K. 


Withbroe  and  Noyes  (1977)  provided  a  review  of  mass  and  energy  flow  in  the 
solar  chromosphere  and  corona.  Aschwanden  (2006)  discussed  the  physics  of  the 
solar  corona,  and  Ashwanden,  Poland  and  Rabin  (2001)  and  Lang  (2009)  have 
provided  reviews  of  modem  observations  of  the  corona. 

Because  of  its  high  temperature,  the  corona  emits  most  of  its  energy  and  its 
most  intense  radiation  as  x-rays.  The  x-rays  can  be  used  to  image  the  hot  corona  all 
across  the  Sun’s  face  with  high  spatial  and  temporal  resolution.  This  is  because  the 
Sun’s  visible  photosphere,  being  so  much  cooler,  produces  negligible  x-ray  radi¬ 
ation  and  appears  dark  under  the  million-degree  corona.  Since  the  Sun’s  x-ray 
radiation  is  absorbed  totally  in  the  Earth’s  atmosphere,  it  must  be  observed  with 
telescopes  lofted  into  space  by  rockets  or  in  satellites. 

Modem  spacecraft  obtain  full-disk  images  of  the  corona  at  soft  x-ray  and 
extreme  ultraviolet  wavelengths  in  lines  of  ionized  iron,  Fe  XVII,  Fe  IX,  Fe  X, 
Fe  XIII,  Fe  XV,  Fe  XVI  and  one  line  of  ionized  helium,  He  II;  these  are  the 
permitted  lines  emitted  by  ionized  atoms,  sensitive  to  temperatures  from  60,000  to 

z 

4.0  x  10  K  (Table  9.2),  and  not  the  forbidden  lines  detected  at  visible 
wavelengths. 

Close  inspection  of  the  Sun’s  x-ray  radiation  shows  that  the  star  is  in  constant 
turmoil,  driven  by  intense,  variable  magnetic  fields.  This  magnetism  is  responsible 
for  dark  sunspots  that  temporarily  mark  the  visible  face  of  the  Sun. 
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Table  9.2  Prominent  soft  x-ray 
corona  and  transition  regiona 

and  extreme  ultraviolet 

emission  lines  from  the  Sun’s  low 

Wavelength  (nanometers) 

Emitting  ion 

Formation  temperature  (kelvin) 

1.70 

Iron,  Fe  XVII 

4,000,000 

1.90 

Oxygen,  O  VIII 

3,100,000 

2.16 

Oxygen,  O  VII 

2,000,000 

3.37 

Carbon,  C  VI 

1,300,000 

17.11 

Iron,  Fe  IX 

630,000 

17.45 

Iron,  Fe  X 

1,000,000 

18.40 

Oxygen,  O  VI 

320,000 

19.51 

Iron,  Fe  XII 

1,400,000 

28.42 

Iron,  Fe  XV 

2,100,000 

30.38 

Helium,  He  II 

60,000 

33.54 

Iron,  Fe  XVI 

2,500,000 

33.61 

Iron,  Fe  XVI 

2,500,000 

46.52 

Neon,  Ne  VII 

630,000 

60.98 

Magnesium,  Mg  X 

1,300,000 

155.0 

Carbon,  C  IV 

126,000 

a  Subtract  one  from  the  Roman  numeral  to  get  the  number  of  missing  electrons.  The  wavelengths 
are  in  nanometers,  abbreviated  nm,  where  1  nm  =  10-9  m.  Astronomers  sometimes  use  the 

O  #  #  O  O  _  1  A 

Angstrom  unit  of  wavelength,  abbreviated  A,  where  1  A  =  10  meters  =  0.1  nm 


9.1.2  Varying  Sunspots  and  Ever-Changing  Magnetic  Fields 

The  solar  corona  is  permeated  by  magnetic  fields  that  are  generated  inside  the  Sun 
and  rise  up  through  the  photosphere  into  the  overlying  atmosphere.  The  strongest 
magnetism  protrudes  to  blemish  the  visible  Sun  with  dark,  Earth-sized  sunspots 
(Fig.  9.2),  which  were  seen  by  the  unaided  human  eye  up  to  3,000  years  ago  - 
don’t  do  it,  staring  at  the  Sun  could  burn  your  eyes. 

In  the  early  20th  century,  the  American  astronomer  George  Ellery  Hale 
(1868-1938)  first  used  the  Zeeman  effect  to  show  that  sunspots  are  regions  of 
intense  magnetism,  thousands  of  times  stronger  than  the  Earth’s  magnetic  field 
(Hale  1908a,  b).  The  intense  sunspot  magnetism  acts  as  both  a  valve  and  a 
refrigerator,  choking  off  the  outward  flow  of  heat  and  energy  from  the  solar 
interior  and  keeping  the  sunspots  cooler  and  darker  than  their  surroundings. 

The  strong  magnetism  exerts  a  pressure  that  tends  to  push  apart  the  magnetic 
fields;  however,  by  using  helioseismology  to  look  under  the  photosphere  astron¬ 
omers  have  discovered  that  flowing  material  pushes  against  the  magnetic  fields  of 
sunspots,  holding  them  in  place. 

Because  the  Sun’s  magnetism  is  forever  changing  and  is  never  still,  the  sun¬ 
spots  are  temporary,  with  lifetimes  ranging  from  hours  to  months.  Moreover,  the 
total  number  of  sunspots  varies  periodically,  from  a  maximum  to  a  minimum  and 
back  to  a  maximum,  in  about  11  years  (Fig.  9.3).  Samuel  Heinrich  Schwabe 
(1789-1875),  an  amateur  astronomer  in  Dessau,  Germany,  discovered  this  periodic 
variation  in  the  mid-nineteenth  century  (Schwabe  1844).  At  the  maximum  in  the 
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Fig.  9.2  Sunspot  group  Intense  magnetic  fields  emerge  from  the  interior  of  the  Sun  through  the 
Sun’s  visible  disk,  the  photosphere,  producing  groups  of  sunspots.  The  sunspots  appear  dark 
because  they  are  slightly  cooler  than  the  surrounding  photosphere  gas.  This  composite  image  was 
taken  in  white  light;  that  is,  in  all  of  the  colors  combined.  The  enlarged  image  shows  the  biggest 
sunspot  group,  which  is  about  12  times  larger  than  the  Earth,  the  size  of  which  is  denoted  by  the 
black  spot  ( lower  left).  (Courtesy  of  SOHO /ESA/NASA.) 


90N 


90S  - - - — 

1980  1990  2000 

Date 


Fig.  9.3  Solar  magnetic  activity  cycle  The  1 1  year  solar  cycle  of  magnetic  activity  is  plotted 
from  1975  to  2007.  Both  the  numbers  of  sunspots  (left)  and  the  positions  of  sunspots  (right)  wax 
and  wane  in  cycles  that  peak  every  1 1  years.  Similar  1 1  year  cycles  have  been  observed  for  more 
than  a  century.  At  the  beginning  of  each  cycle,  the  first  sunspots  appear  at  about  30°  solar  latitude 
and  then  migrate  to  0°  solar  latitude,  at  the  solar  equator  (EQ),  when  the  cycle  ends.  This  plot  of 
the  changing  positions  of  sunspots  resembles  the  wings  of  a  butterfly,  and  therefore  has  been 
called  the  butterfly  diagram.  The  cycles  overlap  with  spots  from  a  new  cycle  appearing  at  high 
latitudes  while  the  spots  from  the  old  cycle  persist  in  the  equatorial  regions.  The  solar  latitude  is 
the  angular  distance  from  the  plane  of  the  Sun’s  equator,  which  is  very  close  to  the  plane  of  the 
Earth’s  orbit  about  the  Sun,  called  the  ecliptic.  (Courtesy  of  David  Hathaway /NAS A/MSFC.) 
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Fig.  9.4  Winding  up  the  field  A  model  for  generating  the  changing  location,  orientation,  and 
polarity  of  the  sunspot  magnetic  fields.  Initially,  the  magnetic  field  is  supposed  to  be  the  dipolar 
field  seen  at  the  poles  of  the  Sun  (left).  The  internal  magnetic  fields  then  run  just  below  the 
photosphere  from  the  Sun’s  South  to  North  Pole.  As  time  proceeds,  the  highly  conductive, 
rotating  material  inside  the  Sun  carries  the  magnetic  field  along  and  winds  it  up.  Because  the 
equatorial  regions  rotate  at  a  faster  rate  than  the  polar  regions,  the  internal  magnetic  fields  are 
stretched  out  and  wrapped  around  the  Sun’s  center,  becoming  concentrated  and  twisted  together 
like  ropes  (middle  and  right).  With  increasing  strength,  the  submerged  magnetism  becomes 
buoyant,  rises  and  penetrates  the  visible  solar  disk  -  the  photosphere  -  creating  magnetic  loops 
and  bipolar  sunspots  that  are  formed  in  two  belts,  one  each  in  the  northern  and  southern 
hemisphere  (right).  The  simplified  model  shows  only  two  magnetic  loops,  but  many  of  them  are 
created  at  about  the  same  time.  [Adapted  from  Horace  W.  Babcock,  “The  topology  of  the  Sun’s 
magnetic  field,  and  the  22  year  cycle,”  Astrophysical  Journal  133,  572-587  (1961).] 

sunspot  cycle,  there  may  be  100  or  more  spots  on  the  visible  hemisphere  of  the  Sun 
at  one  time;  at  sunspot  minimum,  very  few  are  seen,  and,  for  periods  as  long  as  a 
month  or  more,  none  can  be  found.  The  locations  where  sunspots  emerge  and 
disappear  also  vary  over  the  1 1  year  sunspot  cycle,  from  mid-latitudes  on  the  Sun 
to  the  solar  equator  (see  Fig.  9.3). 

The  American  astronomer  Horace  W.  Babcock  (1912-2003)  devised  a  con¬ 
ceptually  simple  model  for  the  varying  sunspots  (Babcock  1961).  Working  with 
his  son,  Harold,  he  had  shown  that  the  Sun  has  a  general  dipolar  magnetic  field  of 
about  10-4  tesla,  usually  limited  to  high  solar  latitudes  near  the  solar  poles 
(Babcock  and  Babcock  1955).  His  dynamo  theory  begins  at  sunspot  minimum  with 
a  global,  dipolar  magnetic  field  that  runs  inside  the  Sun  from  south  to  north,  or 
from  pole  to  pole.  Uneven,  or  differential,  rotation  -  in  which  the  equatorial 
regions  rotate  faster  than  the  polar  ones  -  shears  the  electrically  conducting  gases 
of  the  interior.  As  a  result,  the  entrained  magnetic  fields  are  stretched  out  and 
squeezed  together.  The  magnetism  is  coiled,  bunched,  and  amplified  as  it  is 
wrapped  around  the  inside  of  the  Sun.  The  surrounding  gas  buoys  up  the  con¬ 
centrated  magnetism,  and  eventually  the  magnetic  fields  become  strong  enough  to 
rise  up  to  the  photosphere  and  break  through  it  in  belts  of  bipolar  sunspot  pairs 
(Fig.  9.4). 

The  initial  dipolar  magnetic  field  is  twisted  into  a  submerged,  ring-shaped  field 
running  parallel  to  the  solar  equator,  or  east  to  west.  There  are  two  buried  magnetic 
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fields,  one  in  the  northern  hemisphere  and  one  in  the  southern  hemisphere,  but 
oppositely  directed,  which  bubble  up  at  mid-latitudes  to  spawn  two  belts  of  sun¬ 
spots,  symmetrically  placed  on  each  side  of  the  equator. 

As  the  1 1  year  cycle  progresses  toward  maximum  activity,  the  internal  mag¬ 
netic  field  is  wound  increasingly  tighter  by  the  shearing  action  of  differential 
rotation.  The  two  sunspot  belts  slowly  migrate  toward  the  solar  equator,  where  the 
sunspots  in  the  two  hemispheres  tend  to  merge. 

Diffusion  and  poleward  flows  sweep  the  remnant  magnetism  into  streams,  each 
dominated  by  a  single  magnetic  polarity,  that  slowly  wind  their  way  from  the  low- 
and  mid-latitude  belts  to  the  Sun’s  poles.  By  sunspot  minimum,  the  continued 
poleward  transport  of  their  debris  may  form  a  global  dipole  with  reversed  polarity. 
The  north  and  south  poles  switch  magnetic  direction  or  polarity  at  the  next  sunspot 
minimum.  When  the  Sun’s  magnetic  flip  is  considered,  we  see  that  it  takes  two 
activity  cycles,  or  about  22  years,  for  the  overall  magnetic  polarity  to  return  to 
where  it  began.  The  internal  magnetism  then  has  readjusted  to  its  submerged 
dipolar  form,  and  the  magnetic  cycle  begins  again. 


9.1.3  Coronal  Loops 

Magnetic  fields  are  described  by  lines  of  force,  like  those  joining  the  opposite 
poles  of  a  bar  magnet.  The  direction  of  the  lines  of  force  and  the  orientation  of  the 
magnetic  fields  can  be  inferred  from  the  polarization  of  the  spectral  lines  that  have 
been  split  by  the  Zeeman  effect.  Magnetic- field  lines  pointing  out  of  the  Sun  have 
positive  magnetic  polarity,  whereas  inward-directed  fields  have  negative  polarity. 

Sunspots  usually  appear  in  adjacent  pairs  or  other  close  groupings  of  opposite 
magnetic  polarity  (Hale  1919).  Invisible  magnetic  arches  loop  between  these 
oppositely  directed  magnetic  regions,  often  emerging  from  a  sunspot  with  one 
polarity  and  reentering  a  neighboring  sunspot  of  opposite  polarity.  Although  they 
remain  unseen  in  optically  visible  sunlight,  these  coronal  loops  shine  brightly  in 
x-ray  and  extreme  ultraviolet  images  of  the  Sun  taken  with  telescopes  in  space 
(Fig.  9.5).  Because  this  radiation  is  absorbed  in  our  atmosphere,  such  images 
cannot  be  obtained  from  the  ground.  Material  is  concentrated  to  higher  densities 
and  temperatures  within  these  loops,  so  they  emit  this  invisible  radiation  more 
intensely  than  their  surroundings.  This  intense  x-ray  and  extreme  ultraviolet 
emission  thus  outlines  the  magnetic  shape  and  structure  of  the  Sun’s  outer 
atmosphere,  indicating  that  the  corona  is  stitched  together  by  bright,  thin  mag¬ 
netized  loops. 

The  magnetized  atmosphere  in,  around,  and  above  bipolar  sunspot  groups  is 
called  a  solar  active  region.  Active  regions  are  places  of  concentrated,  enhanced 
magnetic  fields,  sufficiently  large  and  strong  to  stand  out  from  the  magnetically 
weaker  areas.  These  disturbed  regions  are  prone  to  awesome  explosions,  marking  a 
location  of  extreme  unrest  on  the  Sun. 
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Fig.  9.5  Magnetic  loops  made  visible  An  electrified,  million-degree  gas,  known  as  plasma ,  is 
channeled  by  magnetic  fields  into  bright  thin  loops.  The  magnetized  loops  stretch  up  to 
500,000  km  from  the  visible  solar  disk,  spanning  up  to  40  times  the  diameter  of  planet  Earth.  The 
magnetic  loops  are  seen  in  the  extreme  ultraviolet  radiation  of  eight  and  nine  times  ionized  iron, 
denoted  Fe  IX  and  Fe  X,  formed  at  a  temperature  of  about  1.0  million  K.  The  hot  plasma  is 
heated  at  the  bases  of  loops  near  the  place  where  their  legs  emerge  from  and  return  to  the 
photosphere.  Bright  loops  with  a  broad  range  of  lengths  all  have  a  fine  thread-like  substructure 
with  widths  as  small  as  the  telescope  resolution  of  1  s  of  arc,  or  725  km  at  the  Sun.  This  image 
was  taken  with  the  Transition  Region  And  Coronal  Explorer  (TRACE)  spacecraft.  [Courtesy  of 
the  TRACE  consortium,  LMSAL  and  NASA;  TRACE  is  a  mission  of  the  Stanford-Lockheed 
Institute  for  Space  Research,  a  joint  program  of  the  Lockheed-Martin  Solar  and  Astrophysics 
Laboratory  (LMSAL),  and  Stanford’s  Solar  Observatories  Group.] 


Example:  Coronal  loops  in  solar  active  regions 

Intense  magnetic  fields  of  strength  B  «  0.03  tesla  confine  a  hot  ionized  gas 
in  solar  active  regions  within  coronal  loops.  The  gas  pressure  Pg  =  NekT  of 
the  hot  electrons,  of  number  density  Ne ,  is  just  equal  to  the  magnetic  pressure 
PB  =  B  / (2 jjLq)  required  to  confine  the  hot  plasma  when  Ne  «  2.6  x 
1019  m-3,  assuming  a  temperature  of  T  =  1  million  or  106  K  and  using  a 
Boltzmann  constant  k  =  1.38065  x  10  “  J  K  and  the  permeability  of 
free  space  p0  =  4n  x  10-7  N  A-2  =  1.2566  x  10-6  N  A-2.  In  other 
words,  a  million-degree  gas  with  an  electron  density  less  than  10  m  will 
be  constrained  by  these  coronal  loops. 

If  these  loops  expel  their  electrons  during  a  solar  flare,  the  electrons  will 
be  accelerated  to  higher  energies  by  the  magnetic  interaction  that  triggers 
and  powers  the  flare  (see  Sect.  9.3).  The  energy  of  the  electrons  might  be 
comparable  to  that  of  x-rays,  with  E  =  30  keV  =  4.80  x  10-15  J.  For  a 
coronal  loop  of  radius  R  the  volume  can  be  approximated  as  4nR /3,  and  for 

/T 

a  loop  radius  comparable  to  the  size  of  the  Earth,  or  R  =  6.378  x  10  m,  the 
total  flare  energy  released  will  be  Ef  =  4nR3NeE/3  «  1.25  x  1026  J. 
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Fig.  9.6  Coronal  loops  The  corona  is  stitched  together  with  ubiquitous  coronal  loops  that  are 
created  when  upwelling  magnetic  fields  generated  inside  the  Sun  push  through  the  visible  solar 
disk  -  the  photosphere  -  into  the  overlying,  invisible  chromosphere  and  corona.  These  closed 
magnetic  structures  are  anchored  in  the  photosphere  at  foot  points  of  opposite  magnetic  polarity. 
Coronal  loops  can  be  filled  with  hot  gas  that  shines  brightly  at  extreme  ultraviolet  and  x-ray 
wavelengths.  Driven  by  motions  in  the  underlying  photosphere  and  below,  the  coronal  loops 
twist,  rise,  shear,  and  interact,  releasing  magnetic  energy  that  can  heat  the  solar  corona  and  power 
intense  solar  flares  or  coronal  mass  ejections.  Large  coronal  loops  are  found  in  the  bulb-like  base 
of  coronal  streamers,  whose  long,  thin  stalks  extend  out  into  space.  Magnetic  fields  anchored  in 
the  photosphere  at  one  end  can  also  be  carried  by  the  solar  wind  into  interplanetary  space, 
resulting  in  open  magnetic  fields  and  a  channel  for  the  fast  solar  wind 

The  number  of  active  regions,  with  their  bipolar  sunspots  and  coronal  loops, 
varies  in  step  with  the  sunspot  cycle,  peaking  at  sunspot  maximum  when  they 
dominate  the  structure  of  the  inner  corona.  At  sunspot  minimum,  the  active  regions 
are  largely  absent  and  the  strength  of  the  extreme-ultraviolet  and  x-ray  emission  of 
the  corona  is  greatly  reduced.  Because  most  forms  of  solar  activity  are  magnetic  in 
origin,  the  sunspot  cycle  also  is  called  the  solar  cycle  of  magnetic  activity. 

Unlike  the  Earth,  magnetism  on  the  Sun  does  not  consist  of  only  one  simple 
dipole;  it  contains  numerous  interlooped  pairs  of  opposite  magnetic  polarity. 
Powerful  magnetism,  spawned  deep  inside  the  Sun,  threads  its  way  through  the 
solar  atmosphere,  creating  a  dramatic,  ubiquitous,  and  ever-changing  panorama  of 
coronal  loops  (Fig.  9.6). 

Throughout  the  solar  atmosphere,  a  dynamic  tension  is  set  up  between  the  gas 
pressure  of  the  charged  particles  and  the  pressure  of  the  magnetic  field  (Focus  9.1). 
In  the  photosphere  and  convective  zone,  the  gas  pressure  dominates  the  magnetic 
pressure,  allowing  the  magnetic  field  to  be  carried  around  by  the  moving  gas. 
Because  the  churning  gases  are  ionized  and  hence  electrically  conductive,  they 
sweep  the  magnetic  field  along  (Fig.  9.7).  The  situation  is  reversed  in  the  low 
corona  within  active  regions,  where  hot  ionized  particles  are  confined  within 
coronal  loops  (Fig.  9.8). 
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Fig.  9.7  Gas  and  magnetic  pressure  The  ratio  of  gas  to  magnetic  pressure,  denoted  by  the  symbol  ft, 
is  plotted  as  a  function  of  height  above  the  photosphere.  The  magnetic  pressure  is  greater  than  the  gas 
pressure  in  the  low  corona,  where  (3  is  less  than  1,  and  magnetic  fields  determine  the  structure  of  the 
corona.  Farther  out,  the  gas  pressure  can  exceed  the  magnetic  pressure,  which  permits  the  solar  wind  to 
carry  the  Sun’s  magnetic  field  into  interplanetary  space.  In  the  photosphere,  below  the  corona  and 
chromosphere,  the  gas  pressure  also  exceeds  the  magnetic  pressure,  and  the  moving  gas  carries  around 
magnetic  fields.  [Adapted  from  Allen  (2001).] 


Focus  9.1  Magnetic  pressure  and  gas  pressure 

A  magnetic  field  tends  to  restrain  a  collection  of  electrons  and  protons, 
called  plasma,  while  the  plasma  exerts  a  pressure  that  opposes  this  field.  The 
magnetic  pressure  is  an  energy  density  associated  with  the  magnetic  field. 
The  magnetic  pressure,  PB ,  produced  by  a  magnetic  field  transverse  to  its 
direction  is  given  by: 


for  a  magnetic  field  of  strength  B  in  tesla,  where  the  permeability  of  free 
space  /iq  =  471  x  10-7  N  A-2  =  1.2566  x  10-6  N  A-2.  As  expected,  a 
stronger  magnetic  field  applies  a  greater  restraining  pressure. 

Hot  plasma  generates  a  gas  pressure,  PG ,  owing  to  the  motions  of  its 
particles.  The  ideal  gas  law  describes  it: 


PG  =  NkT ,  (9.2) 

_ OQ  _  1 

where  N  is  the  particle  number  density,  k  —  1.38065  x  10  “  J  K  is  the 
Boltzmann  constant,  and  T  is  the  temperature.  Hotter  particles  move  faster 
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and  create  greater  pressure  to  oppose  the  magnetic  field,  and  denser  plasma 
also  results  in  greater  gas  pressure. 

The  two  kinds  of  pressure  compete  for  control  of  the  solar  atmosphere.  In 
the  low  solar  corona,  strong  magnetic  fields  in  active  regions  hold  the  hot, 
dense  electrified  gas  within  coronal  loops.  The  magnetic  and  gas  pressures 
become  equal  for  a  magnetic  field,  B ,  given  by: 

B  =  [( 2n0k)NT ]1/2  =  [3.48  x  10“29AT]1/2  tesla.  (9.3) 

i  n 

If  a  coronal  loop  contains  a  hot,  dense  plasma  with  N  =  10  electrons  per 
cubic  meter  and  T  =  10  K,  the  magnetic  field  must  be  stronger  than 
B  «  0.002  tesla  to  restrain  the  plasma.  By  way  of  comparison,  the  magnetic 
field  strength  at  the  Earth’s  equator  is  0.00003  tesla,  or  at  least  100  times 
weaker  than  the  magnetic  field  in  some  coronal  loops. 


Fig.  9.8  The  Sun  in  x-rays  Ionized  gases  at  a  temperature  of  a  few  million  K  produce  the  bright 
glow  seen  in  this  x-ray  image  of  the  Sun.  It  shows  magnetic  coronal  loops  that  thread  the  corona 
and  hold  the  hot  gases  in  place.  The  brightest  features  are  called  active  regions  and  correspond  to 
the  sites  of  the  most  intense  magnetic  field  strength.  The  Soft  X-ray  Telescope  (SXT)  aboard  the 
Japanese  Yohkoh  satellite  recorded  this  image  of  the  Sun’s  corona  on  1  Feb  1992,  near  a 
maximum  of  the  11  year  cycle  of  solar  magnetic  activity.  Subsequent  SXT  images,  taken  about 
five  years  later  near  activity  minimum,  show  a  remarkable  dimming  of  the  corona  when  the  active 
regions  associated  with  sunspots  have  almost  disappeared,  and  the  Sun’s  magnetic  field  has 
changed  from  a  complex  structure  to  a  simpler  configuration.  (Courtesy  of  NASA 
/ISAS/LMSAL/NAO  Japan,  University  of  Tokyo.) 


266 


9  The  Extended  Solar  Atmosphere 


The  coronal  magnetic  fields  emerge  from  underneath  the  photosphere  where 
they  are  rooted,  and  they  are  continually  displaced  and  replaced  by  convective 
motions  just  below  the  photosphere.  As  a  result,  the  corona  has  no  permanent 
features  and  it  is  never  still,  quiet,  or  inactive.  It  is  always  in  a  continued  state  of 
metamorphosis. 


9.1.4  What  Heats  the  Corona? 

The  visible  solar  disk,  the  photosphere,  is  closer  to  the  Sun’s  center  than  the 
million-degree  corona,  but  the  photosphere  is  several  hundred  times  cooler,  with  a 
temperature  of  5,780  K.  This  temperature  difference  is  unexpected  because  energy 
should  not  flow  from  the  cooler  photosphere  to  the  hotter  corona  any  more  than 
water  should  flow  uphill.  It  violates  the  second  law  of  thermodynamics,  which 
states  that  heat  cannot  be  continuously  transferred  from  a  cooler  body  to  a  warmer 
one  without  doing  work. 

We  know  that  visible  sunlight  cannot  resolve  the  heating  paradox.  Radiation 
from  the  photosphere  does  not  go  into  the  corona;  it  goes  through  the  corona. 
There  is  so  little  material  in  the  corona  that  it  is  transparent  to  almost  all  of  the 
photosphere’s  radiation.  Therefore  sunlight  passes  right  through  the  corona 
without  depositing  substantial  quantities  of  energy  into  it,  traveling  out  to  warm 
the  Earth  and  to  also  keep  the  photosphere  cool. 

So,  radiation  cannot  resolve  the  heating  paradox.  We  must  look  for  alternate 
sources  of  energy,  and  they  are  related  to  either  moving  gases  or  the  magnetic 
fields  in  the  photosphere  and  below.  Unlike  radiation,  either  the  kinetic  energy  of 
moving  material  or  the  magnetic  energy  released  by  magnetic  fields  can  flow  from 
cold  to  hot  regions,  keeping  the  corona  hot. 

In  1948-1949,  astronomers  in  Germany,  the  United  States,  and  France  inde¬ 
pendently  proposed  that  sound  waves  generated  in  the  turbulent  convective  zone 
might  heat  the  overlying  atmosphere  (Biermann  1948;  Schwarzschild  1948; 
Schatzman  1949).  The  sound  waves  would  accelerate  and  strengthen  as  they  travel 
outward  through  the  increasingly  rarefied,  overlying  solar  atmosphere,  until 
supersonic  shocks  are  created,  dissipating  energy  and  heating  the  gas. 

Although  observations  from  the  eighth  Orbiting  Solar  Observatory ,  abbreviated 
OSO  8,  showed  that  sound  waves  do  not  transport  significant  amounts  of  energy  into 
the  corona,  these  measurements  indicated  that  the  sounds  might  warm  the  chromo¬ 
sphere  to  10,000  K,  or  roughly  twice  the  temperature  of  the  underlying  photosphere 
(Athay  and  White  1978, 1979;  Bruner  1981).  Modem  observations  indicate  that  even 
though  the  majority  of  sound  waves  generated  in  the  convective  zone  are  reflected 
back  into  the  solar  interior  at  the  photosphere,  a  small  percentage  of  them  do  manage 
to  slip  through  the  photosphere  along  inclined  magnetic  fields,  forming  shocks  that 
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heat  the  low  chromosphere  and  create  numerous  short-lived  spicules  there.  This 
method  of  chromosphere  heating  is  generally  consistent  with  the  fact  that  other  stars 
with  outer  convective  zones  have  chromospheres,  while  stars  that  have  no  convective 
zones  do  not  exhibit  a  detectable  chromosphere. 

For  coronal  heating,  other  kinds  of  waves  must  be  considered,  and  a  likely 
candidate  is  magnetic  waves  that  can  propagate  into  the  corona  and  carry  energy 
into  it.  The  Sun’s  ever-changing  coronal  magnetic  fields  are  always  being  jostled, 
twisted,  and  stirred  around  by  motions  deep  within  the  Sun  where  the  magnetism 
originates.  A  tension  acts  to  resist  the  motions  and  pull  the  disturbed  magnetism 
back,  generating  waves  that  propagate  along  magnetic  fields,  somewhat  like  a 
vibrating  string.  These  waves  do  not  form  shocks,  and  once  generated  they  can 
propagate  for  large  distances,  directing  their  energy  along  open  magnetic  fields 
into  the  overlying  corona. 

Such  waves  are  now  called  Alfven  waves  after  Hannes  Alfven  (1908-1995) 
who  first  described  them  mathematically  (Alfven  1942a,  b,  Sect.  5.6),  and  argued 
that  they  might  heat  the  corona  (Alfven  1947).  Ronald  G.  Giovanelli  (1915-1984), 
Jack  H.  Piddington  (1910-1997),  and  Donald  E.  Osterbrock  (1924-2007)  subse¬ 
quently  discussed  the  heating  of  the  chromosphere  and  corona  by  Alfven  waves 
(Giovanelli  1949;  Piddington  1956;  Osterbrock  1961).  An  important  ongoing 
controversy  is  whether  or  not  Alfven  waves  propagating  through  the  corona  dis¬ 
sipate  sufficient  energy  to  heat  it.  The  waves  have  been  detected  far  from  the  Sun, 
suggesting  that  they  might  heat  the  distant  corona  by  traveling  along  open  mag¬ 
netic  fields  (Belcher  1969;  Cranmer  and  van  Ballegooijen  2005). 

Strong,  interacting  magnetic  fields  also  play  a  role  in  heating  closed  magnetic 
regions  closer  to  the  Sun,  within  the  coronal  loops.  After  all,  the  hottest  and  densest 
material  in  the  low  corona  is  located  where  the  magnetic  field  is  strongest,  usually 
within  active  regions  above  sunspots.  Moreover,  observations  from  solar  spacecraft 
indicate  that  the  entire  magnetic  flux  in  the  so-called  quiet  solar  atmosphere  outside 
solar  active  regions  is  replenished  every  15-40  h  (Schrijver  and  Title  2003). 

Motions  down  inside  the  convective  zone  twist  and  stretch  the  overlying 
magnetic  fields,  slowly  building  up  their  energy,  and  magnetic  loops  of  all  sizes 
always  are  being  pushed  up  into  the  solar  corona  from  below.  When  oppositely 
directed  coronal  loops  are  pressed  together,  they  can  merge  and  join  at  the  place 
where  they  touch,  releasing  their  energy  to  heat  the  corona.  The  magnetic  fields 
then  reform  or  reconnect  in  new  magnetic  orientations,  so  this  method  of  coronal 
heating  is  termed  magnetic  reconnection. 

Thus,  magnetic  fields  seem  to  play  a  fundamental  role  in  channeling,  storing, 
and  transforming  energy  into  heat,  supplying  it  on  different  timescales  and  sending 
it  to  various  structures.  When  the  magnetic  geometry  does  not  change,  the  mag¬ 
netism  plays  a  passive  role,  guiding  the  flow  of  charged  particles,  heat,  and  waves 
along  the  field  lines.  And  when  the  magnetic  configuration  changes,  the  magnetism 
can  play  an  active  role  by  triggering  instabilities  and  releasing  stored  magnetic 
energy  through  merging  and  reconnection  of  closed  magnetic  field  lines. 
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9.1.5  Coronal  Holes 

In  contrast  to  the  dense,  bright  areas,  the  corona  also  contains  less  dense  regions 
called  coronal  holes.  These  so-called  holes  have  so  little  material  in  them  that  they 
appear  as  large  dark  areas  on  x-ray  or  extreme-ultraviolet  images,  seemingly 
devoid  of  radiation  (also  see  Fig.  9.8). 

The  coronal  holes  are  neither  constant  nor  permanent;  they  appear,  evolve,  and 
die  away  in  periods  ranging  from  a  few  weeks  to  several  months,  continuously 
changing  in  content,  shape,  and  form. 

At  times  of  low  solar  activity,  near  the  minimum  of  the  Sun’s  1 1  year  magnetic 
activity  cycle,  coronal  holes  cover  the  north  and  south  polar  caps  of  the  Sun. 
During  more  active  periods,  closer  to  the  cycle  maximum,  the  large  coronal  holes 
at  the  poles  shrink  and  even  disappear,  and  smaller  coronal  holes  appear  at  all  solar 
latitudes  -  even  at  or  near  the  solar  equator. 

The  rarefied  coronal  holes  are  not  completely  empty.  The  normally  constraining 
magnetic  forces  relax  and  open  up  in  the  coronal  holes  to  allow  an  unencumbered 
outward  flow  of  electrically  charged  particles  and  magnetic  fields  into  interplan¬ 
etary  space,  keeping  the  coronal  hole’s  density  low  and  expelling  a  relentless  high¬ 
speed  wind. 


9.2  The  Sun’s  Varying  Winds 

9.2.1  The  Expanding  Sun  Envelops  the  Earth 

The  Sun’s  radiation  is  not  all  that  passes  through  the  space  between  the  planets. 
It  is  filled  with  electrons,  protons,  and  magnetic  fields  emanating  from  the  Sun  in  a 
ceaseless  flow.  These  unseen  particles  and  fields  form  a  perpetual  solar  wind  that 
extends  all  the  way  to  the  Earth  and  far  beyond.  It  was  inferred  from  comet  tails, 
suggested  by  theoretical  considerations,  and  fully  confirmed  by  direct  measure¬ 
ments  from  spacecraft  in  the  early  1960s  (Focus  9.2).  So  the  space  between  the 
planets  is  not  completely  empty;  it  contains  the  Sun’s  winds  that  stream  out 
radially  in  all  directions  from  the  Sun. 

Focus  9.2  Discovery  of  the  solar  wind 

The  notion  that  something  is  always  being  expelled  from  the  Sun  first  arose 
from  observations  of  comet  tails.  Comets  can  appear  unexpectedly  almost 
anywhere  in  the  sky,  moving  in  every  possible  direction,  but  with  tails  that 
always  point  away  from  the  Sun.  A  comet  therefore  travels  headfirst  when 
approaching  the  Sun  and  tail  first  when  departing  from  it.  Ancient  Chinese 
astronomers  concluded  that  the  Sun  must  have  a  chi ,  or  “life  force”,  that 
blows  away  the  comet  tails.  In  the  early  1600s,  the  German  astronomer 
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Johannes  Kepler  (1571-1630)  proposed  that  solar  radiation  pushes  the  comet 
tails  away  from  the  Sun. 

When  a  comet  is  tossed  into  the  inner  solar  system,  the  dirty  ice  on  its 
surface  is  vaporized,  sometimes  forming  two  kinds  of  tails  that  always  point 
generally  away  from  the  Sun  rather  than  toward  it.  One  is  a  yellow  tail  of 
dust,  which  can  litter  the  comet’s  curved  path.  The  dust  is  pushed  away  from 
the  Sun  by  the  pressure  of  sunlight.  The  other  tail  is  colored  electric  blue, 
shining  in  the  light  of  ionized  particles.  The  ions  in  comet  tails  always 
stream  along  straight  paths  away  from  the  Sun. 

The  existence  of  the  solar  wind  of  charged  particles  was  suggested  from 
observations  of  comet  ion  tails  in  the  mid  20th  century.  The  German 
astronomer,  Ludwig  Biermann  (1907-1986)  noticed,  in  the  1950s,  that  the 
ions  in  a  comet’s  tail  move  with  velocities  many  times  higher  than  could  be 
caused  by  the  weak  pressure  of  sunlight,  and  proposed  that  a  continued  flow 
of  electrically-charged  particles  pours  out  of  the  Sun  at  all  times  and  in  all 
directions,  accelerating  the  ions  to  high  speeds  and  pushing  them  away  from 
the  Sun  in  straight  ion  tails  (Biermann  1951,  1957). 

Eugene  N.  Parker  (1927-  )  of  the  University  of  Chicago  showed  how  such 
a  relentless  flow  might  work,  dubbing  it  the  solar  wind  (Parker  1958).  It 
would  naturally  result  from  the  expansion  of  the  Sun’s  million-degree 
atmosphere,  the  corona.  He  also  demonstrated  how  a  magnetic  field  would 
be  pulled  into  interplanetary  space  from  the  rotating  Sun,  attaining  a  spiral 
shape  (Parker  1958,  1960,  1963).  The  expansion  begins  slowly  near  the  Sun, 
where  the  solar  gravity  is  the  strongest.  Then  the  expanding  corona  accel¬ 
erates  outward  into  space  until  the  winds  break  away  from  the  Sun,  and 
eventually  they  cruise  along  at  the  roughly  constant  and  supersonic  veloci¬ 
ties  needed  to  account  for  the  acceleration  of  comet  tails.  This  creates  a 
strong,  persistent  solar  wind,  forever  blowing  throughout  the  solar  system. 

The  first  direct  measurements  of  the  solar  wind’s  corpuscular,  or  particle, 
content  were  made  by  a  group  of  Soviet  scientists  led  by  Konstantin  I. 
Gringauz  (1918-1993),  using  four  ion  traps  aboard  the  Lunik  2  spacecraft 
launched  to  the  Moon  on  September  12,  1959.  In  the  following  year, 
Gringauz  reported  that  the  maximum  current  in  all  four  ion  traps  corre- 
sponded  to  a  solar  wind  flux  of  2  million  million  (2  x  10  )  ions  (pre¬ 
sumably  protons)  per  square  meter  per  second  (Gringauz  1961).  This  is  in 
rough  accord  with  all  subsequent  measurements. 

All  reasonable  doubt  concerning  the  existence  of  the  solar  wind  was 
removed  by  measurements  made  on  board  NASA’s  Mariner  2,  launched  on 
August  27,  1962.  Marcia  Neugebauer  (1932-  )  and  Conway  W.  Snyder  of 
the  Jet  Propulsion  Laboratory  used  more  than  one  hundred  days  of  Mariner  2 
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data,  obtained  as  the  spacecraft  traveled  to  Venus,  to  show  that  charged 
particles  are  continuously  emanating  from  the  Sun,  for  at  least  as  long  as 
instruments  on  Mariner  2  observed  them  (Neugebauer  and  Snyder  1966; 
Neugebauer  199").  It  also  unexpectedly  indicated  that  the  solar  wind  has  a 
slow  and  a  fast  component.  The  slow  one  moves  at  a  speed  of 
300-400  km  s_l ;  the  fast  one  travels  at  twice  that  speed. 

The  solar  wind  flux  determined  by  Neugebauer  and  Snyder  was  in  good 
agreement  with  the  values  measured  with  the  ion  traps  on  Lunik  2.  The 

z: 

average  wind  ion  number  density  was  shown  to  be  5  million  (5  x  10  ) 
protons  per  cubic  meter  near  the  distance  of  the  Earth  from  the  Sun.  We  now 
know  that  such  a  low  density  close  to  the  Earth’s  orbit  is  a  natural  conse¬ 
quence  of  the  wind’s  expansion  into  an  ever-greater  volume,  but  that  vari¬ 
able  wind  components  can  gust  with  higher  densities. 


In  the  low  corona,  strong  magnetic  fields  constrain  the  hot  ionized  gas  within 
coronal  loops.  But  further  out  in  the  corona,  the  magnetic  fields  decrease  in 
strength  and  cannot  restrain  the  outward  flow  of  the  million-degree  gas;  it  also 
flows  out  unencumbered  from  the  open  magnetic  fields  in  coronal  holes. 


Example:  Hot  enough  to  break  away  from  the  Sun’s  gravity 

The  corona  is  fully  ionized,  with  a  temperature  of  several  million  K  and 
consisting  of  electrons  and  protons.  We  can  determine  if  these  particles  are 
hot  enough  to  escape  from  the  Sun’s  gravity  by  equating  their  thermal 

velocity  (3 kT/m)1^2  to  the  escape  velocity  of  the  Sun  (2 GMq/Rq)1^2  1  to 
obtain  (Sects.  3.2,  5.2,  5.3): 


T 


2GmMQ 
3  kRQ 


«  9.2  x  1033 


m  K, 


where  the  Sun’s  mass  M0  =  1.989  x  1030  kg  and  radius  R®  =  6.955  x 
108  m,  the  Boltzmann  constant  k  =  1.38065  x  10-23  J  K-1  and  the  con- 

_ 11  o  _ o 

stant  of  gravitation  G  =  6.674  x  10  N  m  kg  .  The  temperature  is 
denoted  by  T  and  the  particle  mass  is  designated  by  m. 

_ 0^7 

For  a  proton  of  mass  mP  =  1.6726  x  10  ~  kg,  the  temperature  required 

'■7 

for  escape  is  T  &  1.5  x  10  K,  while  for  the  electron  of  mass 
me  =  9.1094  x  10-31  kg  it  is  T  «  8.4  x  103  K.  So  the  electrons  in  the 
million-degree  solar  corona  would  have  no  problem  escaping  from  the  Sun, 
and  the  protons  would  require  just  a  little  extra  push  to  put  them  out  where 
the  solar  gravity  is  somewhat  diminished. 
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The  solar  gale  brushes  past  the  planets,  wraps  itself  around  the  Earth,  and 
carries  the  Sun’s  corona  out  to  interstellar  space.  This  radial,  supersonic  outflow 
creates  a  huge  bubble  of  plasma,  with  the  Sun  at  its  center  and  the  planets  inside; 
this  is  called  the  heliosphere ,  from  Helios  the  “God  of  the  Sun”  in  Greek 
mythology. 


9.2.2  Properties  of  the  Solar  Wind 


The  million-degree  corona  is  so  hot  that  it  cannot  stand  still.  Indeed,  the  solar  wind 
consists  of  an  overflow  corona,  which  is  too  hot  to  be  entirely  constrained  by  the  Sun’s 
inward  gravitational  pull  and  perpetually  moves  out  into  surrounding  space.  The  hot 
gas  creates  an  outward  pressure  that  tends  to  oppose  the  inward  pull  of  the  Sun’s 
gravity.  At  great  distances,  where  the  solar  gravity  weakens,  the  hot  protons  and 
electrons  overcome  the  Sun’s  gravity  and  continue  to  accelerate,  like  water  over¬ 
flowing  a  dam.  So,  the  solar  corona  is  really  the  visible,  inner  base  of  the  solar  wind,  and 
the  solar  wind  is  simply  the  hot  corona  expanding  into  interplanetary  space. 

The  Sun’s  continuous  wind  travels  with  two  main  velocities.  There  is  a  fast, 
uniform  wind  that  blows  at  a  speed  of  about  750  km  s_1,  and  a  variable,  gusty 
slow  wind  that  moves  about  half  as  fast.  Both  winds  are  supersonic,  moving  at 
least  10  times  faster  than  the  speed  of  sound  in  the  solar  wind.  The  Sun’s  wind  also 
rushes  on  with  little  reduction  in  speed  because  there  is  almost  nothing  out  there  to 
slow  it  down.  Both  the  fast  and  slow  winds  from  the  Sun  are  much  more  tenuous, 
hotter,  and  faster  than  any  wind  on  the  Earth. 


Example:  The  supersonic  solar  wind 

The  speed  of  sound,  cs,  in  the  solar  wind  can  be  determined  from  the 
equation  (Laplace  1816,  Sect.  5.8): 


= 


where  the  adiabatic  index  y  =  5/3  for  a  fully  ionized  gas  like  the  solar  wind, 

_ oo  _ 1 

the  Boltzmann  constant  k  =  1.38065  x  10  ^  J  K  ,  and  the  mean 
molecular  mass  m  =  p  x  u  where  the  mean  molecular  weight  p  =  0.5  for 
the  solar  wind  (fully  ionized  almost  entirely  composed  of  protons),  and  the 
atomic  mass  unit  u  =  1.66054  x  10  ^  kg.  For  a  temperature  of  T  —  1.2  x 
105  K  for  solar- wind  protons  near  the  Earth’s  orbit,  this  equation  gives 
cs  «  5.8  x  104  m  s  1  I 58  km  s  \  By  way  of  comparison,  the  slow  and 
fast  solar  winds  have  respective  velocities  of  about  375  and  750  km  s-1, 
which  indicates  that  the  solar  wind  is  everywhere  supersonic,  or  moving  at 
speeds  faster  than  sound. 
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Table  9.3  Mean  values  of  solar- wind  parameters  at  the  Earth’s  orbita 


Parameter 

Mean  value 

Particle  density,  N 

n  'X 

N  «  10  m  (5  electrons  and  5  protons  per  cubic  centimeter) 

Velocity,  V 

Fast  wind  V  ^750  km  s-1 

Slow  wind  V  ~  375  km  s-1 

Average  V  «  600  km  s-1 

Mass  density,  p 

p  =  10-20  kg  m-3  (protons) 

Flux,  F 

F  &  6  x  1012  particles  m-2  s-1 

Temperature,  T 

T  w  120,000  K  (protons)  to  140,000  K  (electrons) 

Particle  thermal  energy,  kT 

kT  x  2  x  10”18  J  «  12  eV 

Proton  kinetic  energy,  0.5  mpV 

0.5  mpV2  x  10" 16  J  «  1,000  eV  =  1  keV 

Particle  thermal  energy  density 

NkT  x  10”  UJ  m”3 

Proton  kinetic  energy  density 

0.25  N  mp  V2x  10"9  J  m"3 

Radial  magnetic  field,  Br 

Br  =  2.5  x  10”9  T  =  2.5  nT  =  2.5  x  10”5  G 

Alfven  velocity,  VA 

VA  =  32  km  s-1 

Sound  speed,  cs 

cs  «  50  km  s'1 

a  These  solar-wind  parameters  are  at  the  mean  distance  of  the  Earth  from  the  Sun,  or  at  one 
astronomical  unit,  1  AU,  where  1  AU  =  1.496  x  1011  m.  The  Boltzmann  constant  ^  =1.38  x 
10  J  K  relates  temperature  and  thermal  energy.  The  proton  mass  mp  =  1.67  x  10  kg 


Because  the  electrified  wind  material  is  an  excellent  conductor  of  heat,  the 
temperature  falls  off  only  gradually  with  distance  from  the  Sun,  reaching  between 
120,000  K  and  140,000  K  at  the  Earth’s  distance.  The  tenuous  wind  has  been 
diluted  to  a  rarefied  plasma  by  the  time  it  reaches  the  Earth’s  distance  from  the 
Sun,  where  there  are  approximately  5  million  electrons  and  5  million  protons  per 
cubic  meter  of  solar  wind.  The  density  of  the  solar  wind  is  so  low  that  there  are  not 
enough  particles  to  heat  astronauts  that  might  venture  outside  their  spacecraft. 

Because  it  is  wrapped  into  a  spiral,  the  interplanetary  magnetic  field  strength 
falls  off  linearly  with  distance  from  the  Sun,  in  contrast  to  the  solar  wind  number 
density  that  decreases  more  rapidly,  as  the  inverse  cube  of  the  distance,  as  it  fills  a 
larger  volume. 

Physical  properties  of  the  solar  wind  at  the  Earth’s  distance  from  the  Sun  are 
listed  in  Table  9.3. 

At  a  mean  speed  of  about  600  km  s-1,  the  flux  of  solar  wind  particles  is  far 
greater  than  anything  else  in  nearby  space.  Between  one  and  ten  million  million 

1  O  IQ 

(10  -10  )  particles  in  the  solar  wind  cross  every  square  meter  of  space  each 
second.  That  flux  far  surpasses  the  flux  of  more  energetic  cosmic  rays  that  enter 
our  atmosphere,  with  the  abundant  protons  arriving  with  a  flux  of  only  640  protons 
per  square  meter  per  second.  The  peak  local  energy  density  of  cosmic  rays  is  about 
one  million  electron  volts  per  cubic  meter,  or  about  one  ten  thousandth  (10-4)  the 
kinetic  energy  density  of  solar  wind  protons. 

Moreover,  interplanetary  magnetic  fields  act  as  a  barrier  to  electrically  charged 
cosmic  rays  that  are  coming  from  the  depths  of  space,  preventing  them  from 
reaching  the  Earth.  During  the  maximum  in  the  solar  cycle,  stronger  solar  mag¬ 
netic  fields  are  carried  out  into  interplanetary  space  by  the  Sun’s  wind,  deflecting 
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more  cosmic  rays.  Less  extensive  interplanetary  magnetism,  during  a  minimum  in 
the  1 1  year  cycle  of  magnetic  activity,  lowers  the  barrier  to  the  cosmic  particles 
and  allows  more  of  them  to  arrive  at  Earth.  This  unexpected  anti-correlation 
between  solar  activity  and  cosmic  rays  arriving  at  Earth  is  often  called  the  Forbush 
effect  after  its  discovery  by  Scott  Forbush  (1904-1984)  in  the  early  1950s  (Forbush 
1950).  It  was  explained  by  Peter  Meyer  (1920-2002),  Eugene  Parker  and  John 
Simpson  (1916-2000)  who  proposed  that  enhanced  interplanetary  magnetism  near 
the  maximum  in  the  11  year  solar  activity  cycle  deflects  cosmic  rays  from  their 
Earth-bound  paths  (Meyer  et  al.  1956). 

Because  the  Sun  is  blowing  itself  away  continuously,  we  might  imagine  that  it 
would  eventually  vanish  from  view  after  expelling  all  of  its  substance  into  space. 
Every  second,  the  solar  wind  carries  about  a  billion  kilograms,  or  1  million  tons,  of 
the  Sun  into  surrounding  space.  That  seems  significant,  but  in  4.6  billion  years  the 
solar  wind  has  only  carried  away  about  0.0001,  or  one  ten  thousandth,  of  the  Sun’s 
mass  Mq  =  1.989  x  1030  kg.  Moreover,  that  is  about  three  times  less  than  the 
amount  of  mass  turned  into  energy  during  this  time  by  nuclear  reactions  near  the 
center  of  the  Sun  (Focus  9.3). 


Focus  9.3  Mass  loss  from  the  Sun 

The  Sun  is  relentlessly  expelling  mass  into  the  solar  wind,  most  of  it  in  protons 
with  a  mass  mP  =  1.6726  x  10  ~  kg.  Using  measurements  at  the  Earth, 
located  at  a  mean  distance  from  the  Sun  of  1  AU  =  1 .496  x  1011  m,  we  can 
estimate  the  total  mass,  AM sw\ost  from  the  Sun  in  the  solar  wind  every  second: 

AMSW  =  4nmPNPV (AU)2  =  1.41  x  109  kg,  (9.6) 

where  the  number  density  of  protons  in  the  solar  wind  at  the  Earth’s  distance 

/r  _ o 

is  NP  =  5  x  10  m  and  we  have  assumed  an  average  wind  velocity  of 
V  =  600  km  s-‘  =6  x  105  ms  '.  The  total  mass  MSw  lost  at  this  rate  over 

7 

the  past  4.6  billion  years,  using  1  year  =  3.156  x  10  s,  is: 

Msw  =  dMsw  x  4.6  xlO9  x  3.156  x  107  =  2.05  x  1026  kg  «  0.0001MQ, 

(9.7) 

where  M0  =  1.989  x  103°  kg  denotes  the  Sun’s  current  mass. 

About  three  times  more  mass  is  consumed  every  second  by  nuclear 
reactions  that  make  the  Sun  shine.  For  just  one  fusion  of  four  protons  into 
one  helium  nucleus  the  mass  lost  is  Am  =  0.007  (4 mP)  with  an  energy 

o  o  _ 1 

release  AE  =  Amc  ,  for  a  velocity  of  light  c  =  2.9989  x  10  m  s  .  Since 
these  nuclear  reactions  are  supplying  the  Sun’s  absolute  luminosity  L0  = 
3.828  x  1026  J  s  1 ,  the  mass  loss  AMNR  in  one  second  is: 


AM 


NR 


Am 


m 


L 


© 


r^j 


C ‘ 


4.26  x  109  kg, 


(9.8) 
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and  over  the  past  4.6  billion  years,  assuming  the  Sun  has  always  been 
shining  at  the  same  rate,  the  total  mass  MNR  lost  by  nuclear  reactions  is: 

Mnr  =  4.6  x  109  x  3.156  x  107  x  AMNR  w  6.2  x  1026  kg  «  0.0003  M0. 

(9.9) 

This  mass  is  carried  off  by  the  radiation  that  makes  the  Sun  shine. 


9.2.3  Where  Do  the  Two  Solar  Winds  Come  From? 

Instruments  aboard  spacecraft  have  detected  two  solar  winds  with  different 
physical  properties.  There  is  a  fast  wind  that  moves  at  a  speed  of  about  750  km  s-1 
and  a  slow  wind  that  blows  at  about  half  that  speed.  The  high-speed  wind  is  steady 
and  uniform,  whereas  the  slow-speed  wind  is  variable  and  gusty. 

The  two  solar  winds  do  not  blow  uniformly  from  all  places  on  the  Sun;  instead 
they  depend  on  solar  latitude.  The  spatial  distribution  of  the  two  types  of  winds 
also  depends  on  the  Sun’s  magnetic  field  configuration,  which  varies  dramatically 
with  the  1 1  year  solar-activity  cycle. 

As  suggested  by  Sir  William  I.  Axford  (1933-2010),  the  steady,  uniform,  high¬ 
speed  wind  emanates  from  magnetically  open  configurations  in  the  corona  (Axford 
1985).  The  open  magnetic  fields  in  coronal  holes  provide  a  conduit  for  the  fast 
wind.  In  contrast  the  slow  wind  -  which  is  filamentary  and  transient  -  involves  the 
intermittent  release  of  material  from  previously  closed  magnetic  regions,  so  the 
slow  wind  may  not  be  treated  as  an  equilibrium  flow  in  a  steady  state. 

The  distribution  of  the  open  and  closed  magnetic  regions  on  the  Sun,  and 
therefore  the  places  of  origin  for  the  two  solar  winds,  depends  on  the  1 1  year  cycle 
of  solar  magnetic  activity. 

Near  activity  minimum,  the  high-speed  wind  moves  out  of  the  open  magnetic 
fields  in  large  coronal  holes  located  at  the  Sun’s  polar  regions.  A  slow,  gusty,  and 
variable  wind  then  moves  away  from  closed  magnetic  regions  near  the  Sun’s 
equator  (McComas  et  al.  2000). 

The  simple,  bimodal  distribution  of  fast  and  slow  wind  structures  disappears 
near  the  maximum  in  the  11  year  solar  activity  cycle.  The  large  polar  coronal 
holes  then  shrink  and  even  disappear  and  smaller  coronal  holes  appear  at  all  solar 
latitudes.  A  chaotic  and  complex  mixture  of  varying  solar  wind  flows  therefore  is 
found  at  all  solar  latitudes  near  activity  maximum  (McComas  et  al.  2003).  The 
slow  winds  still  seem  to  be  associated  with  closed  magnetic  structures,  such  as 
active  regions,  whereas  the  fast  winds  rush  out  of  the  interior  of  coronal  holes  all 
over  the  Sun.  Solar  active  regions,  with  their  explosive  behavior,  provide  a 
noticeable  third  source  for  the  solar  winds  near  the  activity-cycle  maximum. 
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9.2.4  Where  Does  the  Solar  Wind  End? 


How  far  does  the  solar  wind  extend,  and  where  does  its  influence  end?  The  solar 
wind  carves  out  a  cavity  in  the  interstellar  medium  known  as  the  heliosphere. 
Zurbuchen  (2007)  has  reviewed  the  coupling  of  the  Sun  and  the  heliosphere;  also 
see  Lang  (2009). 

Since  the  solar  wind  is  weakened  by  expansion,  thinning  out  as  it  moves  into  a 
greater  volume,  it  eventually  becomes  too  dispersed  to  repel  interstellar  forces. 
The  winds  are  no  longer  dense  or  powerful  enough  to  withstand  the  pressure  of  gas 
and  magnetic  fields  coursing  between  the  stars.  The  radius  of  this  celestial  standoff 
distance,  in  which  the  pressure  of  the  solar  wind  falls  to  a  value  comparable  to  the 
interstellar  pressure,  has  been  estimated  at  about  100  AU,  or  one  hundred  times  the 
mean  distance  between  the  Earth  and  the  Sun. 

The  radius  of  the  heliosphere  can  be  estimated  by  determining  the  standoff 
distance,  or  stagnation  point,  in  which  the  ram  pressure,  Pw ,  of  the  solar  wind  falls 
to  a  value  comparable  to  the  interstellar  pressure,  Pj.  As  the  wind  flows  outward, 
its  velocity  remains  nearly  constant,  while  its  density  decreases  as  the  inverse 
square  of  the  distance.  The  dynamic  pressure  of  the  solar  wind  therefore  also  falls 
off  as  the  square  of  the  distance,  and  we  can  use  the  solar-wind  properties  at  the 
Earth’s  distance  of  1  AU  to  infer  the  pressure,  Pws,  at  the  stagnation-point  dis¬ 
tance,  Rs.  Equating  this  to  the  interstellar  pressure  we  have: 


PWS  —  P\AU  X 


(9.10) 


where  the  number  density  of  the  solar  wind  near  the  Earth  is  about  N1AU  =  5 
million  particles  per  cubic  meter  and  the  velocity  there  is  about  V  1AU  =  500  km 
per  second. 

To  determine  the  distance  to  the  edge  of  the  solar  system,  Rs ,  we  also  need  to  know 
the  interstellar  pressure,  which  is  the  sum  of  the  thermal  pressure,  the  dynamic 
pressure,  and  the  magnetic  pressure  in  the  local  interstellar  medium.  Its  estimated 
value  results  in  Rs  =  100  AU  or  more,  well  beyond  the  orbits  of  the  major  planets. 

The  termination  shock  at  the  edge  of  the  solar  wind  has  now  been  measured 
from  the  Voyager  1  and  2  spacecraft  (Decker  et  al.  2005;  Stone  et  al.  2005,  2008). 

Instruments  aboard  the  twin  Voyager  1  and  2  spacecraft,  launched  in  1977  and 
now  cruising  far  beyond  the  outermost  planets,  have  approached  this  edge  of  the 
solar  system  from  different  directions.  Voyager  1  is  moving  in  the  northern 
hemisphere  of  the  heliosphere  and  Voyager  2  in  the  southern  hemisphere.  Voyager 
1  crossed  the  termination  shock  of  the  supersonic  flow  of  the  solar  wind  on 
December  16,  2004  at  a  distance  of  94  times  the  mean  distance  between  the  Earth 
and  the  Sun,  or  at  94  AU  from  the  Sun.  At  this  distance,  the  spacecraft’s  instru¬ 
ments  recorded  a  sudden  increase  in  the  strength  of  the  magnetic  field  carried  by 
the  solar  wind,  as  expected  when  the  solar  wind  slows  down  and  its  particles  pile 
up  at  the  termination  shock. 
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Voyager  2  crossed  the  termination  shock  on  August  30,  2007  at  a  distance  of  84 
AU  from  the  Sun.  It  appears  that  there  is  a  significant  north/south  asymmetry  in 
the  heliosphere,  likely  due  to  the  direction  of  the  local  interstellar  magnetic  field. 

Both  Voyager  1  and  2  have  therefore  now  crossed  into  the  vast,  turbulent 
heliosheath,  the  region  where  the  interstellar  gas  and  solar  wind  interact,  due  to  the 
reflection  and  deflection  of  the  solar- wind  ions  by  the  magnetized  wind  beyond  the 
heliosheath.  In  technical  terms,  the  solar- wind  ions  in  the  heliosheath  are  deflected 
by  magnetosonic  waves  reflecting  off  of  the  heliopause,  causing  the  ions  to  flow 
parallel  to  the  termination  shock  toward  the  heliotail. 

Both  Voyager  spacecraft  are  equipped  with  plutonium  power  sources  expected 
to  last  until  at  least  2020  and  perhaps  2025.  So  they  ought  to  eventually  measure 
the  heliopause  at  the  outer  edge  of  the  heliosheath.  It  is  the  place  where  interstellar 
space  begins. 

In  the  meantime,  the  Interstellar  Boundary  EXplorer,  abbreviated  IBEX ,  was 
launched  on  19  Oct  2009.  Instruments  on  this  spacecraft,  which  operates  in  Earth 
orbit,  detect  neutral,  or  unionized,  atoms  coming  from  the  termination  shock  and 
the  boundary  between  the  solar  wind  and  interstellar  space. 


9.3  Explosions  on  the  Sun 
9.3.1  Solar  Flares 

Suddenly,  without  warning,  a  part  of  the  Sun  explodes,  creating  a  solar  flare.  Some 
of  them  are  the  biggest  explosions  in  the  solar  system,  releasing  energy  of  ten 
million,  billion,  billion  joule,  or  10  J,  in  just  100  s.  This  is  comparable  in 
strength  to  20  million  nuclear  bombs  exploding  simultaneously,  and  each  with 
energy  of  100  Megatons  of  TNT. 

A  substantial  fraction  of  the  flare  energy  goes  into  accelerating  electrons  and 
protons  to  nearly  the  speed  of  light.  Some  of  these  high-energy  particles  are  hurled 
into  the  Sun,  briefly  raising  the  temperature  of  Earth- sized  regions  of  the  Sun  to 
more  than  10  million  K.  Other  accelerated  particles  are  tossed  out  into  inter¬ 
planetary  space  and  emit  intense  radio  and  x-ray  radiation. 

The  short-lived  solar  flares  unleash  their  energy  in  the  vicinity  of  sunspots, 
covering  just  a  few  tenths  of  a  percent  of  the  solar  disk.  These  incredible  explosive 
outbursts  become  more  frequent  and  violent  when  the  number  of  sunspots  is 
greatest;  several  solar  flares  can  be  observed  on  a  busy  day  near  the  maximum  of 
the  sunspot  cycle.  However,  they  are  not  caused  directly  by  sunspots;  solar  flares 
instead  are  powered  by  magnetic  changes  in  the  corona  above  sunspots. 

Although  it  emits  very  large  amounts  of  energy,  a  solar  flare  usually  releases 
less  than  1/1 ,000th  of  the  total  energy  radiated  by  the  Sun  every  second,  so  they 
are  only  minor  perturbations  in  the  combined  colors,  or  white  light,  of  the  Sun.  The 
first  record  of  a  solar  flare  detected  on  the  visible  solar  disk  therefore  did  not  occur 
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until  the  mid-nineteenth  century  -  on  September  1,  1859,  when  the  English 
astronomers  Richard  Christopher  Carrington  (1826-1875)  and  Richard  Hodgson 
(1804-1872)  independently  noticed  one  (Carrington  1860;  Hodgson  1860). 

A  new  perspective,  which  demonstrated  the  frequent  occurrence  of  solar 
explosions,  was  made  possible  when  flares  were  observed  in  the  red  Balmer  alpha 
emission  line  of  hydrogen  at  656.3  nm,  originating  in  the  chromosphere.  This 
wasn’t  possible  until  the  early  20th  century,  after  the  invention  of  instruments  that 
isolated  the  red  hydrogen-alpha  emission  from  the  Sun’s  intense  visible  light  at 
adjacent  wavelengths  in  the  solar  spectrum.  Such  observations  showed  that  at  the 
chromospheric  level  in  the  solar  atmosphere  a  solar  flare  consists  in  simplest  form 
as  two  extended,  parallel  flare  ribbons.  But  a  fundamental  understanding  of  the 
physical  processes  responsible  for  solar  flares  had  to  wait  until  they  were  detected 
at  invisible  radio  wavelengths  from  the  ground  and  in  x-rays  from  space. 

During  World  War  II  (1939-1945)  it  was  discovered  that  sudden,  intense  radio 
outbursts  from  the  Sun,  associated  with  solar  flares,  could  interfere  with  radio 
communications  and  radar  systems.  Soon  after  the  war  ended,  J.  Paul  Wild’s 
(1923-2008)  group  of  Australian  radio  astronomers  used  swept  frequency  radio 
receivers  to  show  that  some  flares  eject  particles  at  about  half  the  speed  of  light,  or 
about  1.5  x  108  m  s_1,  while  others  moved  at  about  1/ 100th  this  speed  or  at 

/r  _ i 

about  10  m  s  ,  and  were  attributed  to  shock  waves.  Wild  and  Smerd  (1972)  and 
Wild  et  al.  (1963)  provided  early  summaries  of  observations  of  solar  radio  bursts. 
Bastian  et  al.  (1998)  provided  a  more  recent  review  of  the  radio  emission  from 
solar  flares. 

Example:  Watching  flare-accelerated  electrons  and  shock  waves 

When  a  flare-associated  disturbance,  such  as  an  electron  beam  or  a  shock 
wave,  moves  through  the  coronal  plasma,  the  local  electrons  are  displaced 
with  respect  to  the  protons,  which  are  more  massive  than  the  electrons.  The 
electrical  attraction  between  the  electrons  and  protons  pulls  the  electrons 
back  in  the  opposite  direction,  and  an  oscillation  is  set  up  at  the  plasma 
frequency,  vp,  given  by  (Sect.  5.5): 

1/2 

=  8.98A/'2  Hz.  (9.11) 

for  an  electron  density  Ne,  where  the  electron  charge  e  =  1.6022  x  10-19 

_ O  1 

coulomb,  the  electron  mass  rae  =  9.1094  x  10  kg,  the  permittivity  of 
free  space  is  s0  =  8.8542  x  10“ 12  F  m  \  and  n  =  3.14159.  Note  that  the 
wavelength,  of  radiation  at  the  plasma  frequency  is  given  by 
Apvp  =  c  =  2.9979  x  108  m  s_1,  the  speed  of  light. 

Low  in  the  solar  corona,  where  Ne  k,  10  m  ,  the  plasma  frequency  is 
about  90  MHz,  at  radio  frequencies,  or  radio  wavelengths  of  about  3.3  m, 
which  can  be  observed  from  the  ground.  Since  the  plasma  frequency  decreases 
with  the  diminishing  coronal  electron  density  at  greater  distances  from  the  Sun, 
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where  the  corona  occupies  an  increasing  volume  of  space,  beams  or  shocks 
sent  out  from  a  solar  flare  excite  progressively  lower  plasma  frequencies.  Near 

/r  _ o 

the  Earth,  for  example,  Ne  %  5  x  10  m  and  the  plasma  frequency  is  about 
20  kHz  corresponding  to  a  wavelength  of  15  km. 

When  an  electron  density  model  of  the  solar  atmosphere  is  used,  the 
emission  frequency  can  be  related  to  height,  and  combined  with  the  time 
delays  observed  at  successively  lower  frequencies,  to  obtain  the  outward 
velocity  of  the  moving  disturbance.  For  a  type  III  radio  burst,  an  average  of 
about  0.4  c  is  determined.  The  slower  drift  associated  with  type  II  radio 
bursts  suggests  an  outward  motion  at  about  1,000  km  s-1  and  has  been 
attributed  to  shock  waves. 

Radiation  from  low-frequency  plasma  oscillations,  between  100  kHz  and 
10  MHz,  is  absorbed  in  the  Earth’s  atmosphere,  but  they  have  been  observed 
for  decades  using  instruments  aboard  spacecraft.  These  observations  can  be 
used  to  track  the  spiral  magnetic  field  that  guides  the  flare  electrons  as  they 
move  out  through  the  increasingly  rarefied  coronal  plasma. 


The  bulk  of  radiation  from  high- temperature  solar  flares  is  not  emitted  as  radio 
waves  but  instead  at  extreme  ultraviolet  and  x-ray  wavelengths,  where  they  can 
briefly  outshine  the  entire  Sun.  This  radiation  is  absorbed  in  the  Earth’s  atmo¬ 
sphere;  therefore,  astronomers  have  observed  it  from  outer  space,  beginning  with 
primitive  instruments  aboard  balloons  or  sounding  rockets  and  continuing  with 
increasingly  sophisticated  telescopes  in  many  satellites,  including  the  Yohkoh , 
Ulysses,  Wind ,  SOHO ,  ACE ,  TRACE ,  Hinode,  RHESSI ,  STEREO ,  and  SDO 
spacecraft  (Lang  2009). 

Why  does  a  solar  flare  occur?  What  triggers  the  instability  and  suddenly  ignites 
an  explosion  from  magnetic  fields  that  remain  unperturbed  for  long  intervals  of 
time?  Answers  to  these  questions  were  proposed  in  1960  by  the  Austrian-born 
American  Thomas  Gold  (1920-2004),  then  at  Cornell  University,  and  by  the 
Englishman  Fred  Hoyle  (1915-2001),  at  Cambridge  University,  when  they  showed 
how  solar  flares  could  be  powered  by  stressed  magnetic  loops  that  interact,  dis¬ 
sipating  their  energy  in  the  corona  (Gold  and  Hoyle  1960).  The  flares  are  triggered 
when  magnetized  coronal  loops  are  pressed  together,  driven  by  motions  beneath 
them,  meeting  to  touch  one  another  and  merge. 

Magnetic  fields  have  a  direction  associated  with  them  and  if  oppositely  directed 
magnetic  fields  are  pushed  together,  they  can  interact.  When  these  merging  magnetic 
fields  are  closed  coronal  loops,  they  will  break  open  to  release  magnetic  energy  in 
the  form  of  flare  heating  and  particle  acceleration.  The  magnetic  fields  are  not  broken 
permanently;  they  simply  reconnect  to  their  closed  state.  For  this  reason,  this 
merging  and  coupling  is  known  as  magnetic  reconnection  (Sweet  1969; 
Hirajama  1974). 

Benz  and  Giidel  (2010)  provided  a  review  of  magnetically  driven  flares  on  the 
Sun  and  other  stars,  whereas  Zweibel  and  Yamada  (2009)  reviewed  magnetic 
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Fig.  9.9  Solar  flare  model  A  solar  flare  is  powered  by  magnetic  energy  released  from  a 
magnetic  interaction  site  above  the  top  of  a  coronal  loop.  Electrons  are  accelerated  to  high  speed 
during  a  solar  flare,  generating  a  burst  of  radio  energy  as  well  as  impulsive  loop-top  hard  X-ray 
emission.  Some  of  these  nonthermal  electrons  are  channeled  down  the  loop  and  strike  the 
chromosphere  at  nearly  the  speed  of  light,  emitting  hard  X-rays  by  electron— ion  bremsstrahlung 
at  the  loop  footpoints.  When  beams  of  accelerated  protons  enter  the  dense,  lower  atmosphere, 
they  cause  nuclear  reactions  that  result  in  gamma-ray  spectral  lines  and  energetic  neutrons. 
Material  in  the  chromosphere  is  heated  very  quickly  and  rises  into  the  coronal  loop,  accompanied 
by  a  slow,  gradual  increase  in  soft  X-ray  radiation.  This  upwelling  of  heated  material  is  called 
chromospheric  evaporation  and  it  occurs  in  the  decay  phase  of  the  flare 


reconnection  in  astrophysical  plasmas.  Bhattacharjee  (2004)  discussed  magnetic 
reconnection  in  the  Earth’s  magneto  tail  and  the  solar  corona. 

So,  powerful  solar  flares  stem  from  the  interaction  of  coronal  loops.  These  loops 
are  always  moving  about,  and  often  are  brought  into  contact  by  these  movements. 
Magnetic  fields  coiled  up  in  the  solar  interior,  where  the  Sun’s  magnetism  is 
produced,  also  can  bob  into  the  corona  to  interact  with  preexisting  coronal  loops. 
In  either  case,  the  coalescence  leads  to  the  rapid  release  of  magnetic  energy 
through  magnetic  reconnection. 

Because  flares  apparently  originate  in  the  low  corona  and  the  ubiquitous 
coronal  loops  dominate  their  structure,  it’s  not  surprising  that  solar-flare  models 
involve  a  coronal  loop  (Fig.  9.9).  Magnetic  reconnection  triggers  the  release  of 
magnetic  energy  just  above  the  loop  top,  where  electrons  and  protons  are  accel¬ 
erated.  In  less  than  1  s,  electrons  are  accelerated  to  nearly  the  speed  of  light, 
producing  intense  radio  signals.  Protons  likewise  are  accelerated  to  high  speeds, 
and  both  the  electrons  and  protons  are  hurled  down  into  the  Sun  and  out  into  space. 

Instruments  aboard  spacecraft  have  observed  nuclear  reactions  and  the  creation 
of  anti-matter  during  solar  flares  (Chupp  1984;  Share  et  al.  2004;  Hurford  et  al. 
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2006).  When  protons  and  heavier  ions  are  accelerated  to  high  speed  during  solar 
flares,  and  beamed  downward  into  the  Sun,  they  slam  into  the  dense,  lower 
atmosphere,  shattering  the  nuclei  of  atoms  and  sometimes  tearing  energetic  neu¬ 
trons  out  of  them.  Many  of  these  neutrons  are  eventually  captured  by  ambient,  or 
non-flaring,  hydrogen  nuclei,  the  protons,  in  the  photosphere,  emitting  one  of  the 
Sun’s  strongest  gamma-ray  lines  at  2.223  MeV. 

Another  strong  gamma-ray  line  emitted  during  solar  flares  is  the  0.511  MeV 
pair-annihilation  line.  Positrons,  the  anti-matter  counterpart  of  electrons,  are 
released  during  the  decay  of  radioactive  nuclei  produced  when  flare-accelerated 
protons  and  heavier  nuclei  are  hurled  down  into  the  lower  solar  atmosphere  during 
a  solar  flare.  The  positrons  and  the  electrons  annihilate,  producing  radiation  at 
0.511  MeV,  which  is  the  energy  contained  in  the  entire  mass  of  a  nonmoving 
electron. 

Example:  Anti  matter  created  during  solar  flares 

Positrons,  or  positive  electrons,  are  the  anti-matter  particles  of  electrons,  and 
they  are  produced  during  solar  flares.  The  positrons,  denoted  e+ ,  are  inferred 
from  observations  of  the  pair- annihilation  reaction  during  solar  flares.  This 
reaction  is  written  as: 

e+  +  e~  =7  +  7,  (9.12) 

where  e~  denotes  an  electron  and  7  is  a  gamma-ray  photon.  The  energy,  E , 
of  this  photon  will  be  equal  to  the  energy  taken  to  completely  destroy  an 
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electron  of  rest  mass  me  =  9.1094  x  10  kg,  or  E  =  me  c  =  8.187  x 
10“ 14  J  =  0.511  MeV,  where  the  speed  of  light  c  =  2.9979  x 
108  m  s-1,  and  1  MeV  =  1.60217  x  10-13  J.  Since  the  positron  has  the 
same  mass  as  an  electron,  both  gamma-ray  protons  have  this  energy.  The 
0.511  MeV  line  has  been  observed  during  solar  flares  using  instruments 
aboard  spacecraft. 


Because  the  chromosphere  has  been  heated  very  rapidly  by  the  accelerated 
particles  that  were  hurled  down  into  it,  that  part  of  the  chromosphere  explodes,  or 
evaporates,  up  into  the  corona  to  release  the  excess  energy.  This  process  may 
include  the  gradual  release  of  energy  when  the  coronal  loop  relaxes  into  a  more 
stable  configuration  during  the  decay  phase  of  a  solar  flare. 


Example:  Chromospheric  evaporation 

Energetic  particles  are  hurled  down  into  the  chromosphere  during  a  solar 

•n 

flare,  heating  it  to  temperatures  of  T  =  20  million,  or  2  x  10  ,  K,  which  is 
slightly  hotter  than  the  center  of  the  Sun.  The  hot  gas  flows  back  up  into 
coronal  loops,  emitting  x-ray  radiation  during  the  decay  phase  of  the  solar 
flare.  The  photon  energy  of  the  x-rays  is  roughly  equal  to  the  thermal  energy 
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of  the  gas,  or  3kT/2  =  4.14  x  10  J,  where  the  Boltzmann  constant 

OQ  1 

k  =  1.38065  x  10  ^  J  K  .  X-ray  astronomers  like  to  specify  this  energy 

_  1  z 

in  keV,  where  1  keV  =  1.602  x  10  J,  and  chromospheric  evaporation  is 
detected  at  x-ray  photon  energies  of  about  2.6  keV. 


9.3.2  Coronal  Mass  Ejections 

Coronal  mass  ejections  are  gigantic  magnetic  bubbles  that  can  rush  away  from  the 
Sun  at  supersonic  speeds,  expanding  to  become  larger  than  the  Sun  in  a  few  hours 
(Figs.  9.10,  9.11).  They  carry  about  10  kg  or,  10  billion  tons,  of  material  out  into 
space,  produce  intense  shock  waves,  and  accelerate  vast  quantities  of  energetic 
particles  in  interplanetary  space.  A  coronal  mass  ejection  moves  through  space  at  a 

_  1  O  A 

speed  of  about  400  km  s  ,  carrying  a  kinetic  energy  of  about  10  J,  which  is 
comparable  to  the  explosive  energy  of  a  large  solar  flare.  When  directed  at  the 
Earth,  a  coronal  mass  ejection  arrives  at  the  planet  about  four  days  after  being 
ejected  from  the  Sun  (Focus  9.4). 


Fig.  9.10  Coronal  mass  ejection  A  huge  coronal  mass  ejection  is  seen  in  this  image,  taken  on 
27  Feb  2000  with  a  coronagraph  on  the  SOHO  spacecraft.  The  white  circle  denotes  the  edge  of 
the  Sun’s  visible  disk,  so  this  mass  ejection  is  about  twice  as  large  as  the  Sun.  The  dark  area 
corresponds  to  the  occulting  disk  of  the  coronagraph  that  blocks  intense  sunlight  and  permits  the 
overlying  solar  atmosphere,  or  corona,  to  be  seen.  (Courtesy  of  the  SOHO  LASCO  consortium, 
SOHO  is  a  project  of  international  cooperation  between  ESA  and  NASA.) 
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Focus  9.4  Physical  properties  of  coronal  mass  ejections 

Coronal  mass  ejections  are  detected  as  localized  brightness  increases  in 
white-light  coronagraph  images.  Integration  of  the  brightness  increase,  that 
depends  only  on  the  electron  density,  Ne ,  permits  evaluation  of  the  total 
mass,  M,  of  the  ejection.  For  a  sphere  of  radius,  R ,  we  have: 

M  =  4nR3Nemp/3,  (9.13) 

_ ry~j 

where  n  k,  3.14159  and  the  proton  mass  mp  =  1.6726  x  10  kg.  The 
corona  is  a  fully  ionized,  predominantly  (90  %)  hydrogen,  plasma,  so  the 
number  density  of  protons  and  electrons  are  equal,  but  since  the  protons  are 
1,836  times  more  massive  than  the  electrons,  the  protons  dominate  the  mass. 

1  Q  _ Q 

For  a  mass  ejection  with  an  electron,  or  proton,  density  of  Ne  =  10  m  , 

o 

that  has  grown  as  large  as  the  Sun,  with  a  radius  of  R  =  6.955  x  10  m,  this 
expression  gives 

Mk2x  1013  kg,  (9.14) 

or  about  20  billion  tons. 

At  the  rate  of  one  ejection  per  day,  and  10  kg  per  ejection,  this  amounts 

o  _ 1 

to  a  mass  flow  rate  of  about  2  x  10  kg  s  ,  since  there  are  86,400  s  per  day. 
By  way  of  comparison,  the  solar  wind  flux  observed  in  the  ecliptic  at 

i  o  _ o  _ i 

the  orbit  of  the  Earth  is  about  6  x  10  “  protons  m  s  ,  or  about 
10“ 14  kg  m-2  s'1  (Table  9.3).  If  this  flux  is  typical  of  that  over  the  entire 

Sun-centered  sphere,  with  an  average  Sun-Earth  distance  of  D  =  1 

11  2 
AU  =  1.496  x  1011  m,  we  can  multiply  by  the  sphere’s  surface  area,  4nD~, 

to  obtain  a  solar  wind  mass  flow  rate  of  about  2.8  x  109  kg  s  '.  That’s 
roughly  15  times  the  mass  flow  rate  from  coronal  mass  ejections,  or  in  other 
words  the  coronal  mass  ejection  rate  is  about  5  %  that  of  the  steady,  per¬ 
petual  solar  wind,  and  for  just  a  relatively  brief  time  during  the  ejection. 

The  kinetic  energy,  KE ,  of  a  coronal  mass  ejection  with  a  speed  of 
V  =  400  km  s-1  and  a  mass  M  =  1013  kg  is: 

KE  =  MV2/ 2  «  1024  J.  (9.15) 

9 1 

This  is  comparable  to  the  energies  of  large  solar  flares  that  lie  between  10^ 
and  1025  J. 

At  a  speed  of  V  =  400  km  s  ,  the  time,  T,  to  travel  from  the  Sun  to  the 
Earth,  at  an  average  distance  of  D  =  AU  =  1.496  x  1011  m  is: 

T  =  D/V  «  3.74  x  105  s  «  4.3  days,  (9.16) 

where  1  day  =  86,400  s.  Zhang  and  Low  (2005)  have  reviewed  the 
hydromagnetic  nature  of  solar  coronal  mass  ejections. 


9.3  Explosions  on  the  Sun 

Fig.  9.11  Model  of  coronal 
mass  ejection  A  magnetic 
reconnection  takes  place  at  a 
current  sheet  (dark  vertical 
line )  beneath  a  prominence 
and  above  closed  magnetic 
held  lines.  The  coronal  mass 
ejection  (CME)  traps  hot 
plasma  below  it  (, shaded 
region).  The  solid  curve  at 
the  top  is  the  bow  shock 
driven  by  the  CME.  The 
closed  held  region  above  the 
prominence  ( center )  is 
supposed  to  become  a  hux 
rope  in  the  interplanetary 
medium.  [Adapted  from 
Martens  and  Kuin  (1989).] 
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The  physical  size  of  the  mass  ejections  dwarfs  that  of  solar  flares  and  even  the 
active  regions  in  which  flares  occur.  However,  like  solar  flares,  the  rate  of 
occurrence  of  coronal  mass  ejections  varies  in  step  with  the  11  year  cycle  of  solar 
magnetic  activity,  ballooning  out  of  the  corona  several  times  a  day  during  activity 
maximum.  Large  coronal  mass  ejections  can  occur  with  or  without  a  solar  flare, 
but  they  both  appear  to  be  powered  by  the  abrupt  release  of  the  corona’ s  magnetic 
energy,  with  threatening  effects  for  the  Earth  and  nearby  space. 


9.4  Space  Weather 

9.4.1  Earth’s  Protective  Magnetosphere 

Our  planet  is  immersed  within  the  hot,  electrically  charged  solar  wind  that  blows 
out  from  the  Sun  in  all  directions  and  never  stops,  carrying  with  it  a  magnetic  field 
rooted  in  the  Sun.  Solar  flares  and  coronal  mass  ejections  create  powerful  gusts  in 
the  Sun’s  winds,  producing  space  weather  -  the  cosmic  equivalent  of  a  terrestrial 
blizzard  or  hurricane.  Fortunately,  we  are  protected  from  the  full  force  of  this 
relentless,  stormy  gale  by  the  Earth’s  magnetic  field. 

William  Gilbert  (1544-1603),  physician  to  Queen  Elizabeth  I  of  England, 
authored  a  treatise  in  Latin,  with  the  grand  title  De  Magnete ,  Magneticisque 
Corporibus ,  et  de  Magno  Magnete  Tellure ,  which  translated  into  English  is 
Concerning  Magnetism ,  Magnetic  Bodies ,  and  the  Great  Magnet  Earth  (Gilbert 
1600).  In  this  work,  which  is  still  available  in  an  English  version,  Gilbert  showed 
that  the  Earth  is  itself  a  great  magnet,  which  explains  the  orientation  of  compass 
needles.  It  is  as  if  there  was  a  colossal  bar  magnet  at  the  center  of  the  Earth. 
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At  the  Equator,  the  two  ends  of  a  compass  needle  point  north  or  south,  toward 
the  Earth’s  magnetic  poles.  At  each  magnetic  pole,  the  needle  stands  upright, 
pointing  into  or  out  of  the  ground.  In  between,  at  intermediate  latitudes,  the 
compass  needle  points  north  or  south  with  a  downward  dip  of  one  end  but  not 
vertically  as  at  a  pole.  Because  the  geographic  poles  are  located  near  the  magnetic 
ones,  a  compass  needle  is  aligned  in  the  north-south  direction.  We  usually  put  an 
arrow  on  the  north  end  of  the  needle;  therefore,  an  arrowed  compass  points  north. 

We  can  describe  the  Earth’s  magnetism  by  invisible  magnetic  field  lines,  which 
orient  compass  needles.  These  lines  of  magnetic  force  emerge  from  the  south 
magnetic  pole,  loop  through  nearby  space  and  reenter  at  the  north  magnetic  pole. 
The  lines  are  close  together  near  the  magnetic  poles  where  the  magnetic  force  is 
strong,  and  spread  out  above  the  Earth’s  Equator  where  the  magnetism  is  weaker 
than  at  the  poles.  We  cannot  see  the  invisible  magnetic  field  lines,  but  compass 
needles  point  along  them,  and  other  instruments  can  be  used  to  measure  their 
strength. 

The  magnetic  field  strength  at  the  Earth’s  magnetic  equator  is  0.0000305  tesla, 
or  0.305  x  10  4  tesla.  Measurements  of  the  surface  magnetic  fields  of  the  Earth 
show  stronger  fields  near  the  poles  where  the  magnetic  field  lines  congregate,  at 
roughly  twice  the  strength  of  the  field  at  the  Equator.  Although  these  fields 
decrease  in  strength  as  the  inverse  cube  of  the  distance,  they  remain  strong  enough 
to  divert  most  of  the  solar  wind  around  the  Earth  at  a  distance  far  above  the 
atmosphere,  thereby  protecting  humans  on  the  ground  from  possibly  lethal  solar 
particles. 

When  any  charged  particle  encounters  a  magnetic  field,  it  must  change  direc¬ 
tion,  moving  away  from  or  around  the  magnetism.  When  the  protons  and  electrons 
in  the  gusts  or  steady  flow  of  the  solar  wind  encounter  the  Earth’s  magnetic  fields, 
they  are  deflected  around  it,  like  a  rock  in  a  stream  or  a  windshield  deflecting  air 
around  an  automobile. 

The  Earth’s  magnetic  fields  hollow  out  a  protective  cavity  in  the  solar  wind, 
which  is  called  the  magnetosphere ,  a  term  coined  in  1959  by  Austrian-born 
American  Thomas  Gold  (1920-2004),  then  at  Cornell  University  (Gold  1959a,  b). 
It  is  that  region  surrounding  any  planet  in  which  its  magnetic  field  dominates  the 
behavior  of  electrically  charged  particles,  such  as  electrons,  protons,  and  other 
ions. 

The  dipolar  (two  poles)  magnetic  configuration  applies  near  the  surface  of  the 
Earth,  but  farther  out,  the  magnetic  field  is  distorted  by  the  Sun’s  perpetual  wind. 
Although  it  is  exceedingly  tenuous,  the  solar  wind  is  powerful  enough  to  mold  the 
outer  edges  of  the  Earth’s  magnetosphere  into  a  changing  asymmetric  shape 
(Fig.  9.12). 

The  solar  wind  usually  bends  around  the  Earth’s  magnetic  field  at  a  distance 
from  the  Earth’s  center  of  about  10  times  the  Earth’s  radius  on  the  dayside  that 
faces  the  Sun  (Focus  9.5).  Here,  the  solar  wind  pushes  the  Earth’s  magnetism  in, 
compressing  its  outer  magnetic  boundary  and  forming  a  shock  wave,  shaped  like 
waves  that  pile  up  ahead  of  the  bow  of  a  moving  ship  and  resembling  the  flow  of 
air  around  a  supersonic  aircraft.  After  forming  this  bow  shock,  the  solar  wind  is 
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Fig.  9.12  Elements  of  the  magnetosphere  The  Earth’s  magnetic  held  carves  out  a  hollow  in  the 
solar  wind,  creating  a  protective  cavity  called  the  magnetosphere.  A  bow  shock  forms  at  about  10 
Earth  radii  on  the  sunlit  side  of  our  planet.  The  location  of  the  bow  shock  is  highly  variable 
because  it  is  pushed  in  and  out  by  the  gusty  solar  wind.  The  magnetopause  marks  the  outer 
boundary  of  the  magnetosphere,  at  the  place  where  the  solar  wind  takes  control  of  the  motions  of 
charged  particles.  The  solar  wind  is  deflected  around  the  Earth,  pulling  the  terrestrial  magnetic 
held  into  a  long  magnetotail  on  the  nightside.  Plasma  in  the  solar  wind  is  deflected  at  the  bow 
shock  (left),  hows  along  the  magnetopause  into  the  magnetic  tail  (right),  and  then  can  be  injected 
back  toward  the  Earth  within  the  plasma  sheet  (center).  The  Earth,  its  auroras,  atmosphere,  and 
ionosphere,  and  the  two  Van  Allen  radiation  belts  all  lie  within  this  magnetic  cocoon 

deflected  around  the  Earth,  pulling  the  terrestrial  magnetic  field  into  a  long 
magnetotail  on  the  night  side.  Thus,  the  Earth’s  magnetosphere  is  not  precisely 
spherical.  It  has  a  bow  shock  facing  the  Sun  and  a  long  magnetotail  in  the  opposite 
direction.  The  term  magnetosphere  therefore  does  not  refer  to  form  or  shape  but 
instead  implies  a  sphere  of  influence. 


Focus  9.5  Planetary  magnetospheres 

Six  planets  are  known  to  have  magnetospheres.  The  size  of  the  magneto¬ 
sphere,  on  the  day  side  facing  the  Sun,  is  determined  by  the  distance,  RMp , 
along  the  planet-Sun  line  at  which  the  pressure  of  the  planetary  magnetic 
field  balances  the  dynamic  ram  pressure  of  the  solar  wind. 

The  ram  pressure,  PR ,  exerted  on  a  stationary  body  by  a  gas,  or  fluid,  of 
mass  density,  p ,  moving  at  a  velocity,  V ,  is  given  by: 

PR  =  pV2.  (9.17) 
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This  expression  can  also  be  used  for  the  ram  pressure  exerted  on  a  body  that 
is  moving  at  this  velocity  through  a  gaseous  or  fluid  medium. 

For  the  solar  wind,  we  can  use: 


PR  =  mPNpV2,  (9.18) 

where  the  proton  mass  mP  =  1.6726  x  10  ^  kg,  the  NP  is  the  number 
density  of  protons  at  the  distance  from  the  Sun  that  is  under  consideration, 
and  V  is  the  solar  wind  velocity  at  that  distance. 

The  magnetic  pressure,  PB ,  at  the  surface  of  the  planet  is  given  by: 

Bl 

P,  =  ^.  (9.19) 

_ ^7 

where  B0  is  the  equatorial  magnetic  field  strength,  /a0  =  4ti  x  10~;  N 

7  /:  7 

A  =  1.2566  x  10  N  A  ~  is  the  permeability  of  free  space.  Since  the 
dipole’s  magnetic  field  strength  falls  off  as  the  cube  of  the  distance  from  the 
planet,  the  magnetic  pressure  decreases  as  the  sixth  power  of  that  distance. 

The  standoff  distance,  RMp ,  from  the  planet  at  which  the  two  pressures  are 
equal,  or  where  PR  =  PB ,  therefore  occurs  when: 


d 6  d2 
0 

^tip 


mpNpV 2, 


where  the  planet’s  radius  is  RP.  Solving  for  RMP  we  have: 


(9.20) 


(9.21) 


Example:  Solar- wind  bow  shock  distance  for  the  Earth  and  Jupiter 

When  the  solar  wind  encounters  the  Earth  the  number  density  of  protons  is 

/I  _ Q 

NP  «  5  x  10  m  ,  and  the  average  solar  wind  velocity  is  about 
V  =  600  km  s-1  (see  Table  9.3).  The  equatorial  magnetic  field  strength  of 
the  Earth  is  B0E  =  3  x  10-5  tesla.  Substituting  these  numbers  into  the 
equation  for  the  standoff  point,  where  the  solar  wind  ram  pressure  equals  the 
Earth’s  magnetic  pressure,  gives  RME  «  7 RE,  or  seven  times  the  Earth’s 
radius. 

The  values  of  RMP  for  the  other  planets  can  be  inferred  by  noting  that  the 
solar  wind  number  density,  A,  falls  off  with  the  inverse  cube  of  the  distance 
of  the  planet  from  the  Sun,  while  the  solar  wind  velocity  remains  relatively 
constant.  The  giant  planets  have  stronger  magnetic  fields  than  the  Earth,  so 
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the  bow  shock  distances  for  Jupiter  and  Saturn,  for  example,  are  42  and  19 
times  the  radius  of  these  planets.  The  planet  Venus  has  no  global  dipolar 
magnetic  field,  but  the  solar  wind  can  be  diverted  around  this  planet  by  its 
thick  atmosphere. 

The  planet  Jupiter  has  an  equatorial  magnetic  field  strength  of  B0J  =  4.28 
tesla,  with  a  bow  shock  distance  of  RMJ  =  42  Rj  «  3.0  x  109  m,  where  the 

'-j 

radius  of  Jupiter  is  =  7.15  x  10  m.  So,  the  magnetosphere  of  Jupiter  is 
larger  than  the  Sun,  whose  radius  is  RQ  =  6.955  x  108  m.  The  solar  wind 
proton  density  is  diminished  as  it  spreads  out  to  a  greater  volume  between 
the  Earth  and  Jupiter. 


The  Earth’s  magnetic  shield  is  so  perfect  that  only  0.1  %  of  the  mass  of  the 
solar  wind  that  hits  it  manages  to  penetrate  inside.  Yet,  even  that  small  fraction  of 
wind  particles  has  a  profound  influence  on  the  Earth’s  nearby  environment  in 
space;  they  create  an  invisible  world  of  energetic  particles  and  electric  currents 
that  flow,  swirl,  and  encircle  the  Earth. 


9.4.2  Trapped  Particles 

One  of  the  first  scientific  discoveries  of  the  Space  Age  was  the  finding,  by  James 
A.  Van  Allen  (1914-2006)  and  his  students,  of  high-energy  electrons  and  protons 
that  girdle  the  Earth  far  above  the  atmosphere  (Van  Allen  et  al.  1959).  They  move 
within  two  belts  that  encircle  the  Earth’s  magnetic  equator  but  do  not  touch  it,  like 
a  gigantic,  invisible,  torus-shaped  doughnut. 

These  regions  sometimes  are  called  the  inner  and  outer  Van  Allen  radiation 
belts.  Van  Allen  used  the  term  “radiation  belt”  because  the  charged  particles  were 
then  known  as  corpuscular  radiation;  the  nomenclature  does  not  imply  either 
electromagnetic  radiation  or  radioactivity.  The  radiation  belts  lie  within  the  inner 
magnetosphere  at  distances  of  1.5  and  4.5  Earth  radii  from  the  center  of  the  Earth, 
and  contain  high-speed  electrons  and  protons. 

In  1907,  about  a  half-century  before  the  discovery  of  radiation  belts,  the 
Norwegian  geophysicist  Carl  Stprmer  (1874-1957)  showed  how  electrons  and 
protons  could  be  almost  permanently  confined  and  suspended  in  space  by  the 
Earth’s  dipolar  magnetic  field  (Stprmer  1907,  1955).  An  energetic  charged  particle 
moves  around  the  magnetic  fields  in  a  spiral  path  toward  one  magnetic  pole.  Its 
trajectory  becomes  more  tightly  coiled  in  the  stronger  magnetic  fields  close  to  a 
magnetic  pole,  where  the  intense  polar  fields  act  like  a  magnetic  mirror,  turning  the 
particle  around  so  that  it  moves  back  toward  the  other  pole. 

Thus,  the  electrons  and  protons  bounce  back  and  forth  between  the  north  and 
south  magnetic  poles.  It  takes  about  1  min  for  an  energetic  electron  to  make  one 
trip  between  the  two  polar  mirror  points.  The  spiraling  electrons  also  drift 
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eastward,  completing  one  trip  around  the  Earth  in  about  a  half-hour.  There  is  a 
similar  drift  for  protons  but  in  the  westward  direction.  The  bouncing  can  continue 
indefinitely  for  particles  trapped  in  the  Earth’s  radiation  belts,  until  the  particles 
collide  with  one  another  or  some  external  force  distorts  the  magnetic  fields. 

The  problem  at  the  time  that  Stprmer  developed  his  theory  was  that  there  was 
no  mechanism  known  to  allow  electrically  charged  particles  into  the  dipolar 
magnetic  field.  After  all,  if  electrons  and  protons  cannot  leave  the  magnetic  cage, 
how  could  they  get  into  it  in  the  first  place?  The  answer  is  the  solar  wind.  They  can 
arrive  via  the  solar  wind  and  penetrate  the  Earth’s  magnetic  defense  through  a 
temporary  opening  in  it. 

The  solar  wind  carries  the  Sun’s  magnetic  field  with  it,  and  the  solar  magnetism 
is  draped  around  the  magnetosphere  when  encountering  it.  As  postulated  by  the 
English  physicist  James  Dungey  (1923-  ),  a  solar  magnetic  field  can  open  up  the 
Earth’s  magnetic  field  when  the  two  fields  are  pointing  in  opposite  directions  when 
they  touch  (Dungey  1961).  With  this  orientation  they  can  join  one  another  and 
become  linked,  similar  to  how  the  opposite  poles  of  two  toy  magnets  stick  toge¬ 
ther.  The  merging  process,  known  as  magnetic  reconnection ,  can  create  an 
opening  in  the  Earth’s  magnetic  field,  forming  a  portal  through  which  the  solar 
particles  can  flow.  Tons  of  high-energy  particles  may  then  flow  into  the  magne¬ 
tosphere  through  the  opening  before  it  closes  again. 


9.4.3  Earth’s  Magnetic  Storms 

The  Earth’s  magnetic  field  can  be  compressed  and  distorted  when  a  coronal  mass 
ejection  arrives  at  the  Earth.  The  ejections  have  magnetic  field  strengths  of  about 
30  nT,  or  3  x  10-8  T,  so  the  much  stronger  terrestrial  magnetic  field,  which  is 
about  a  thousand  times  more  intense,  at  3  x  10-5  T,  usually  provides  good  pro¬ 
tection  from  them.  However,  if  the  magnetic  fields  of  a  coronal  mass  ejection  and 
of  the  Earth  are  pointing  in  opposite  directions  when  they  meet,  the  two  fields 
become  linked,  resulting  in  intense  geomagnetic  storms  that  cause  compass  nee¬ 
dles  to  swing  widely. 

The  flow  of  currents  associated  with  these  great  magnetic  storms  can  interfere 
with  electrical  power  grids  here  on  the  Earth,  creating  voltage  surges  on  long¬ 
distance  power  lines  and  overheating  or  melting  the  windings  of  transformers. 
They  can  send  cities  into  complete  darkness,  especially  in  high-latitude  regions 
where  the  currents  are  strongest,  such  as  Canada,  the  northern  United  States  and 
Scandinavia.  This  doesn’t  occur  often,  perhaps  once  a  year;  however,  the  potential 
consequences  are  serious  enough  to  employ  early  warning  systems. 
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Fig.  9.13  Magnetic  spiral  The  trajectory  of  flare  electrons  in  interplanetary  space  as  viewed 
from  above  the  Sun’s  polar  regions  using  the  Ulysses  spacecraft.  The  squares  and  crosses  show 
Ulysses  radio  measurements  of  Type  III  radio  bursts.  As  the  high-speed  electrons  move  out  from 
the  Sun  into  progressively  more  tenuous  plasma,  they  excite  radiation  at  successively  lower 
plasma  frequencies.  The  numbers  denote  the  observed  frequency  in  kilohertz  (kHz).  Because  the 
flaring  electrons  are  forced  to  follow  the  interplanetary  magnetic  field,  they  do  not  move  in  a 
straight  line  out  from  the  Sun  but  instead  travel  along  the  spiral  pattern  of  the  interplanetary 
magnetic  field,  shown  by  the  solid  curved  lines.  The  magnetic  fields  are  drawn  out  into  space  by 
the  radial  solar  wind  and  remain  attached  at  one  end  to  the  rotating  Sun.  The  locations  of  the 
orbits  of  Mercury,  Venus,  and  the  Earth  are  shown  as  circles.  (Courtesy  of  Michael  J.  Reiner. 
Ulysses  is  a  project  of  international  collaboration  between  ESA  and  NASA.) 


9.4.4  Solar  Explosions  Threaten  Humans  in  Outer  Space 

When  directed  at  our  planet  or  at  humans  in  deep  space,  both  solar  flares  and 
coronal  mass  ejections  produce  dangerous  gusts  and  squalls  in  the  Sun’s  winds. 
Here  on  the  ground,  we  are  shielded  from  many  of  the  effects  by  the  Earth’s 
atmosphere  and  magnetic  fields,  but  out  in  space  there  can  be  no  protection,  and 
both  humans  and  satellites  are  vulnerable. 

Energetic  charged  particles  generated  during  a  solar  flare  threaten  our  planet 
only  if  the  flare  occurs  at  the  right  place  on  the  Sun  -  that  is,  at  one  end  of  the 
spiral  magnetic  field  that  connects  the  Sun  to  the  Earth.  Given  this  circumstance, 
when  a  flare  occurs  near  the  west  limb  and  the  solar  equator,  the  magnetic  spiral 
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guides  the  high-speed  charged  particles  that  can  threaten  astronauts  or  satellites. 
The  spiral  magnetic  pattern,  produced  when  the  solar  wind  carries  the  rotating 
Sun’s  magnetic  field  into  surrounding  space,  has  been  confirmed  by  tracking  the 
radio  emission  of  charged  particles  thrown  out  during  solar  flares,  as  well  as  by 
spacecraft  that  have  sampled  the  interplanetary  magnetism  near  the  Earth 
(Fig.  9.13). 

As  suggested  by  Cornell  astrophysicist  Thomas  Gold  (1920-2004),  closed 
magnetic  fields  can  be  ejected  from  the  Sun,  generating  shocks  as  they  move  into 
interplanetary  space  (Gold  1959a,  b).  And  when  a  coronal  mass  ejection  travels  out 
into  space,  it  can  take  the  form  of  a  magnetic  cloud  that  moves  behind  an  inter¬ 
planetary  shock  (Fig.  9.14).  The  mass  ejection  plows  into  the  slower-moving  solar 
wind,  driving  huge  shock  waves  that  cross  magnetic  field  lines  and  accelerate 
particles  as  they  go.  Following  the  shocks,  the  magnetic  cloud  can  remain  attached 
magnetically  to  the  Sun,  carrying  its  looping  magnetic  fields  all  the  way  to  the 
Earth. 

Because  of  their  higher  mass,  it  is  solar-flare  protons  rather  than  electrons  that 
provide  the  greatest  threat  to  humans  in  outer  space.  Such  high-speed  protons, 
called  solar  energetic  particles,  can  endanger  the  health  and  even  the  lives  of 
astronauts  when  they  are  in  outer  space,  unprotected  by  the  Earth’s  magnetic  field 
that  deflects  the  charged  particles. 

High-energy  protons  from  a  solar  flare  or  coronal  mass  ejection  easily  can 
pierce  a  spacesuit,  causing  damage  to  human  cells  and  tissues,  and  even  threaten 
the  life  of  unprotected  astronauts,  who  venture  into  space  to  unload  spacecraft 
cargo,  construct  a  space  station  or  walk  on  the  Moon,  which  has  no  global  mag¬ 
netic  field  or  atmosphere,  or  Mars,  which  has  no  magnetosphere  and  only  a  thin 
atmosphere.  So,  solar  astronomers  keep  careful  watch  over  the  Sun  during  space 
missions  to  warn  of  possible  activity  occurring  in  just  the  wrong  place  or  time. 


9.4.5  Disrupting  Communication 

Eight  minutes  after  an  energetic  solar  flare,  a  strong  blast  of  x-rays  and  extreme 
ultraviolet  radiation  reaches  the  Earth  and  radically  alters  the  structure  of  the 
planet’s  upper  atmosphere,  known  as  the  ionosphere,  altering  its  ability  to  reflect 
radio  waves.  During  even  moderately  intense  flares,  long-distance  radio 
communications  can  be  silenced  temporarily  over  the  Earth’s  entire  sunlit 
hemisphere.  These  radio  blackouts  are  particularly  troublesome  for  the  com¬ 
mercial  airline  industry,  which  uses  radio  transmissions  for  weather,  air  traffic, 
and  location  information.  The  U.S.  Air  Force  operates  a  global  system  of 
ground-based  radio  and  optical  telescopes  and  taps  into  the  output  of  national 
space-borne  x-ray  telescopes  and  particle  detectors  to  continuously  monitor  the 
Sun  for  intense  flares  that  might  severely  disrupt  military  communications  and 
satellite  surveillance. 
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Fig.  9.14  Magnetic  cloud 

When  a  coronal  mass  ejection 
(CME)  travels  into 
interplanetary  space,  it  can 
create  a  huge  magnetic  cloud 
containing  beams  of  electrons 
that  flow  in  opposite 
directions  within  the 
magnetic  loops  that  are 
rooted  at  both  ends  in  the 
Sun.  The  magnetic  cloud  also 
drives  a  shock  ahead  of  it. 
Magnetic  clouds  are  present 
only  in  a  subset  of  observed 
interplanetary  coronal  mass 
ejections.  (Courtesy  of 
Deborah  Eddy  and  Thomas 
Zurbuchen.) 


Shook 


Although  solar  flares  do  not  affect  short-wavelength  microwave  signals  that 
pass  right  through  the  ionosphere  to  communication  satellites,  solar  explosions  can 
destroy  the  satellites. 


9.4.6  Satellites  in  Danger 

More  than  1,000  commercial,  military,  and  scientific  satellites  are  now  in  opera¬ 
tion,  affecting  the  lives  of  millions  of  people.  And  the  performance  and  lifetime  of 
all  of  these  satellites  are  affected  by  Sun-driven  space  weather. 

Geosynchronous  satellites,  which  orbit  the  Earth  at  the  same  rate  that  the  planet 
spins,  stay  above  the  same  place  on  the  Earth  to  relay  and  beam  down  signals  used 
for  cellular  phones,  global  positioning  systems,  and  Internet  commerce  and  data 
transmission.  They  are  endangered  by  the  coronal  mass  ejections  that  cause  intense 
geomagnetic  storms.  These  satellites  orbit  our  planet  at  about  6.6  Earth  radii,  or 
about  4,200  km,  moving  around  the  Earth  once  every  24  h.  A  coronal  mass 
ejection  can  compress  the  Earth’s  protective  magnetic  fields  from  their  usual 
location  at  about  10  Earth  radii  above  the  equator  to  below  the  satellites’  syn¬ 
chronous  orbits,  exposing  them  to  the  full  brunt  of  the  gusty  solar  wind  and  its 
charged,  energized  ingredients. 

Other  satellites  revolve  around  our  planet  in  closer,  low-Earth  orbits  at  altitudes 
of  300  to  500  km,  scanning  the  air,  land,  and  sea  for  environmental  change, 
weather  forecasting  and  military  reconnaissance.  Space  weather  can  increase  the 
atmospheric  friction  exerted  on  these  satellites,  causing  their  orbits  to  decay  more 
quickly  than  expected.  The  enhanced  extreme  ultraviolet  and  x-ray  radiation  from 
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solar  flares  heats  the  atmosphere  and  causes  it  to  expand;  similar  or  greater  effects 
are  caused  by  coronal  mass  ejections.  The  expansion  of  the  terrestrial  atmosphere 
brings  higher  gas  densities  to  a  given  altitude,  increasing  the  friction  and  drag 
exerted  on  a  satellite,  pulling  it  to  a  lower  altitude,  and  sometimes  causing  ground 
controllers  to  lose  contact  with  them.  Space  stations,  for  example,  periodically 
must  be  boosted  in  altitude  to  a  higher  orbit  to  avoid  a  similar  fate. 

Infrequent,  anomalously  large  eruptions  on  the  Sun  can  hurl  energetic  protons 
toward  the  Earth  and  elsewhere  in  space.  The  solar  protons  can  enter  a  spacecraft, 
producing  erroneous  commands  and  crippling  their  microelectronics.  Such  single¬ 
event  upsets  already  have  destroyed  at  least  one  weather  satellite  and  disabled 
several  communications  satellites.  However,  to  put  the  space- weather  threat  in 
perspective,  only  a  few  commercial  satellites  have  been  lost  to  storms  from  the 
Sun  out  of  thousands  deployed.  The  U.S.  military  builds  satellites  that  can  with¬ 
stand  the  effects  of  a  nuclear  bomb  exploded  in  space. 


9.4.7  Forecasting  Space  Weather 

Recognizing  our  vulnerability,  astronomers  use  telescopes  on  the  ground  and 
in  situ  particle  detectors  or  remote-sensing  telescopes  on  satellites  to  carefully 
monitor  the  Sun,  and  government  agencies  post  forecasts  that  warn  of  threatening 
solar  activity.  This  enables  evasive  action  that  can  reduce  disruption  or  damage  to 
communications,  defense  and  weather  satellites,  as  well  as  electrical  power  sys¬ 
tems  on  the  ground.  Once  it  is  known  that  a  Sun  storm  is  imminent,  the  launch  of 
manned  space  flight  missions  can  be  postponed,  and  a  walk  outside  a  spacecraft  or 
on  the  Moon  or  Mars  can  be  delayed.  Airplane  pilots  can  be  warned  of  potential 
radio  communication  failures.  Operators  can  power  down  sensitive  electronics  on 
communication  and  navigation  satellites  until  the  danger  passes.  Utility  companies 
can  reduce  load  in  anticipation  of  trouble  on  power  lines,  in  that  way  trading  a 
temporary  “brown  out”  for  a  potentially  disastrous  “black  out.” 

Everyone  wants  to  know  how  strong  a  space  storm  is  and  when  it  is  going  to  hit. 
Like  winter  storms  on  the  Earth,  some  of  the  effects  can  be  predicted  days  in 
advance.  A  coronal  mass  ejection,  for  example,  arrives  at  the  Earth  one  to  four 
days  after  solar  astronomers  watch  it  leave  the  Sun.  Solar  flares  are  another  matter; 
as  soon  as  one  is  observed  on  the  Sun,  its  radiation  and  fastest  particles  have 
already  reached  the  Earth,  taking  just  8  min  to  travel  from  the  Sun.  One  promising 
prediction  technique  is  to  observe  when  the  magnetism  on  the  Sun  has  become 
twisted  into  a  stressed  situation,  because  it  then  may  be  about  to  release  a  solar 
flare.  Another  technique  employs  helioseismology  to  look  through  the  Sun  and 
watch  active  regions  develop  before  they  rotate  to  face  the  Earth. 


Chapter  10 

The  Sun  Amongst  the  Stars 


10.1  Comparisons  of  the  Sun  with  Other  Stars 

10.1.1  How  Far  Away  are  the  Stars? 

To  determine  the  distance  of  a  nearby  star  (other  than  the  Sun),  astronomers 
measure  its  angular  displacement  when  viewed  from  opposite  sides  of  the  Earth’s 
orbit,  or  from  a  separation  of  twice  the  AU.  The  AU  is  the  mean  distance  between 
the  Earth  and  the  Sun,  with  a  value  of  about  149.6  million  km.  This  angle  is  known 
as  the  annual  parallax ,  from  annual  for  the  Earth’s  yearlong  orbit  and  the  Greek 
word  parallaxis  for  the  “value  of  an  angle”.  Once  the  parallax  is  combined  with 
the  known  value  of  the  AU,  the  star’s  distance  can  be  established  by  triangulation , 
the  geometry  of  a  triangle. 

The  measurement  involves  careful  scrutiny  of  two  stars  that  appear  close 
together  in  the  sky:  a  bright  one  relatively  nearby  and  another  fainter  one  much 
farther  away  (Fig.  10.1).  The  annual  parallax  of  the  nearer  star  can  then  be 
determined  by  comparing  its  position  to  that  of  the  distant  one  for  a  year  or  more. 

By  definition,  the  annual  parallax,  denoted  7iA,  is  half  the  apparent  angular 
displacement  of  a  nearby  star  observed  against  the  more  distant  stars  at  intervals  of 
six  months  from  opposite  sides  of  the  Earth’s  orbit.  That  is: 

sin7iA  =  AU/D  «  tza  (10.1) 

for  a  star  at  distance  D  and  1  AU  =  1.496  x  1011  m.  As  long  as  the  stellar 
distances  are  much  larger  than  the  AU,  which  is  always  the  case,  the  parallax  angle 
7 zA  is  small  and  sin  nA  «  nA.  The  distance  D  can  therefore  be  given  by: 

D  =  —  parsec,  (10.2) 

7lA 

if  7i A  is  given  in  seconds  of  arc,  denoted  by  the  symbol  The  name  of  this  distance 
unit,  the  parsec ,  derives  from  the  italicized  part  of  the  two  words  par allax  and 
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Fig.  10.1  Annual  parallax  When  a  distant  and  nearby  star  are  observed  at  six-month  intervals, 
from  opposite  sides  of  the  Earth’s  orbit  around  the  Sun,  astronomers  measure  the  angular 
displacement  between  the  two  stars.  It  is  twice  the  annual  parallax,  designated  by  nA,  which  can 
be  used  to  determine  the  distance,  D,  of  the  nearby  star.  From  trigonometry,  sin 
n A  =  AU/D  «  7i A  for  small  angles,  where  1  AU  is  the  mean  distance  between  the  Earth  and 
the  Sun.  The  distance  D  to  the  star  in  units  of  parsecs  is  given  by  1  / nA,  if  the  parallax  angle  is 
measured  in  seconds  of  arc.  This  angle  is  greatly  exaggerated  in  the  figure,  for  all  stars  have  a 
parallax  of  less  than  1  s  of  arc  or  less  than  1 /3,600th  of  a  degree.  The  German  astronomer 
Friedrich  Wilhelm  Bessel  (1784-1846)  announced  the  first  reliable  measurement  of  the  annual 
parallax  of  a  star  in  1838 


seconds  of  arc.  The  parsec,  abbreviated  pc,  is  a  convenient  unit  for  measuring 
stellar  distance  since  neighboring  stars  often  are  separated  by  about  1  parsec. 

For  conversion  purposes,  it  is  useful  to  know  that: 

1  parsec  =  1  pc  =  3.26  light  years  =  3.0857  x  1016m  =  206, 265  AU.  (10.3) 

The  light  year  is  the  distance  light  travels  in  one  year,  moving  at  the  speed  of 
light  c  =  2.9979  x  108  m  s  1  where  1  year  =  3.1557  x  107  s. 

The  fact  that  light  travels  through  space  at  a  constant  speed  provides  another 
convenient  unit  of  astronomical  distance,  by  the  time  it  takes  light  to  move  through 
space  from  the  object  to  the  Earth.  This  is  known  as  the  light-travel  time.  Light 
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from  the  Moon  takes  1.5  s  to  reach  the  Earth,  so  we  say  that  the  Moon  is  1.5  light- 
seconds  away  from  the  Earth.  Sunlight  takes  499  s  or  8.3  min  to  cover  the  average 
distance  between  the  Sun  and  the  Earth.  The  nearest  star  (other  than  the  Sun)  is  at  a 
distance  of  a  little  more  than  4  light-years. 

Many  of  the  brightest  stars  are  hundreds  of  light-years  away,  and  some  stars  in 
our  Milky  Way  are  many  millions  of  light-years  away.  Starlight  may  reach  us  from 
stars  that  have  now  extinguished  the  internal  nuclear  fires  that  make  them  shine. 
The  most  distant  objects  in  the  universe  are  billions  of  light-years  away,  and  the 
light  we  now  detect  from  them  was  generated  that  long  ago.  So,  radiation  provides 
a  method  of  looking  back  into  time,  to  decipher  the  history  of  the  universe  - 
looking  back  to  the  time  before  the  Sun  and  the  Earth  were  formed,  some 
4.6  billion  years  ago. 

The  German  astronomer  Friedrich  Wilhelm  Bessel  (1784-1846)  made  the  first 
reliable  determination  of  the  annual  parallax  of  a  star,  reporting  a  parallax  of  0.3 1" 
for  61  Cygni,  with  an  uncertainty  of  ±0.02"  (Bessel  1839).  The  modern  mea¬ 
surement  of  0.286//  indicates  that  61  Cygni  is  at  a  distance  of  3.49  pc  or 
721,000  AU.  Traveling  at  the  speed  of  light,  it  takes  11.4  years  to  cross  the  dis¬ 
tance  from  61  Cygni  to  Earth. 

Bessell  was  closely  followed  by  the  Scottish  astronomer  Thomas  Henderson 
(1798-1844),  who  obtained  a  parallax  of  1.19"  ±  0.11"  for  the  bright  star  Alpha 
Centauri  (Henderson  1839).  Bessell  and  Henderson  were  both  observing  stars  that 
had  exceptionally  large  motions  across  their  line  of  sight,  suggesting  that  they 
would  be  the  nearest  stars  if  all  stars  move  at  the  same  speed.  Jackson  (1956)  and 
Hirshfeld  (2001)  describe  attempts  to  obtain  the  first  measurement  for  the  distance 
of  a  nearby  star  (other  than  the  Sun). 

There  is  no  known  star,  other  than  the  Sun,  whose  annual  parallax  is  greater 
than  1",  or  whose  distance  from  Earth  is  less  than  1  pc.  The  star  with  the  largest 
parallax  is  Proxima  Centauri,  with  a  parallax  of  0.769",  making  it  the  closest  star 
to  the  Earth  other  than  the  Sun.  The  distance  to  Proxima  is  1.30  pc,  and  light  takes 
only  4.24  years  to  reach  us  from  this  star.  It  is  about  268,000  times  more  distant 
than  the  Sun,  indicating  that  there  are  vast,  seemingly  empty  spaces  between  the 
stars. 

Proxima  Centauri  is  the  nearest  and  dimmest  companion  of  a  triple  star  system 
that  includes  Alpha  Centauri,  the  third  brightest  star  in  the  sky.  It  takes  about  a 
million  years  for  Proxima  to  orbit  Alpha  Centauri,  and  we  only  think  it  is  asso¬ 
ciated  with  it  because  all  three  stars  move  together  through  space  in  about  the  same 
direction  and  with  about  the  same  speed.  The  large  angular  separation  of  Proxima 
Centauri  from  Alpha  Centauri  is  a  little  more  than  2.18°,  or  four  times  the  angular 
diameter  of  the  full  Moon,  which  makes  it  possible  to  view  the  much  dimmer 
companion  outside  the  glare  of  the  brighter  star. 

Because  the  Earth’s  atmosphere  usually  limits  the  angular  resolution  of  a 
ground-based  telescope  to  no  more  than  0.05",  the  annual  parallax  method  can  be 
used  only  for  the  very  nearest  stars,  those  that  are  closer  than  about  65  light-years 
or  20  pc.  However,  instruments  aboard  the  ESA  HIPPARCOS  satellite,  which 
orbited  the  Earth  above  its  atmosphere  in  the  1990s,  pinpointed  the  position  of 
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more  than  100,000  stars  with  an  astonishing  precision  of  0.00 l"  and  obtained 
accurate  measurements  for  the  distances  of  stars  up  to  1,000  pc  and  3,260  light- 
years  away.  This  explains  the  spacecraft’s  name,  which  is  an  acronym  for  High 
Precision  PARallax  Collecting  Satellite ;  the  name  also  alludes  to  the  ancient 
Greek  astronomer  Hipparchus,  who  recorded  accurate  star  positions  more  than 
2,000  years  ago.  A  successor  to  this  mission  is  the  ESA  GAIA  mission,  short  for 
Global  Astrometric  Interferometer  for  Astrophysics ,  currently  scheduled  for 
launch  in  March  2013.  This  mission  is  intended  to  measure  1  billion  stellar  dis¬ 
tances  to  perhaps  10,000  pc  and  32,600  light-years. 


10.1.2  How  Bright  are  the  Stars? 


The  apparent  brightness  of  a  star  is  how  bright  it  appears  to  us  when  its  radiation 
reaches  the  Earth.  The  celestial  positions  and  physical  parameters  of  the  ten 
brightest  stars  are  given  in  Table  10.1,  together  with  the  brightest  star,  the  Sun. 

Because  a  human  eye  does  not  register  directly  the  relative  amount  of  radiation 
entering  it,  the  Greek  astronomer  Hipparchus  (c.  190  BC-c.  120  BC)  divided  the 
stars  that  he  could  see  into  six  groups  to  better  measure  their  relative  brightness, 
relative  to  the  eyes.  This  way  of  measuring  brightness  is  called  the  apparent  visual 
magnitude  and  is  designated  by  the  lowercase  letter  m  or  to  be  explicit  about  the 
visual  aspect,  by  mv  with  the  subscript  V  denoting  “visual”.  Hipparchus  desig¬ 
nated  the  brightest  stars,  such  as  Sirius  or  Rigel,  with  the  first  and  most  important 
magnitude,  m  =  1 ;  Polaris  and  most  of  the  stars  in  the  Big  Dipper  were  designated 
m  =  2;  and  the  faintest  stars  visible  to  the  unaided  eye  received  the  sixth  mag¬ 
nitude,  or  m  =  6.  Thus,  in  the  magnitude  system,  brighter  stars  have  lower 
magnitudes  and  fainter  stars  have  higher  ones. 

About  two  millennia  later,  the  British  astronomer  Sir  Norman  Pogson 
(1829-1891)  noted  that  the  stars  of  the  first  magnitude  were  100  times  as  bright  as 
stars  of  the  sixth  magnitude  and  that  each  magnitude  unit  is  2.512  times  brighter 
than  the  next  one  down,  where  the  number  2.152  is  the  fifth  root  of  100,  or  lOO1^5 
(Pogson  1856).  The  apparent  magnitudes  mi  and  m2 ,  of  two  objects  of  apparent 
brightness,  or  apparent  radiation  flu x/i  and  f2,  are  related  by: 


mi  —  m2  =  —2.5 12  log 


2.5121og 


(10.4) 


where  the  subscripts  denote  objects  1  and  2,  and  log  denotes  the  logarithm  to  the 
base  ten.  An  equivalent  relation  is: 


4 

h 


2.512(m2_mi) 


|Q0.4(m2-mi )  2  5  ]  2“(m'  m2 )  ^ Q— 0.4(mi  -m2) 


(10.5) 


Table  10.1  The  ten  brightest  stars  as  seen  from  Earth 
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Table  10.2  Apparent  visual 
magnitudes,  mv,  of  some 
astronomical  objects 


Object  name 

mv 

Sun 

-26.74 

Full  moon 

-12.7 

Venusa 

-4.5 

Jupitera 

-2.5 

Sirius 

-1.44 

Rigel 

0.12 

Saturna 

0.7 

Polaris 

1.97 

a  At  maximum  brightness  when  the  planet  is  in  the  part  of  its 
orbit  that  brings  it  closest  to  the  Earth 


The  apparent  magnitude  relation  takes  into  account  the  nonlinear,  roughly 
logarithmic  response  of  the  human  eye  to  light  intensity,  and  the  apparent  radiation 
flux  is  a  technical  name  for  a  star’s  apparent  brightness. 

This  logarithmic  scale  caused  the  very  brightest  stars  to  climb  to  negative 
apparent  magnitudes.  Sirius  is  m  =  —1.44,  the  planets  Venus  and  Jupiter  are  a 
little  brighter  than  Sirius,  and  the  Sun  is  so  close  and  bright  that  it  is  m  =  —26.74 
(Table  10.2). 

The  number  of  stars  increases  dramatically  with  increasing  apparent  visual 
magnitude.  There  are  14  stars  brighter  than  m  =  1  and  about  5,600  stars  brighter 
than  m  =  6,  which  are  all  of  the  stars  detectable  by  the  unaided  human  eye.  There 
are  335,000  stars  brighter  than  m  =  10,  1.5  million  stars  brighter  than  m  =  12, 
and  4.8  billion  stars  brighter  than  m  =  25.  A  backyard  telescope  can  detect  stars  of 
apparent  magnitude  between  10  and  15;  the  Hubble  Space  Telescope  can  approach 
apparent  magnitude  30.  These  stars  are  4  billion  times  fainter  than  the  human  eye 
can  see  without  a  telescope,  and  there  are  100  billion  of  them. 


10.1.3  How  Luminous  are  the  Stars? 

Luminosity  is  an  intrinsic  measure  of  a  star,  and  it  is  not  related  to  the  star’s  distance 
from  the  observer.  It  is  the  amount  of  energy  a  star  radiates  per  unit  time  in  units  of 
J  s_1,  which  also  is  the  emitted  power  in  watts,  where  1  J  s_1  =  1  watt.  A  star’s 
luminosity  usually  is  compared  to  the  luminosity  of  the  Sun,  designated  L0,  with  a 

o/:  _ i 

value  of  Lq  =  3.828  x  10“  J  s  .  There  are  stars  that  are  1  million  times  more 
luminous  than  the  Sun  (Table  10.3)  and  other  stars  that  are  1  million  times  less 
luminous  than  the  Sun.  The  exceptionally  luminous  beacons  are  rare,  and  the  most 
common  stars  are  not  even  as  luminous  as  the  Sun;  they  are  so  dim  that  telescopes  are 
required  to  see  them.  The  most  luminous  stars  also  are  amongst  the  most  massive, 
largest,  and  hottest  stars,  and  the  progressive  decrease  in  stellar  luminosity  usually 
corresponds  to  a  decrease  in  stellar  mass  and  radius  (see  Table  10.3). 
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Table  10.3  The  range  in  stellar  luminosity a 


Star  name 

Luminosity 

0®) 

Mass 

(M0) 

Radius 

(*©) 

Temperature 

(K) 

R  136alb 

8,700,000c 

265 

35.4 

53,000 

LBV  1806  -  20 

5,000,000c 

200 

150 

36,000 

Pistol  Star 

1,700,000° 

150 

340 

20,000 

Betelgeuse 

200,000c 

19 

1,180 

3,500 

Rigel 

85,000c 

17 

78 

11,000 

Polaris 

2,000 

7.5 

30 

3,200 

Aldebaren 

425 

1.7 

44 

4,010 

Vega 

37 

2.1 

2.5 

9,600 

Sirius  A 

25.4 

2.0 

1.7 

9,940 

Alpha  Centauri  A 

1.5 

1.1 

1.2 

5,790 

Sun 

1.0 

1.0 

1.0 

5,780 

Sirius  Bd 

0.026 

0.978 

0.0084 

25,200 

Gliese  229Be 

0.000006 

0.03  to  0.05 

0.1 

950 

a  The  luminosity  is  in  units  of  the  Sun’s  luminosity  Le  =  3.828  x  1026  J  s  \  the  mass  is  in 
units  of  the  Sun’s  mass  M0  =  1.989  x  10  kg,  the  radius  is  in  units  of  the  Sun’s  radius 

o 

Rq  =  6.955  x  10  m,  and  temperature  is  the  effective  temperature  of  the  visible  stellar  disk  in 
degrees  kelvin,  denoted  K 

b  In  the  Large  Magellanic  Cloud,  a  nearby  irregular  galaxy  and  satellite  of  the  Milky  Way 
c  Bolometric  luminosity 

d  Sirius  B  is  an  Earth- sized  white  dwarf  star,  which  has  a  mass  about  equal  to  that  of  the  Sun  but 
has  depleted  its  thermonuclear  fuel 

e  Gliese  229B  is  a  sub- stellar  brown  dwarf  object,  whose  mass  is  below  the  lower  limit,  at  about 
0.08  solar  masses,  to  sustain  hydrogen  fusion.  This  brown  dwarf  object  has  a  radius  about  equal  to 
that  of  Jupiter,  which  is  very  close  to  one  tenth  the  Sun’s  radius,  but  a  mass  of  30-50  times  the 
mass  of  Jupiter,  or  about  0.03-0.05  times  the  mass  of  the  Sun 


Astronomers  observe  an  apparent  brightness  or  apparent  radiation  flux ,  des¬ 
ignated  by  the  symbol  fs,  which  is  the  amount  of  radiant  energy  per  unit  time  per 

_  1  _ o 

unit  area  reaching  the  Earth.  This  quantity  has  units  of  J  s  m  ,  and  it  depends 
on  a  star’s  distance  from  the  Earth. 

As  stellar  radiation  travels  out  into  space,  it  is  distributed  over  an  imaginary 
sphere  of  surface  area  4tzD2s  at  distance  Ds  from  the  star.  The  apparent  radiation 
flux  reaching  a  terrestrial  observer  from  a  star  of  luminosity  Ls  is: 


4nD2s 

This  is  sometimes  called  the  inverse  square  law  of  light,  since  the  apparent 
brightness,  or  radiation  flux,  falls  off  as  the  inverse  square  of  the  distance. 

The  apparent  brightness  of  stars  can  be  combined  with  measurements  of  their 
distances  to  determine  their  luminosity.  Some  of  the  stars  that  appear  bright  to  the 
eye  are  relatively  nearby  and  no  more  luminous  than  the  Sun,  but  many  are  distant 
stars  that  are  hundreds  of  thousands  of  times  more  luminous  than  the  Sun.  Thus, 
the  exceptional  brightness  of  the  brightest  stars,  as  seen  from  the  Earth,  can  be  due 


(10.6) 
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to  either  the  immense  power  of  the  radiation  they  emit  or  to  their  relative  close¬ 
ness,  compared  with  other  stars. 


Example:  The  Sun’s  apparent  brightness  and  intrinsic  luminosity 

Instruments  aboard  satellites  have  measured  the  Sun’s  apparent  brightness, 
known  as  the  solar  constant,/©.  It  is  the  total  amount  of  radiant  solar  energy 
per  unit  area  reaching  the  top  of  the  Earth’s  atmosphere  at  the  Earth’s  mean 
distance  from  the  Sun,  and  the  measurements  indicate  that/©  =  1,361  J  s-1 

_ r\  11 

m  .  Using  a  mean  distance  of  1  AU  =  1.496  x  1011  m,  we  obtain  the 
Sun’s  luminosity  L©  =  47i/©(AU)2  =  3.828  x  1026  J  s_1. 

By  extrapolating  the  Sun’s  radiation  flux  back  to  its  visible  disk,  we  can 
use  that  flux,  designated  F©,  to  specify  the  disk’s  effective  temperature,  Teff, 
by  the  relation  (Sect.  3.4): 


47 rF© 


(10.7) 


o 

where  the  Sun’s  radius  F©  =  6.955  x  10  m  and  the  Stefan-Boltzmann 
constant  o  =  5.6704  xlO  Js  m~K  .  Given  these  quantities,  we 
obtain  Teff  «  5,780  K  for  the  visible  solar  disk. 


To  compare  stellar  luminosities,  distance  has  to  be  removed  from  the  com¬ 
parison,  and  a  system  of  absolute  magnitudes  is  used  for  that.  The  absolute 
magnitude,  M,  is  defined  as  the  apparent  magnitude  the  star  would  have  if  it  was  at 
an  arbitrary  standard  distance  of  10  parsecs,  10  pc,  or  32.6  light-years.  So 

M  =  m  +  5  -  51ogD,  (10.8) 

where  D  is  the  distance  in  parsecs.  The  absolute  magnitude  may  also  be  derived 
from  the  apparent  magnitude  and  the  annual  parallax  by  the  formula 

M  =  m  -j-  5  +  51og7r^,  (10-9) 

where  nA  is  the  annual  parallax  in  seconds  of  arc  denoted 


Example:  The  Sun’s  absolute  visual  magnitude 

The  Sun  has  an  apparent  visual  magnitude  of  mv©  =  —26.74,  and  it  lies  at  a 
mean  distance  of  1  AU  =  1.496  x  1011  m  =  4.848  x  10-6  pc,  using  the 
conversion  of  1  parsec  =  1  pc  =  3.0857  x  10  m.  So  the  absolute  magni¬ 
tude  of  the  Sun  is  given  by  Mv©  =  —26.74  +  5  -  5  log  (4.848  x  10-6)  = 
-26.74  +  5  -  5  log  (4.848)  +  30  =  4.83. 


The  luminosity,  Z©  and  L2,  of  two  objects  of  absolute  magnitude,  Mx  and  M2, 
are  related  by: 
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—  =  2  512^M2-Ml)  ^q0.4(m2-M!) 

l2 


(10.10) 


where  2.512  =  1001/5  =  10° 4. 

Absolute  magnitudes  range  from  about  —12  to  fainter  than  +20,  but  most  stars 
are  between  —5  and  +15  in  absolute  magnitude.  It  can  have  a  negative  value,  so  it 
is  not  absolute  in  a  mathematical  sense.  The  brightest  star  in  the  night  sky,  Sirius, 
has  M  =  +1.4,  but  that  is  much  less  luminous  than  the  seventh  brightest  star  Rigel 
with  M  =  —8.1.  The  apparent  and  absolute  magnitudes  of  the  ten  brightest  stars 
were  included  in  Table  10.1.  Our  apparently  brilliant  Sun  has  an  absolute  mag¬ 
nitude  of  M  =  +4.83. 

Since  each  step  of  15  in  absolute  magnitude  indicates  a  difference  of  a  million 
in  luminosity,  there  are  stars  with  an  absolute  magnitude  of  —10  that  are  about  a 
million  times  more  luminous  than  the  Sun  and  those  with  an  absolute  magnitude  of 
20  that  are  about  a  million  times  less  luminous  than  the  Sun. 

You  can  convert  absolute  magnitude,  M,  to  luminosity  L  through  the  relation: 


0.4(4.83  -  Af), 


or  equivalently: 


(10.11) 


L  =  io°-4(4-83_m)l0  (10.12) 

o/:  _ i 

where  the  absolute  magnitude  of  the  Sun  is  +4.83  and  L0  =  3.828  x  10“  J  s  . 
The  absolute  magnitude,  M,  of  a  star  of  luminosity  L  is: 

M  =  +  4.83  -  2.51og  (T) .  (10.13) 


Example:  Distance,  luminosity,  temperature,  and  size  of  the  nearest 
stars 

The  Sun  is  the  nearest  star.  It  has  a  mean  distance  of  1  AU  =  1.496  x 
1011  m,  an  absolute  luminosity  of  L0  =  3.828  x  1026  J  s_1,  an  effective 
temperature  of  Teff  =  5,780  K,  and  a  radius  of  RQ  =  6.955  x  108  m.  The 
nearest  star  other  than  the  Sun  is  Proxima  Centauri,  located  at  a  distance  of 
4.243  light-years,  where  1  light-year  =  9.4605  x  1015  m.  The  ratio  of  the 
distance,  D ,  of  Proxima  Centauri  to  the  Sun’s  distance  is 
D/ AU  =  2.68  x  105,  so  Proxima  Centauri  is  268,000  times  farther  away 
than  the  Sun  and  stars  are  separated  by  vast,  seemingly  empty  space.  Since  1 
parsec  =1  pc  =  3.26  light-years,  the  distance  of  Proxima  Centauri  is 
D  =  1.30  pc,  and  its  parallax  is  nA  =  7/D  =  0.769  s  of  arc  =  0.769".  The 
spectral  type  of  this  star  is  M5.5,  a  cool  red  star  with  an  effective  temperature  of 
Teff  =  3,042  K.  The  apparent  visual  magnitude  of  the  star  is  rav  =  1 1.05,  so 
its  absolute  visual  magnitude  is  Mv  =  mv  +  5  —  5  log  D  =  11. 05 +  5  —  5 
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log  (1.30)  «  15.5,  where  D  is  the  distance  in  parsecs.  In  the  visible  range  of 
wavelengths,  the  absolute  magnitude  of  the  Sun  is  Mv0  =  4.83,  so  in  visible 
light  Proxima  Centauri  has  a  luminosity  given  by  log  (L/L0)  =  O.4(MV0  — 
My),  or  L  k,  10-4'27  L0  =  0.000054  L0  in  visible  light.  At  a  temperature  of 
3,042  K,  the  most  intense  emission,  from  the  Wien  displacement  law,  is  at  a 
wavelength  of  2  =  0.0029 /T  =  9.53  x  10  m,  or  at  infrared  wavelengths. 
Most  of  Proxima  Centauri’ s  power  is  radiated  at  unseen  infrared  wavelengths, 
and  the  total  luminosity  over  all  wavelengths  is  L  ^  0.0017  L0.  We  can  use 
the  Stefan-Boltzmann  law  L  =  AugR2T where  n  =  3. 14159  and  the  Stefan- 
Boltzmann  constant  a  =  5.670  x 

10— 8  J  s-1  m-2  K-4,  to  infer  the  star’s  radius  of  R  «  108  m  «  O.147?0from 
the  known  values  of  L  and  Teff.  Assuming  that  the  luminosity  of  a  star  varies  as 
the  fourth  power  of  its  mass,  the  mass  of  Proxima  Centauri  would  be  about  a 
tenth  of  that  of  the  Sun. 


The  difference  between  the  apparent  magnitude,  m,  and  the  absolute 
magnitude,  M,  of  an  object  is  related  to  its  distance,  D,  by  the  distance  modulus  fi 
given  by: 


fi  =  m  —  M  =  51ogD  —  5,  (10.14) 

for  D  in  parsecs,  or 

fi  =  m  —  M  =  —  5(1  +  log^),  (10.15) 

where  nA  is  the  annual  parallax  in  seconds  of  arc,  denoted  by  the  symbol 

The  absorption,  or  extinction,  of  light  by  interstellar  dust  diminishes  the  light 
intensity  and  increases  the  apparent  magnitude  by  an  amount  A.  The  distance 
modulus  has  to  be  corrected  for  this  to  yield  the  true  distance  modulus: 


[i  =  m  —  M  —  A  —  51ogD  —  5. 


(10.16) 


Example:  The  absolute  magnitude  and  luminosity  of  Sirius  A  and 
Sirius  B 

The  brightest  star  in  the  sky  is  Sirius,  the  Dog  Star,  often  denoted  Sirius  A  to 
distinguish  it  from  its  faint  companion  Sirius  B.  Sirius  A  has  an  apparent 
magnitude  of  m  =  —1.47  and  a  distance  of  D  =  8.60  light-years.  Divide  by 
3.26  to  get  the  distance  in  parsecs,  or  D  =  2.64  parsecs,  and  the  absolute 
magnitude  is  M  =  m  +  5  -  5  log  D  =  — 1.47  +  5  —  5  log  (2.64)  =  1.42. 
The  luminosity  of  Sirius  A  is  L  =  t0O  4(4  83-M)  L0  «  23  L0.  The  apparent 
magnitude  of  Sirius  B  is  m  =  8.30,  which  corresponds  to  an  absolute 
magnitude  of  M  =  11.19  and  a  luminosity  of  L  =  0.0029  L0  for  the  star’s 
visible  light. 
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There  is  one  more  caveat  to  our  magnitude  system,  for  the  visual  magnitudes 
that  are  measured  by  the  eye  or  a  conventional  optical  telescope  only  sample  the 
visible  wavelengths,  and  a  star  can  emit  intense  radiation  outside  our  range  of 
vision,  including  the  ultraviolet  or  infrared  part  of  the  electromagnetic  spectrum. 
Astronomers  use  the  term  bolometric ,  from  the  Greek  word  meaning  “measure  of 
rays”  to  indicate  the  total  radiation  output  of  a  star,  and  it  has  both  an  apparent  and 
absolute  bolometric  magnitude.  Fortunately,  stellar  temperature,  which  can  be 
determined  by  other  methods,  can  be  used  to  determine  the  amount  of  radiation 
that  was  not  detected  in  the  visible  range,  and  apply  a  bolometric  correction  to 
convert  visual  magnitudes  to  bolometric  ones.  And  this  brings  us  to  the  temper¬ 
ature  of  the  stars. 


10.1.4  The  Temperatures  of  Stars 

An  understanding  of  the  physical  properties  of  a  star  requires  knowledge  of  its 
temperature  as  well  as  its  luminosity.  The  effective  temperature  of  the  Sun’s 
visible  disk,  the  photosphere,  is  inferred  from  the  solar  radius  and  luminosity,  with 
a  value  of  Teff  =  5,780  K.  However,  we  do  not  have  direct  knowledge  of  the  radius 
of  most  stars.  They  are  too  far  away  and  too  small  in  angular  size  for  a  telescope  to 
resolve  them. 

Fortunately,  there  are  two  methods  to  infer  a  star’s  temperature  even  when  we 
do  not  know  its  size  and  luminosity.  We  can  estimate  the  temperature  from  the 
color  of  the  star  or  infer  its  temperature  from  the  relative  intensities  of  absorption 
lines  observed  in  its  spectrum.  These  are  the  effective  temperatures  of  the  visible 
stellar  disks,  or  the  stellar  photospheres.  The  photospheres  of  the  hottest  stars  that 
we  can  see  have  temperatures  of  more  than  100  times  those  of  the  coolest  stars, 
with  a  range  between  2,000  and  50,000  K.  Bohm-Vitense  (1981)  has  provided  a 
review  of  the  stellar  effective  temperature  scale. 

These  are  the  stars  we  look  at,  but  our  eyes  do  not  see  all  of  the  radiation  that  a 
star  produces.  At  extreme  hot  or  cold  temperatures,  a  star  can  become  visibly  dim, 
even  invisible,  because  most  of  its  radiation  is  produced  outside  the  visible  part  of 
the  radiation  spectrum  and  often  is  absorbed  in  the  Earth’s  atmosphere  before 
reaching  the  ground. 

A  very  hot  star,  with  a  temperature  of  more  than  100,000  K,  emits  most  of  its 
light  at  ultraviolet  wavelengths  that  are  absorbed  in  the  atmosphere.  These  hottest 
stars  are  exceptionally  luminous  and  massive. 

The  coolest  star-like  objects  emit  most  of  their  radiation  at  infrared  wave¬ 
lengths,  also  absorbed  in  the  Earth’s  atmosphere  and  outside  our  range  of  vision. 
There  are  the  substellar,  brown  dwarf  objects,  for  example,  that  do  not  have 
enough  mass  to  begin  nuclear  fusion  of  hydrogen  in  their  core.  These  stellar  disks 
emit  heat  associated  with  their  formation  or  by  burning  deuterium  that  was  already 
present  in  them.  The  brown  dwarf  objects  are  sometimes  colder  then  room  tem¬ 
perature,  or  below  300  K. 
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10.1.5  The  Colors  of  Stars 

There  are  reddish  stars  like  Betelgeuse  and  Antares,  yellowish  stars  like  the  Sun 
and  Capella,  and  whitish  stars  like  Vega  and  Sirius.  These  colors  provide  a  rough 
indication  of  the  temperature  of  a  star’s  photosphere.  As  the  temperature  rises,  the 
colors  change  from  red  -  near  3,000  K,  to  yellow  -  around  6,000  K,  to  white  -  at 
about  10,000  K. 

A  star  often  has  a  certain  color  because  most  of  its  radiation  is  emitted  at  the 
wavelengths  corresponding  to  that  color.  The  wavelength  of  maximum  starlight 
intensity  varies  inversely  with  temperature.  A  blue  star,  for  example,  is  hotter  than 
a  red  star.  The  coloring  of  a  star,  however,  is  very  subtle,  depending  on  the  relative 
amount  of  light  seen  in  different  colors. 

Blue-colored  stars,  for  example,  are  not  just  bluer  than  red  stars.  For  a  star  of 
the  same  radius,  a  blue  star  is  more  luminous  than  a  red  one.  Exceptionally  hot 
stars  emit  most  of  their  radiation  at  invisible  ultraviolet  wavelengths,  and  such  a 
star  is  even  more  luminous.  There  is  enhanced  radiation  intensity  at  adjacent 
wavelengths,  and  this  ultraviolet  spillover  produces  more  blue  light  than  expected 
for  a  cooler  star.  In  this  case,  the  temperature  of  the  star  is  much  hotter  than  that 
inferred  from  blue  light  alone. 

Astronomers  therefore  decided  to  quantify  color  by  comparing  the  apparent 
magnitudes  measured  in  different  wavelength  bands.  They  are  denoted  U,  B ,  and 

V  for  ultraviolet,  blue,  and  visual  bands,  and  centered  at  wavelengths  of  350,  450 
and  550  nm.  The  apparent  magnitude  differences  mB  —  mv  =  B  —  V  or  — 
mB  =  U  —  B,  are  then  determined.  The  apparent  magnitude  of  a  star,  denoted  m, 
usually  refers  to  its  visual  apparent  magnitude,  also  written  mv. 

The  difference  between  the  amount  of  light  received  at  one  color  and  the 
amount  at  another  is  known  as  the  color  index ,  which  is  usually  measured  by  the 
difference  between  blue,  designated  B,  and  visual,  denoted  V,  bands  with  a  color 
index  denoted  by  B  —  V.  It  provides  a  reasonable  estimate  of  photosphere 
temperature  by  using  the  ratio  of  luminosities  at  two  wavelengths,  which  is  better 
than  a  temperature  estimated  from  observations  at  only  one  wavelength.  The 
temperatures  increase  from  about  3,150  K  for  B  —  V  =  2.0  to  60,000  K  at  B  — 

V  =  —0.4,  reflecting  the  fact  that  hotter  stars  emit  more  blue  light. 

However,  in  addition  to  ultraviolet  spillover  into  the  blue  colors,  interstellar 

dust  reddens  starlight  as  it  travels  through  space  to  arrive  at  the  Earth,  and  the 
amount  of  reddening  increases  with  a  star’s  distance.  Thus,  the  observed  colors 
may  not  reliably  reflect  the  emitted  colors.  A  star’s  spectral  lines  provide  a  more 
accurate  indication  of  the  temperature  of  a  star’s  photosphere. 
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10.1.6  The  Spectral  Sequence 

More  than  a  century  ago,  astronomers  noticed  that  stars  of  different  colors  exhibit 
different  spectral  lines.  Strong  absorption  lines  of  hydrogen,  for  example,  dominate 
the  spectra  of  white  stars  like  Vega  and  Sirius,  whereas  some  blue  stars  have 
noticeable  helium  absorption  lines.  Yellow  stars  like  the  Sun  have  strong 
absorption  lines  of  calcium  and  heavier  elements,  called  metals,  in  their  spectra. 

The  different  spectral  lines  that  are  emitted  by  stars  depend  on  the  physical 
conditions  in  the  visible  disk  -  the  photosphere  -  and  therefore  the  level  of 
ionization  of  the  emitting  atoms  (also  see  Sect.  6.4).  Stars  that  display  spectral 
lines  of  highly  ionized  elements  must  be  relatively  hot,  because  high  temperatures 
are  required  to  ionize  atoms.  These  hot  stars  have  relatively  weak  hydrogen  lines 
because  nearly  all  of  the  hydrogen  is  ionized  and  all  of  its  electrons  have  been  set 
free  from  their  atomic  bonds,  no  longer  emitting  or  absorbing  radiation.  In  other 
words,  stars  that  display  hydrogen  lines  have  moderate  photosphere  temperatures. 
Those  exhibiting  molecular  lines  have  even  cooler  temperatures  because  mole¬ 
cules  break  apart  into  their  component  atoms  when  the  temperature  increases. 

A  system  of  stellar  classification  based  on  spectra  was  developed  in  the  early 
twentieth  century  and  is  still  in  use  today.  Working  under  the  direction  of  Edward 
C.  Pickering  (1846-1914),  astronomers  at  the  Harvard  College  Observatory 
examined  the  spectra  of  hundreds  of  thousands  of  stars.  The  astronomers  were 
mainly  women  who  had  studied  physics  or  astronomy  at  nearby  women’s  colleges, 
including  Wellesley  and  Ratcliffe.  Harvard  did  not  educate  women  at  that  time  and 
did  not  permit  women  on  its  faculty. 

One  of  these  faithful,  stalwart  workers  was  Annie  Jump  Cannon  (1863-1941), 
who  classified  the  spectra  of  roughly  400,000  stars  in  her  lifetime  (Cannon  and 
Pickering,  1918-1924).  She  distinguished  the  stars  on  the  basis  of  the  absorption 
lines  in  their  spectra  and  arranged  most  of  them  in  a  smooth  and  continuous 
spectral  sequence.  The  hottest  stars,  with  the  bluest  colors,  were  designated  as 
spectral  type  O,  followed  in  order  of  declining  photosphere  temperature  by 
spectral  types  B,  A,  F,  G,  K,  and  M  (Table  10.4). 


Table  10.4  The  spectral  classification  of  starsa 


Class 

Dominant  lines 

Color 

Color  index 

Effective  temperature 

Examples 

O 

He  II 

Blue 

-0.3 

28,000-50,000 

X  Per,  e  Ori 

B 

He  I 

Blue- White 

-0.2 

9,900-28,000 

Rigel,  Spica 

A 

H 

White 

0.0 

7,400-9,900 

Vega,  Sirius 

F 

Metals;  H 

Yellow- White 

0.3 

6,000-7,400 

Procyon 

G 

Ca  II;  Metals 

Yellow 

0.7 

4,900-6,000 

Sun,  a  Cen  A 

K 

Ca  II;  Ca  I 

Orange 

1.2 

3,500-4,900 

Arcturus 

M 

TiO;  Ca  I 

Orange-Red 

1.4 

2,000-3,500 

Betelgeuse 

a  An  H  denotes  hydrogen,  He  is  helium,  Ca  is  calcium,  and  TiO  is  a  molecule.  The  Roman 
numeral  I  denotes  an  electrically  neutral,  unionized  atom,  the  number  II  describes  an  ionized 
atom  missing  one  electron,  and  the  temperatures  are  in  degrees  kelvin,  denoted  K 
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Cannon  further  refined  each  spectral  class  by  adding  numbers  from  0  to  9, 
running  from  hot  to  cold;  the  larger  the  number,  the  cooler  the  star  in  that  class. 
For  example,  the  hottest  F  star  is  designated  as  F0  and  the  coolest  as  F9,  followed 
by  GO.  In  this  system,  our  Sun  is  classified  as  G2. 


10.1.7  Radius  of  the  Stars 

Large  stars  come  in  two  varieties:  the  giants  and  the  supergiants.  A  relatively 
common  type  of  big  star  is  the  red  giant  star,  which  is  about  100  times  bigger  than 
the  Sun;  the  other,  exceedingly  rare  kind,  the  supergiant,  is  about  1,000  times 
larger  than  the  Sun.  The  benchmark  size  is  the  radius  of  the  Sun,  denoted  7?0,  with 
a  value  of  RQ  =  6.955  x  108  m. 

The  red  giants  can  be  found  almost  anywhere  in  the  night  sky,  whereas  the 
supergiants  are  sparsely  scattered  within  the  Milky  Way.  Only  one  in  a  million 
stars  is  likely  to  be  a  supergiant.  As  the  name  implies,  supergiants  are  simply 
extreme  examples  of  the  giant  stars.  They  are  the  rare  anomalies  that  stand  out 
because  of  their  size.  They  are  exceptionally  big,  massive,  luminous  and  often 
bright.  Well-known  examples  of  both  types  of  large  stars  are  given  in  Table  10.5. 

We  can  measure  the  angular  size  of  the  largest  stars  using  an  interferometer  that 
employs  two  or  more  connected  mirrors.  The  radiation  waves  detected  by  any  two 
of  the  mirrors  are  combined  to  produce  an  interference  pattern  -  hence,  the  name 
interferometer ,  short  for  “interference-meter”.  If  the  waves  of  electromagnetic 
radiation  detected  by  the  two  mirrors  are  in  phase  when  combined,  their  wave 


Table  10.5  Some  well-known  large  starsa 


Star  name 

Radius 

(R@) 

Luminosity 

do) 

Mass 

(M0) 

Temperature 

(K) 

Supergiant  stars 

VY  Canis  Majoris 

%  2,000 

^450,000 

%40 

%  3,000 

VV  Cephei  A 

« 1,900 

^300,000 

%30 

^3,300 

Mu  Cephei 

1,650 

60,000 

15 

3690 

Betelgeuse 

1,180 

140,000 

19 

3500 

Antares 

800 

65,000 

15 

3500 

Red  giant  stars 

Mira  A 

400 

9,000 

1.2 

3,000 

R  Doradus 

370 

6,500 

%  1 .0 

2,740 

Aldebaren 

44.2 

425 

1.7 

4,010 

Polaris 

30 

2,200 

7.5 

7,200 

Arc  turns 

25.7 

210 

1.1 

4,300 

Pollux 

8.0 

32 

1.86 

4,865 

a  The  radius,  R,  is  units  of  the  Sun’s  radius  RQ  =  6.955  x  108  m,  the  luminosity,  L,  is  in  units 

o/r  _ i 

of  the  Sun’s  luminosity  L0  =  3.828  x  10  J  s  ,  the  mass,  M,  is  units  of  the  Sun’s  mass 

TO 

M0  =  1.989  x  10  kg,  and  the  temperature,  T,  is  the  effective  temperature  of  the  stellar  disk  in 
degrees  kelvin,  abbreviated  K 
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crests  combine  and  strong  light  is  detected.  When  they  are  out  of  phase,  one  wave 
crest  matches  the  trough  of  the  other  and  they  cancel  one  another.  In  the  earliest 
applications,  the  two  mirrors  were  separated  gradually  to  produce  a  set  of  light  and 
dark  bands,  or  “fringes”;  when  the  fringes  disappeared,  the  star  was  resolved.  The 
angular  diameter  of  the  source,  in  units  of  radians,  is  the  ratio  of  the  wavelength  to 
that  mirror  separation  in  which  the  fringes  disappear. 

The  American  physicist  Albert  A.  Michelson  (1852-1931)  was  one  of  the  first 
to  describe  the  interferometer  technique  (Michelson  1890),  and  thirty  years  later, 
he  teamed  up  with  the  American  astronomer  F.G.  Pease  (1881-1938)  to  use  an 
interferometer  to  measure  the  size  of  Betelgeuse.  They  mounted  two  moveable 
mirrors  and  two  fixed  mirrors  on  a  20  foot  (6  m)  steel  beam  that  was  placed  across 
the  frame  of  the  2.5  m  (100  inch)  Hooker  telescope  on  Mount  Wilson.  By  mea¬ 
suring  the  mirror  separation  when  the  interference  fringes  disappeared,  they 
concluded  that  Betelgeuse  has  an  angular  diameter  of  about  6  =  0.05",  where  the 
symbol  "  denotes  seconds  of  arc  (Michelson  and  Pease  1921).  By  way  of  com¬ 
parison,  if  the  Sun  were  placed  at  the  distance  of  the  next  nearest  star,  Proxima 
Centauri,  it  would  have  an  angular  diameter  of  approximately  0.007". 

Modem  visible-light  interferometry  has  been  used  to  measure  the  angular 
diameters  of  about  100  stars,  including  both  supergiant  stars  and  relatively  near  red 
giant  stars.  Current  observations  of  the  angular  diameter  of  the  supergiant  Betel¬ 
geuse,  at  0.055",  indicate  that  it  has  a  radius  of  1,180  solar  radii,  which  is 
equivalent  to  5.48  AU  -  where  1  AU  =  1.496  x  1011  m  is  the  mean  distance 
between  the  Earth  and  the  Sun.  The  orbital  distance  of  Jupiter  from  the  Sun  is 
5.2  AU;  therefore  Betelgeuse  and  other  supergiant  stars  would  fill  much  of  our 
major  planetary  system.  The  Hubble  Space  Telescope  ( HST)  was  used  to  obtain  an 
image  of  Betelgeuse,  obtaining  the  first  direct  picture  of  the  visible  disk  of  any  star 
other  than  the  Sun.  Interferometric  measurements  in  a  recent  15  year  period 
suggest  that  Betelgeuse  may  be  shrinking,  even  though  its  visible  brightness 
showed  no  significant  dimming  during  the  same  period  -  a  perplexing  result 
(Townes  et  al.  2009) 


Example:  Measuring  the  radius  of  stars 

The  radius  of  the  supergiant  star  Betelgeuse  is  RB  =  1180  RQ  = 

11  Q 

8.207  x  10  m,  where  the  solar  radius  is  R&  =  6.955  x  10  m.  So  the  radius 
of  Betelgeuse  is  equivalent  to  5.486  AU,  where  1  AU  =  1.496 
x  1011  m  is  the  mean  distance  between  the  Earth  and  the  Sun.  The  star’s 
distance  is  DB  =  643  light-years,  with  1  light-year  =  9.460  x  10 15  m.  The 
angular  diameter  0B  of  Betelgeuse  is  therefore  0B  =  2 RB/DB  «  2.7  x 

i—i 

10  radians  =  0.0556",  where  "  denotes  seconds  of  arc  and  1  radian  = 
206,265".  The  angular  resolution,  0r,  of  an  interferometer  consisting  of  two 
mirrors  separated  by  a  distance  Dr  is  6r  =  2/D/  radians  for  radiation  at  a 
wavelength  2.  The  interference  fringes  of  such  an  interferometer  would  dis¬ 
appear  when  6r  =  0B ,  or  at  a  mirror  spacing  of  D7  =  2/0#  =  2.59  m  for  red 

,~j 

light  from  Betelgeuse  at  a  wavelength  2  =  700  nm  =  7  x  10_/  m. 
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The  angular  diameter  of  the  red  giant  star  R  Doradus  is  6R  =  0.057", 
about  the  same  as  Betelgeuse  because  the  red  giant  is  much  closer,  at  a 
distance  of  about  DR  =  178  light-years.  After  converting  this  angle  into 
radians,  we  obtain  a  radius  for  Doradus  of  Rr  =  DR  9R/2  «  2.3  x  1011  m 
«  330  R0,  where  0R  =  2.76  x  10-7  radians  and  DR  =  1.68  x  1018  m. 

The  red  giant  star  Arcturus  is  so  close,  at  a  distance  of  DA  =  36  light- 
years  =  3.40  x  10  m,  that  its  angular  size  has  also  been  measured,  at 

r—J 

0A  =  0.021"  =  10  radians.  We  therefore  obtain  a  radius  for  Arcturus  of 
Ra  =  Da6a/2  «  1.7  x  1010  m  «  25  RQ  «  0.1  AU. 

Optical  interferometry  with  the  Very  Large  Telescope  Interferometer, 
abbreviated  VLTI,  on  the  Cerro  Paranal  in  Chile,  found  the  angular  diameter 
of  Proxima  Centauri,  the  nearest  star  other  than  the  Sun,  to  be 
0PC  =  0.0010"  =  5  x  10-9  radians,  which  at  its  distance  of  4.24  light- 

i  /:  o 

years  =  4.0  x  10  m  corresponds  to  a  radius  of  about  10  m,  or  one-sev- 

'-j 

enth  that  of  the  Sun  and  1.4  times  the  radius  of  Jupiter  Rj  k,  7  x  10  m. 
The  star’s  estimated  mass  is  0.123  M0  =  2.446  x  1029  kg,  where  M0 
denotes  the  mass  of  the  Sun,  and  129  times  Jupiter’s  mass 
Mj  =  1.90  x  1027  kg. 


Fig.  10.2  The  flames  of  Betelgeuse  The  red  supergiant  star  Betelgeuse  is  slowly  shedding  its 
outer  atmosphere,  producing  out-flowing  gas  that  envelops  the  star  and  a  much  bigger  nebula  of 
gas  and  dust  that  surrounds  it.  The  small  circle  in  the  middle  of  black  disk  denotes  the  edge  of  the 
supergiant’s  optically  visible  disk;  it  has  a  diameter  of  about  5.4  AU,  where  one  AU  is  the  mean 
distance  between  the  Earth  and  the  Sun.  The  black  disk  masks  the  bright  central  radiation  of  the 
star,  in  order  to  detect  the  infrared  radiation  of  the  outer  plumes.  They  stretch  to  about  400  AU,  or 
60  million  million,  or  6  x  1013,  m  from  the  supergiant  Betelgeuse.  (Courtesy  of  ESO/VI- 
SIR/VLT /Pierre  Kervella.) 


10.1  Comparisons  of  the  Sun  with  Other  Stars 


309 


Supergiant  stars  are  so  large  that  they  cannot  hold  onto  their  outer  atmosphere. 
They  are  enveloped  by  dust  and  gas  blown  out  by  the  stars’  winds  and  have  no 
well-defined  apparent  “edge”  (Fig.  10.2). 


Example:  Mass  loss  from  supergiant  stars 

The  supergiant  VY  Canis  Majoris  has  a  mass,  M,  of  about  40  times  that  of  the 
Sun,  or  40  M0,  but  a  radius,  R ,  of  about  2,000  solar  radii,  or  2,000  R& ,  which  is 
so  far  from  the  star’ s  center  that  the  stellar  gravity  GM /R  is  but  one  hundred 
thousandth,  or  10-5,  that  of  the  Sun  at  its  radius.  The  radius  of  the  Sun  is 
Rq  =  6.955  x  108  m  and  the  Sun’s  mass  is  M0  =  1.989  x  1030  kg.  The 
escape  velocity  (Sect.  3.2)  required  to  overcome  VY  Canis  Majoris’  gravita¬ 
tional  pull  is  Vesc  =  ( 2GM/R )^2  «  8.7  x  104  m  s_1,  where  the  gravita- 
tional  constant  G  =  6.674  x  10  mkg  s  .  The  average  thermal  speed 
of  a  hydrogen  atom  in  the  visible  disk  of  this  star  is  (Sect.  5.2)  is  Vther. 
mal  ( 3kT/mH)l/ 2  «  8.6  x  103  m  s  ',  where  the  Boltzmann  constant 
k  =  1.381  x  10-23  J  K-1,  mH  =  1.66  x  10-27  kg,  and  the  disk  temperature 
of  the  star  is  T  =  3,000  K.  Since  the  average  thermal  speed  is  just  10  times  less 
than  the  escape  velocity,  the  atoms  in  the  higher  part  of  the  Maxwell  speed 
distribution  should  have  little  trouble  overcoming  the  weak  gravitational  pull 
in  the  outer  atmosphere  of  the  star  and  breaking  away  from  it.  Images  taken 
from  the  Hubble  Space  Telescope  reveal  arcs,  filaments,  and  concentrations  of 
material  formed  by  the  massive  outflows  from  this  supergiant  star,  some  of 
them  moving  close  to  the  star’s  escape  velocity. 


The  next  biggest  known  supergiant  VV  Cephei  A  is  surrounded  by  opaque 
shells  of  a  highly  extended  atmosphere,  and  is  not  entirely  spherical  in  shape.  The 
supergiant  star  Betelgeuse  is  enveloped  by  gas  and  dust  that  extends  out  to  400  AU 
from  the  star  (Fig.  10.2). 

For  most  stars,  the  radius  is  determined  from  the  luminosity  and  temperature 
using  the  Stefan- Boltzmann  law ,  which  states  that  a  star’s  luminosity  increases 
with  the  square  of  its  radius  and  the  fourth  power  of  its  disk  temperature.  That  is, 
the  luminosity  Ls  of  a  star  is  intimately  related  to  the  star’s  radius  Rs  and  effective 
temperature  Teff.  If  any  two  of  these  quantities  are  known,  the  third  can  be  found 
using  the  Stefan-Boltzmann  law 


Ls  =  4noR2sTAeff  (10.17) 

where  n  =  3.14159  and  the  Stefan-Boltzmann  constant  o  =  5.6704  x 
10-8  J  m-2  K-4  s'1  (Sect.  2.4).  The  Austrian  physicist  Joseph  Stefan 
(1835-1893)  first  derived  this  law  (Stefan  1879),  on  the  basis  of  experimental 
measurements  made  by  the  English  physicist  John  Tyndall  (1820-1893),  and 
Stefan’s  student  Ludwig  Boltzmann  (1844-1906)  derived  it  from  theoretical 
considerations,  using  thermodynamics  (Boltzmann  1872). 
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This  means  that 


Ls 

47l(jTeff 


(10.18) 


In  this  expression,  the  effective  temperature,  7^,  is  the  temperature  of  a  thermal 
(blackbody)  gas  emitting  the  observed  luminosity,  which  is  close  to  the  temper¬ 
ature  of  the  visible  stellar  disk,  known  as  the  photosphere.  The  radius  is  that  of  the 
photosphere,  which  is  the  level  at  which  the  stellar  gases  become  opaque  at  visible 
wavelengths. 


10.1.8  How  Massive  are  the  Stars? 

The  mass  of  a  star  usually  is  expressed  in  units  of  the  mass  of  the  Sun, 

on 

Mq  =  1.989  x  10  kg,  which  is  determined  from  Kepler’s  third  law  and  the 
orbital  period  and  distance  of  the  Earth,  or  from  the  length  of  the  year  and  the  AU 
(Sect.  3.3).  The  range  in  the  mass  of  most  stars  is  relatively  small,  between  about 
0.1  and  100  M0.  The  Sun  is  on  the  lower  side  of  the  stellar  mass  range,  as  are  most 
stars.  The  most  massive  stars  are  relatively  rare  due  to  their  relatively  short 
lifetime. 

Although  there  is  not  much  variation  between  the  masses  of  the  stars,  the  mass 
of  a  star  determines  a  star’s  luminosity,  its  effective  disk  temperature,  the  length  of 
its  life,  and  its  ultimate  fate.  A  small  increase  in  a  star’s  mass,  for  example,  implies 
a  big  increase  in  its  luminosity.  Stars  of  lower  mass  have  less  weight  pressing 
down  on  their  core,  so  their  core  is  cooler,  the  rate  of  their  thermonuclear  reactions 
is  slower,  and  the  stars  are  dimmer.  The  life  span  of  stars  also  depends  on  their 
mass.  The  more  massive  a  star  is,  the  shorter  its  life  span.  A  star  of  greater  mass  is 
more  luminous,  burns  its  nuclear  fuel  at  a  greater  rate,  and  depletes  its  available 
energy  in  a  shorter  time. 

When  the  measured  masses  of  stars  are  combined  with  observations  of  the  stars’ 
luminosity,  we  find  that  stellar  luminosity  increases  rapidly  with  increasing  mass. 
The  reason  for  this  increase  is  the  hotter  temperature  at  the  center  of  a  high-mass 
star  when  compared  to  that  of  a  low-mass  star.  The  rate  of  nuclear  reactions  is 
greater  at  the  higher  temperature;  therefore,  the  luminosity  of  the  massive  star  is 
greater. 

The  English  astronomer  Arthur  Stanley  Eddington  (1882-1944)  first  considered 
the  theoretical  aspects  of  such  a  mass-luminosity  relation  (Eddington  1924).  For  a 
star  whose  mass  is  supported  by  gas  pressure,  for  example,  the  internal  tempera¬ 
ture  scales  directly  with  the  mass,  and  the  greater  the  mass,  the  hotter  the  central 
temperature  and  the  greater  the  stellar  luminosity. 
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Current  data  show  that  the  luminosity,  Ls ,  of  most  stars  increases  in  rough 
proportion  to  the  fourth  power  of  the  mass,  Ms ,  with  a  stellar  mass-luminosity 
relation  (Fig.  10.3)  given  by: 


Ls  =  constant  x  M^  5.  (10.19) 

and 


or 


(10.20) 


(10.21) 


o/:  _ i 

where  the  subscript  0  denotes  the  solar  value,  with  L0  =  3.828  x  10"  J  s  and 
M0  =  1.989  x  10  kg.  Notice  that  although  the  range  of  stellar  masses  is  rela¬ 
tively  small,  usually  between  about  0. 1  and  60  M0  in  mass,  the  stellar  luminosity 

_ Q  ^ 

varies  over  about  nine  orders  of  magnitude  from  10  to  10  L0. 


Fig.  10.3  Stellar  mass- 
luminosity  relation  An 

empirical  mass-luminosity 
relation  for  main- sequence 
stars  of  absolute  luminosity, 
L,  in  units  of  the  solar 
luminosity,  L0;  and  mass,  M, 
in  units  of  the  Sun’s  mass, 
Mq.  The  straight  line 
corresponds  to  a  luminosity 
that  is  proportional  to  the 
fourth  power  of  the  mass.  The 
English  astronomer  Arthur 
Eddington  (1882-1944) 
proposed  a  theoretical 
explanation  for  this  relation 
in  1924 
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At  about  0.08  M0  we  reach  the  lower  limit  for  a  gaseous  body  to  become  a  star. 
Its  central  regions  are  too  rarefied  and  too  cool  to  sustain  the  hydrogen-burning 
reactions  that  energize  a  Sun-like  star  and  make  it  shine.  Some  of  these  nonstellar 
objects,  known  as  brown  dwarfs,  can  glow  for  a  brief  time  as  the  result  of  heat 
generated  during  their  formation  by  gravitational  contraction.  Although  never  hot 
enough  for  proton  fusion,  certain  brown  dwarf  stars  can  shine  for  a  time  by 
burning  deuterium  that  was  present  in  the  star  at  the  time  of  its  birth.  The  low-mass 
brown  dwarfs  eventually  cool,  compressing  their  near-stellar  mass  into  the  size  of 
planets  and  disappearing  from  view. 

As  the  mass  increases,  so  does  the  central  temperature.  Nevertheless,  the 
temperature  inside  a  star  cannot  become  too  high,  and  therefore  its  mass  cannot 
exceed  an  upper  bound  of  120  M0.  That  is,  the  internal  temperature  and  pressure 
of  a  very  massive  star  can  become  so  great  that  the  star  will  be  blown  apart  from 
inside  (Focus  10.1).  One  of  the  most  massive  known  stars  is,  for  example,  R136al, 
with  a  mass  of  about  270  times  that  of  the  Sun,  and  it  has  been  shedding  a  large 
fraction  of  its  initial  mass  through  a  continuous  stellar  wind.  It  is  estimated  that,  at 
its  formation,  the  star  held  320  solar  masses  and  that  it  has  lost  50  solar  masses 
over  the  past  million  years. 


Focus  10.1  The  upper  mass  limit  for  a  star 

At  formation,  a  star  cannot  keep  on  getting  larger  and  more  massive.  There 
is  a  limit  established  when  a  star  gets  so  big  that  the  outward  force  of  its 
internal  radiation  exceeds  the  inward  gravitational  force  of  the  entire  star. 

Although  the  gas  pressure  of  the  hot,  moving  subatomic  particles  supports 
a  star  like  the  Sun,  radiation  pressure  becomes  important  in  more  massive 
stars.  As  realized  by  the  prolific  Arthur  Stanley  Eddington  (1882-1944),  the 
outward  pressure  of  radiation  can  exceed  gas  pressure  in  very  massive  stars 
(Eddington  1916),  and  a  decade  later  he  showed  that  this  results  in  a  max¬ 
imum  luminosity,  now  called  the  Eddington  luminosity,  at  which  the  radi¬ 
ation  blows  away  the  outer  atmosphere  of  a  star  (Eddington  1926a,  b). 

The  temperatures  within  stars  are  high  enough  to  ionize  atoms,  creating  a 
plasma  of  free  electrons  and  protons.  As  the  radiation  produced  within  the 
core  of  a  star  works  its  way  out,  the  free  electrons  scatter  the  radiation  with  a 
Thomson  scattering  cross  section  aT  =  6.65246  x  10  ^  m  .  The  outward 
force  of  the  radiation,  FR ,  on  the  free  electrons  at  distance  r  from  the  center 
of  the  star  is 

F  _  Ls 

R  c  4nr2 

where  Ls  is  the  radiation  luminosity,  n  «  3.14159,  and  the  speed  of  light 

o  1 

c  =  2.9979  x  10  m  s  .  The  inward  force  of  gravitation,  FG,  of  a  star  of 
mass  Ms  on  a  proton  at  distance  r  is: 
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GMsmp 
Fg  = - 5 — 

_ 1  i  n _  _ O 

where  the  gravitational  constant  G  =  6.61  A  x  10  N  m  kg  “  and  the 
proton  mass  mP  =  1.6726  x  10  ~  kg.  When  the  two  forces  are  equal,  at 
the  maximum  Eddington  luminosity  LEdd,  we  have 


As  —  LEdd  — 


AnGmpc 

(7  p 


Ms  ~  6.3  Ms  ~  1.3  x  10 
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Mq 


Js 


(10.24) 


or 


Ls  =  3.3  x  104—  L0  Js-1, 

Mq  ° 

for  the  Sun’s  mass  M0  =  1.989  x  1030  kg  and  the  solar 
L0  =  3.828  x  1026  J  s"1. 

If  the  luminosity  of  a  star  reaches  the  Eddington  luminosity,  a  significant 
proportion  of  the  star’s  outer  layers  are  ejected  into  space,  and  this  therefore 
sets  a  limit  to  the  mass  the  star  can  accumulate  at  formation.  To  a  first 
approximation,  we  can  use  the  mass-luminosity  relation  to  obtain  this  lim¬ 
iting  mass  from: 


(10.25) 

luminosity 


3.5 

L(d  <  LEdd 


(10.26) 


or 


(— )  <3.28  x  104 

\mJ  ~ 

to  obtain  the  upper  mass  limit 

Ms  <  64  Mq. 

This  is  an  approximate  limit,  and  a  more  precise  value  can  be  obtained  by 
setting  the  radiation  pressure  equal  to  the  gas  pressure,  and  assuming  that  the 
gravitational  binding  energy  of  the  star,  GM^/R,  is,  from  the  virial  theorem, 
3  times  the  product  of  the  gas  pressure  and  the  volume,  resulting  in  an  upper 
mass  limit  of  M  <  110  MQ 


(10.27) 


(10.28) 


A  direct  measurement  of  stellar  mass  can  be  obtained  from  observations  of  the 
relative  motion  of  two  stars  in  a  binary-star,  or  double-star,  system.  Popper  (1980) 
gave  us  a  review  of  stellar  masses. 

The  members  of  a  double-star  system  are  in  mutual  orbit  around  one  another, 
revolving  about  a  common  center  of  mass.  If  the  orbital  period  and  the  distance 
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separating  the  two  stars  are  measured,  for  example,  the  sum  of  their  masses, 
Mi  +  M2,  can  be  determined  from  Kepler’s  third  law  (Sect.  3.2): 


M  i  +  M2 


47i2a3 

GP 2 


(10.29) 


for  a  linear  star  separation,  a ,  and  an  orbital  period,  P,  where  the  gravitational 

_ 11  o  _ 1  _ o 

constant  G  =  6.674  x  10  m  kg  s  .  Since  orbits  are  always  mutual  (each 
star  going  about  the  other),  the  relative  sizes  of  the  orbits  provide  the  ratio  of  the 
masses,  and  the  combination  of  the  sum  and  the  ratio  gives  the  individual  masses 
of  the  stars  can  be  found  and  compared  to  the  Sun’s  mass. 


Example:  Measuring  the  mass  of  two  stars  in  a  binary  system 

Suppose  the  spectral  lines  of  two  stars  shift  back  and  forth  with  a  period  of 

^ 7 

P  =  2  years  =  6.312  x  10  s,  that  the  lines  of  star  1  shift  twice  as  far  as  the 
lines  of  the  other  star  2,  and  Doppler  shift  observations  of  spectral  lines 
indicates  an  orbital  speed  of  V  =  100  km  s-1  for  star  1  relative  to  star  2.  The 
semi-major  axis  a  of  the  system  can  be  determined  from  the  circumference 
of  one  orbit  2na  =  VP ,  or  a  k,  10  "  m.  Then  the  sum  of  the  masses  can  be 
calculated  from  Newton’s  expression  of  Kepler’s  third  law, 
Mi  +  M2  =  4n2a3 /(GP2)  ^  1.5  x  1032  kg,  where  the  gravitational  con- 

_ 1  1  o  _ 1  _ n 

stant  G  =  6.674  x  10  m  kg  s  .  Because  the  lines  of  star  1  move 
twice  as  far  as  those  of  star  2,  star  1  is  half  as  massive  as  star  2,  with  star  1 
weighing  in  at  about  0.5  x  10  ~  kg  =  25  M0  and  star  2  at  about 
1.0  x  1032  kg  or  50  M0,  where  the  Sun’s  mass  M0  =  1.989  x  1030  kg. 


Some  members  of  binary-star  systems  are  tens  of  thousands  of  AU  apart, 
whereas  others  touch  one  another.  Moreover,  the  binary  stars  come  in  at  least  four 
varieties.  There  are  visual  binaries,  the  two  components  of  which  both  can  be 
resolved  with  a  telescope  and  separately  observed  (Fig.  10.4);  however,  they  are 
separated  so  widely  that  their  orbital  period  about  a  common  center  of  mass  is 
often  greater  than  a  human  lifetime. 

The  periodic  motion  of  just  one  component  of  an  astrometric  binary  is 
observed,  whereas  its  companion  is  too  faint  to  be  seen.  An  eclipsing  binary  is  a 
pair  of  stars  whose  orbital  plane  contains  the  Earth’s  line  of  sight,  so  we  period¬ 
ically  observe  the  stars  when  they  pass  in  front  of  or  behind  one  another 
(Fig.  10.5). 

A  famous  example  of  an  eclipsing  binary  system  is  the  two  brightest  stars  in  the 
Algol  system:  they  have  an  orbital  period  of  2.87  days  and  a  combined  mass  of 
about  4.5  solar  masses.  The  stars  are  located  at  a  distance  of  28.5  pc,  or  93  light- 
years,  and  are  separated  by  only  0.062  AU,  where  1  AU  =  1.496  x  1011  m  is  the 
mean  distance  between  the  Earth  and  the  Sun.  The  two  stars  are  so  close  to  one 
another  that  the  more  massive  and  bigger  component  has  entered  the  gravitational 
sphere  of  influence  of  the  other,  transferring  mass  to  it. 
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Fig.  10.4  Alpha  Centauri  Two  of  the  most  brilliant  stars  in  the  southern  sky  appear  as  a  single 
star,  named  Alpha  Centauri,  to  the  unaided  eye,  but  they  can  be  resolved  into  two  stars  with  the 
aid  of  binoculars  or  a  small  5  cm  (2  inch)  telescope.  The  yellowish  Alpha  Centauri  A  ( lower  left), 
also  known  as  Rigil  Kentaurus,  and  the  blue  Alpha  Centauri  B  ( upper  right )  are  locked  together 
in  a  gravitational  embrace,  orbiting  each  other  every  79.91  years.  The  two  components  of  this 
binary-star  system  can  approach  one  another  within  11.2  AU  and  may  recede  as  far  as  35.6  AU, 
where  the  mean  distance  between  the  Earth  and  the  Sun  is  1  AU  =  1.495  x  1011  m.  Both  stars 
have  a  mass  comparable  to  that  of  the  Sun,  denoted  M0,  of  1.1  M0  and  0.90  M0  for  A  and  B,  and 
a  luminosity  near  that  of  the  Sun,  at  1.519  L0  and  0.500  L0.  They  appear  bright  because  they  are 
very  nearby,  at  a  distance  of  just  4.37  light-years.  A  third  and  faint  companion  Proxima  Centauri 
has  a  luminosity  of  just  0.0017  L0  and  is  located  at  about  15,000  AU  or  2.2°  from  the  two  bright 
stars.  At  a  distance  of  4.24  light-years  from  the  Earth,  Proxima  Centauri  is  the  closest  star  other 
than  the  Sun.  (Courtesy  of  ESO/Yuri  Beletsky.) 


Eclipsing  Binary 


Non-Eclipsing  Binary 


o 


Fig.  10.5  Double  stars  Two  close  stars  are  joined  in  a  gravitational  embrace,  orbiting  each  other 
and  forming  a  binary  star  system  ( left  and  right).  The  orbital  period  and  linear  star  separation  can 
be  used  to  determine  the  sum  of  their  masses.  If  the  orbital  plane  of  the  two  companions  is 
sufficiently  inclined  and  within  the  line  of  sight,  the  star  system  becomes  an  eclipsing  binary 
(left),  in  which  one  star  is  observed  to  pass  behind  the  other  and  vice  versa;  this  can  provide 
additional  information  about  the  stars.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang, 
published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


Algol  also  is  an  example  of  the  spectroscopic  binary  stars.  The  separation  of 
this  type  of  star  pairs  can  be  inferred  from  their  orbital  velocity  and  period.  But 
this  is  an  approximate  determination,  since  the  actual  orbital  velocities  can  be 
greater  than  the  line-of-sight  radial  velocities  measured  from  the  Doppler  shift  of 
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the  spectral  lines.  The  spectral  variations  of  a  spectroscopic  binary  reveal  the 
orbital  motion  of  its  unresolved  components  (Focus  10.2). 


Focus  10.2  Determining  the  stellar  mass  in  a  spectroscopic  binary 

system 

The  analysis  of  binary  stars  usually  assumes  circular  motion  about  a  center 

of  mass  located  between  two  stars.  We  have: 

r\M\  =  r2M2 

(10.30) 

with  Mi  and  M2  being  the  masses  and  r\  and  r2  their  respective  distances  to 
the  center  of  mass.  Thus,  if  a  =  r\  +  r2  is  the  separation  between  the  masses, 

Mi,  \ 

n=M,(“  'l) 

(10.31) 

or 

m2 

r\  = - a 

Mi +M2 

(10.32) 

and 

Mi 

r2  = - a. 

Mi +M2 

(10.33) 

We  can  find  the  total  mass  of  the  binary  system  using  Newton’s 
Kepler’s  third  law.  The  orbital  period,  P,  is  given  by: 

version  of 

9  47T2  0 

p-  a 3 

G(M  1  +  M2) 

(10.34) 

or 

47 z2a3 

Mi+M2=  Qpl 

(10.35) 

which  provides  the  sum  of  the  two  stellar  masses. 

In  spectroscopic  binaries,  the  stars  are  usually  not  resolved, 

and  their 

separations  cannot  be  measured,  but  oscillations  in  the  line-of-sight  veloc¬ 
ities  are  inferred  from  Doppler  shifts  of  spectral  lines.  Because  the  per¬ 
pendicular  to  the  orbital  plane  is  inclined  to  the  line  of  sight  by  an  angle  i, 
the  Doppler  velocity  amplitudes  will  be  related  to  the  true  orbital  velocity 

amplitudes  by: 

Viobs  =  Vi  sin/, 

(10.36) 
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and 


V 


lobs 


V, 


sinz 


(10.37) 


where  the  symbol  \x\  denotes  the  absolute  value  of  v.  Since 


Vtl  = 


2nr\ 


(10.38) 


and 


\V2 


2nr2 

p  ’ 


then 


(10.39) 


V lobs  I  ^  1  Mi 


v lobs  I  Cz 


(10.40) 


Replacing  a  with  rx  +  r2  =  P{\V\0bs\  +  |V2ofo|)/(27rsin/)  in  Kepler’s  third 
law  and  using  these  expressions  for  rx  and  r2  we  can  obtain: 


(Mi  +  M2)  sinJ  i 


3  P{\Vlobs\  +  |  Vlobs  | ) 3 


2ttG 


(10.41) 


If  the  spectrum  of  only  one  star,  designated  by  the  subscript  7,  is  detected, 
due  to  the  faintness  of  the  second  one,  we  can  use: 


(Mi  +  M2)  sin3  i  = 


V\obs\  + 


MiV 

M2y 


2ttG 


(10.42) 


or 


Ml 


(Mi  +  M2)' 


sin  «  = 


2nG 


(10.43) 


When  M2  is  much  less  than  Mx,  which  would  account  for  the  faint  luminosity 
of  the  second  star,  we  obtain: 


M2  sin  i  ~  (^)  Vio/»|m|. 


(10.44) 


As  might  be  expected,  bigger  stars  are  more  massive,  and  there  are  fewer  stars 
of  high  mass  than  those  with  low  mass.  The  distribution  of  stars  relative  to  mass  is 
known  as  the  initial  mass  function,  with  the  term  initial  meaning  the  mass  with 
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which  the  stars  were  formed  before  their  subsequent  evolution.  The  Austrian-born 
American  astronomer  Edwin  E.  Salpeter  (1924-2008)  derived  the  initial  mass 
function  for  stars  more  massive  than  the  Sun  (Salpeter  1955),  and  found  that  the 
number  of  stars  with  masses  in  the  range  M  to  M  +  dM  is  proportional  to  M~235. 
The  number  falls  off  roughly  as  the  inverse  square  of  the  mass  and  indicates  that 
the  star-formation  process  results  in  many  more  stars  of  low  mass  than  high  mass. 
When  compared  to  the  number  of  stars  with  a  mass  equal  to  that  of  the  Sun, 
denoted  M0,  there  are  roughly  100  times  more  stars  with  1/1 0th  of  that  mass,  at 
0.1  M0,  and  about  1  /  100th  fewer  stars  with  a  mass  of  10  solar  masses,  or  10  M0. 
Stars  of  higher  mass  are  also  bigger,  whereas  those  of  low  mass  are  relatively 
small;  the  small  stars  outnumber  the  large  stars. 


10.2  Main-Sequence  and  Giant  Stars 
10.2.1  The  Hertzsprung-Russell  Diagram 

Once  the  luminosity  of  stars  was  obtained  from  their  brightness  and  measurements 
of  their  distance,  astronomers  were  able  to  show  that  most  stars  exhibit  a  sys¬ 
tematic  decrease  in  luminosity  as  one  progresses  through  the  spectral  sequence  O, 
B,  A,  F,  G,  K,  M.  This  progression  is  exactly  what  we  would  expect  because  the 
spectral  sequence  also  denotes  a  scale  of  decreasing  stellar  temperatures,  and  the 
luminosity  of  a  radiating  body  depends  strongly  on  temperature. 

The  luminosity  drop  is  illustrated  in  the  famous  Hertzsprung-Russell  diagram 
( H-R )  diagram,  of  luminosity  or  absolute  magnitude  plotted  against  the  spectral 
class  or  effective  temperature  (Fig.  10.6).  The  diagram’s  name  derives  from  the 
Danish  astronomer  Ejnar  Hertzsprung  (1873-1967),  who  plotted  such  diagrams  for 
the  Pleiades  and  Hyades  star  clusters,  and  the  American  astronomer  Henry  Norris 
Russell  (1877-1957),  who  published  an  early  version  of  this  diagram  for  both 
noncluster  and  cluster  stars  (Hertzsprung  1911;  Russell  1914). 

Most  stars,  including  the  Sun,  lie  on  the  main  sequence  that  extends  diagonally 
from  the  upper  left  to  the  lower  right,  or  from  the  high-luminosity,  high-temper- 
ature  blue  stars  to  the  low-luminosity,  low-temperature  red  stars.  The  stars  on  the 
main  sequence  are  the  most  common  type  in  the  Milky  Way,  constituting  about 
90  %  of  its  stars. 

The  Stefan- Boltzmann  law  describes  the  general  characteristics  of  the  H-R 
diagram.  It  is  given  by: 


Ls  =  AnaR2sTAeff ,  (10.45) 

where  Ls  is  the  luminosity  of  the  star,  n  =  3.14159,  the  Stefan-Boltzmann  con¬ 
stant  o  =  5.6704  x  10-8  J  s'1  m-2  K-4,  the  radius  of  the  star  is  Rs ,  and  Teff  is  the 
effective  temperature  of  the  visible  stellar  disk.  This  expression  indicates  that  for  a 
fixed  radius,  the  luminosity  of  a  star  increases  with  the  fourth  power  of  the 
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Spectral  Class 


Fig.  10.6  Original  Hertzsprung-Russell  diagram  The  absolute  luminosity,  in  magnitude  units 
(■ vertical  axis )  plotted  in  1914  by  Henry  Norris  Russell  (1877-1957)  as  a  function  of  spectral  class 
(top  horizontal  axis )  for  four  moving  star  clusters:  the  Hyades  ( black  dots),  the  Ursa  Major  group 
(small  crosses),  the  large  group  in  Scorpius  (small  open  circles),  and  the  61  Cygni  group 
(triangles).  The  large  circles  and  crosses  represent  points  calculated  from  the  mean  parallaxes 
and  magnitudes  of  other  groups  of  stars.  The  two  diagonal  lines  mark  the  boundaries  of  Ejnar 
Hertzsprung’s  (1873-1967)  observations  of  the  Pleiades  and  Hyades  open  star  clusters  in  1911; 
this  now  is  known  as  the  main  sequence  along  which  most  stars,  including  the  Sun,  are  located. 
The  giant  stars  are  located  at  the  upper  right.  In  his  publication,  Russell  included  a  similar 
diagram  for  individual  bright  stars,  the  distances  of  which  had  been  established  from  stellar 
parallax  measurements.  It  closely  resembled  the  diagram  shown  here  with  an  exceptional  point  in 
the  lower  left-hand  corner,  which  is  included  here  with  an  “x”  mark.  This  star  is  the  faint 
companion  of  a  double-star  system  Omicron  Eridani,  or  40  Eridani,  now  known  to  be  a  white 
dwarf  star.  [Adapted  from  Russell  (1914).] 


effective  temperature;  therefore,  colder  stars  are  less  luminous.  That  is  exactly 
what  happens  along  the  main  sequence,  for  although  the  radius  varies  by  a  rela¬ 
tively  small  amount  along  the  main  sequence,  the  luminosity  variation  is  due 
mainly  to  a  change  in  temperature. 

The  observations  also  showed  a  different,  unanticipated  effect  that  gives  the 
H-R  diagram  a  peculiar  shape.  Some  of  the  stars  retain  a  high  luminosity  at 
decreasing  temperature,  in  a  band  that  extends  to  the  upper  right  of  the  H-R 
diagram  (see  Fig.  10.6).  This  could  be  explained  if  the  luminous  cool  stars  were 
larger  in  radius  than  the  less  luminous  ones,  with  the  increase  in  size  offsetting  the 
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Color  Index  (B-V) 
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Fig.  10.7  Hertzsprung-Russell  diagram  for  nearby  stars  A  plot  of  the  luminosity  ( left 
vertical  axis )  in  units  of  the  Sun’s  absolute  luminosity,  denoted  L0  =  3.828  x  10  J  s  , 
against  the  effective  temperature  of  the  star’s  disk  in  degrees  kelvin,  designated  K  ( bottom 
horizontal  axis )  for  22,000  stars  in  the  catalogue  of  the  HIPPARCOS  satellite.  This  plot  is  known 
as  the  Hertzsprung-Russell  (H-R)  diagram.  The  absolute  visual  magnitude  (right  vertical  axis ) 
and  color  index,  B-V  ( top  horizontal  axis )  are  also  designated.  Most  stars,  including  our  Sun,  lie 
along  the  main  sequence,  which  extends  from  the  high-temperature  blue-white  stars  at  the  top  left 
to  the  low-temperature  red  stars  at  the  bottom  right.  The  Sun  is  a  main-sequence  star  with  an 
absolute  visual  magnitude  Mv  =  4.8  and  color  index  B-V  =  0.68.  The  radiation  from  all  main- 
sequence  stars  is  sustained  by  hydrogen-burning  reactions  in  their  core.  Stars  of  about  the  Sun’s 
mass  evolve  into  helium-burning  red  giant  stars,  located  in  the  upper-right  side  of  the  diagram. 
Very  rare  bright  giant  stars  and  extremely  scarce  and  luminous  supergiants  are  found  above  the 
giant  stars  and  along  the  top  of  the  diagram.  Faint  and  initially  hot  white  dwarf  stars  are  located  in 
the  lower  left  side.  Due  to  their  low  luminosity,  these  endpoints  of  stellar  evolution  are  relatively 
difficult  to  observe.  (Data  courtesy  of  the  ESA  /HIPPARCOS  mission.  From  “The  Life  and  Death 
of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


drop  in  temperature.  The  Stefan-Boltzmann  law  indicates  that  for  a  fixed  tem¬ 
perature,  the  luminosity  of  a  star  increases  with  the  square  of  the  radius.  If  a  star  is 
25,000  times  more  luminous  than  the  Sun,  with  the  same  temperature,  it  follows 
from  the  Stephan- Boltzmann  law  that  it  will  be  158  times  bigger  than  the  Sun. 

Russell  realized  that  he  had  found  another  type  of  star,  which  he  named  giants 
for  their  large  size  (Russell  1913).  These  stars  have  a  radius  as  large  as  the  mean 
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distance  between  the  Earth  and  the  Sun.  Russell  also  used  the  name  dwarfs  for  the 
more  numerous  main-sequence  stars,  because  they  are  smaller  than  the  giants,  but 
the  designation  is  confusing.  There  is  no  observable  difference  between  the  size 
and  luminosity  of  the  hottest  dwarf  and  most  giant  stars,  and  the  white  dwarf  stars 
are  not  even  on  the  main  sequence.  In  this  book,  therefore,  we  retain  the  desig¬ 
nation  giant  stars ,  but  use  the  term  main- sequence  stars  for  the  other  stars. 

There  are  relatively  few  giant  stars  when  compared  to  the  number  of  stars  on 
the  main  sequence.  This  is  because  stars  spend  the  majority  of  their  lifetime  on  the 
main  sequence,  and  the  giant  stars  belong  to  a  subsequent  and  shorter-lived  part  of 
a  star’s  evolution. 

Nearly  a  century  of  increasingly  accurate  and  extensive  observations  confirmed 
the  initial  characteristics  of  the  H-R  diagram  (Fig.  10.7).  To  assist  physical 
interpretations,  the  luminosities  are  displayed  along  the  left  vertical  axis,  and  the 
color  index,  or  equivalently,  the  effective  temperature  of  the  stellar  disk  is  on  the 
bottom  horizontal  axis. 

Chiosi  et  al.  (1992)  have  reviewed  developments  in  our  understanding  of  the 
H-R  diagram.  Reid  (1999)  reviewed  the  H-R  diagram  and  the  galactic  distance 
scale  after  the  HIPPARCOS  mission  and  Lebreton  (2000)  has  reviewed  the  mis¬ 
sion’s  implications  for  stellar  structure  and  evolution. 


10.2.2  The  Luminosity  Class 

There  was  an  unresolved  uncertainty  in  the  H-R  diagram,  which  created  a  dilemma 
for  specifying  the  physical  characteristics  of  a  star.  A  star  could  be  small  or  large 
as  well  as  hot  or  cold.  A  red  cool  star,  for  example,  might  be  either  much  more 
luminous  than  the  Sun  or  much  fainter.  Once  the  spectral  type  establishes  the 
temperature,  the  star  could  be  on  either  the  luminous  giant  or  the  dimmer  main- 
sequence  part  of  the  H-R  diagram.  To  resolve  this  ambiguity,  astronomers  found  a 
way  of  classifying  stars  by  their  luminosity  in  addition  to  their  spectral  type. 

Pioneering  investigations  by  the  American  astronomer  Walter  S.  Adams 
(1876-1956)  and  the  German  astronomer  Arnold  Kohlschiitter  (1883-1969)  found 
that  the  relative  intensities  of  certain  neighboring  spectral  lines  could  be  used  to 


Table  10.6  The  Morgan- 
Keenan  (M-K)  luminosity 
classes 

la 

lb 

II 

Bright  supergiants 

Supergiants 

Bright  giants 

III 

Giants 

IV 

Subgiants 

V 

Main  sequence  stars  (or  dwarfs) 

VI  (or  SD) 

Subdwarfs 

D  (or  VII) 

White  dwarfs 
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Spectral  Type 


O  B  A  F  G  K  M 


Fig.  10.8  Spectral  type  and  luminosity  class  in  the  H-R  diagram  When  both  a  star’s  spectral 
type  ( top  horizontal  axis )  and  luminosity  class  ( Roman  numerals)  are  known,  the  star’s 

O/T  _  1 

luminosity  ( left  vertical  axis)  in  units  of  the  Sun’s  luminosity,  denoted  L0  =  3.828  x  10  J  s  , 
and  effective  disk  temperature  ( bottom  horizontal  axis)  can  be  obtained.  The  spectral  types  are 
shown  at  the  coolest  temperature  for  each  type.  A  Roman  numeral  V  designates  the  main- 
sequence  stars:  subgiants  by  IV,  giants  by  III,  bright  giants  by  II,  and  supergiants  by  la  and  lb.  VI 
or  SD  denotes  the  subdwarfs,  and  D  or  VII  designates  the  white  dwarf  stars.  (From  “The  Life  and 
Death  of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted 
with  permission.) 


determine  the  luminosities  of  both  main  sequence  and  giant  stars  (Adams  and 
Kohlschiitter  1914). 

In  the  mid-twentieth  century,  William  W.  Morgan  (1906-1994)  and  Philip  C. 
Keenan  (1908-2000),  of  the  Yerkes  Observatory  in  Chicago,  introduced  the  M-K 
system  (Morgan  et  al.  1943),  in  which  the  most  luminous  and  largest  stars  have  the 
lowest  numbers,  given  in  Roman  numerals  (Table  10.6).  In  the  M-K  system,  the 
Roman  numeral  III  designates  the  giant  stars,  and  V  denotes  the  main- sequence 
stars;  Class  IV  of  subgiants  is  located  between  them.  The  most  luminous,  Class  I 
stars  are  the  supergiants,  shown  near  the  rarely  occupied,  upper  edge  of  the  H-R 
diagram.  Both  the  spectral  type  and  the  M-K  luminosity  class  can  be  specified  in 
the  H-R  diagram  (Fig.  10.8). 

Because  the  spectral  type  O,  B,  F,  G,  K,  or  M  depends  solely  on  the  physical 
properties  of  a  star’s  outer  atmosphere  -  the  photosphere  -  it  is  not  sufficient  to 
determine  the  star’s  internal  properties  and  evolutionary  status.  To  solve  this 
problem,  both  the  spectral  type  and  luminosity  class  are  provided  in  a  two- 
dimensional  scheme;  for  example,  the  Sun  is  designated  as  G2  V. 

If  we  know  a  star’s  luminosity  class,  we  can  find  its  luminosity,  or  absolute 
magnitude,  and  a  star’ s  distance  can  be  inferred  from  the  apparent  magnitude  and 
luminosity.  This  is  known  as  the  spectroscopic  distance ,  or  spectroscopic  parallax. 
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Fig.  10.9  Stellar  mass  and  lifetime  on  the  main  sequence  The  relation  between  a  star’s 
luminosity  ( left  vertical  axis),  in  units  of  the  Sun’s  luminosity,  denoted  L0  =  3.828  x 
10  J  s  ,  and  the  star’s  effective  disk  temperature  ( bottom  horizontal  axis )  in  degrees  kelvin, 
designated  K,  for  the  main-sequence  stars  in  the  Hertzsprung-Russell  diagram.  The  stellar  masses 
given  along  the  main-sequence  curve  are  in  units  of  the  Sun’s  mass  denoted 

on 

M0  =  1.989  x  10  kg.  Stars  of  higher  mass  are  hotter  and  more  luminous.  All  of  these  stars 
shine  by  hydrogen  burning  with  a  lifetime  that  also  is  denoted  along  the  main- sequence  curve. 
More  massive  stars  burn  their  hydrogen  fuel  at  a  faster  rate  and  have  a  shorter  lifetime.  (From 
“The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press, 
2013.  Reprinted  with  permission.) 


10.2.3  Life  on  the  Main  Sequence 

Today,  the  H-R  diagram  remains  a  primary  tool  for  tracing  the  path  of  stellar 
evolution,  but  the  routes  are  more  complex  than  initially  supposed.  Russell,  for 
example,  thought  that  most  stars  began  life  as  hot,  blue-white  stars  and  ended  their 
life  as  cool  red  ones,  moving  from  upper  left  to  lower  right  along  the  main 
sequence,  which  is  what  might  happen  if  the  stars  cool  with  time.  But  once 
scientists  understood  the  ways  that  nuclear  fusion  makes  a  star  shine,  the  early 
speculations  proved  to  be  wrong.  The  main  sequence  is  not  a  singular  evolutionary 
pathway,  as  once  thought;  it  is  simply  a  portrait  of  the  sky  at  one  moment, 
depicting  different  stars  of  varying  mass.  The  giant  stars  represent  later  rather  than 
earlier  stages  in  a  star’s  life  cycle.  As  it  turns  out,  a  star  begins  its  bright  shining 
life  on  the  main  sequence. 

Like  our  Sun,  other  stars  on  the  main  sequence  generate  energy  by  converting 
hydrogen  into  helium;  as  long  as  it  shines  in  this  way  a  star’ s  position  on  the  main 
sequence  does  not  change  substantially.  It  simply  slowly  becomes  more  luminous 
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Table  10.7  The  main- sequence  starsa 


Spectral 

type 

Effective  temperature 
(K) 

Mass 

(Ms) 

Luminosity 

<Xo) 

Radius 

(Re) 

Lifetime 

(years) 

05 

44,500 

60 

7.9  x  105 

12 

3.7  x  10s 

B0 

30,000 

17.5 

5.2  x  104 

7.4 

1.1  x  107 

B5 

15,400 

5.9 

8.3  x  102 

3.9 

6.5  x  107 

A0 

9,520 

2.9 

5.4  x  10 

2.4 

2.9  x  108 

F0 

7,200 

1.6 

6.5 

1.5 

1.5  x  109 

GO 

6,030 

1.05 

1.5 

1.1 

5.1  x  109 

K0 

5,250 

0.79 

0.42 

0.85 

1.4  x  1010 

M0 

3,850 

0.51 

0.077 

0.60 

4.8  x  1010 

M5 

3,240 

0.21 

0.011 

0.27 

1.4  x  10“ 

a  The  mass,  Ms,  is  in  units  of  the  Sun’s  mass  M0  =  1.989  x  1030  kg,  the  absolute  luminosity, 
Ls,  is  in  units  of  the  Sun’s  absolute  luminosity,  L0  =  3.828  x  10  J  s  ,  and  the  radius,  Rs,  is 

o 

in  units  of  the  Sun’s  radius,  RQ  =  6.955  x  10  m.  The  lifetimes  are  the  amount  of  time  required 
to  exhaust  the  nuclear  hydrogen  fuel  that  supplies  the  energy  of  stars  on  the  main  sequence 


and  moves  slightly  to  the  upper  right  in  the  H-R  diagram.  Moreover,  the  different 
positions  along  the  main  sequence  are  closely  related  to  only  one  property  of  the 
stars  -  their  mass  -  and  not  to  their  different  evolutionary  status.  The  mass  sets 
the  central  temperature  and  nuclear  fusion  rate  at  which  the  outward  pressure  and 
the  inward  gravitational  force  remain  in  balance. 

The  stellar  masses  decrease  downward  from  upper  left  to  lower  right  on  the 
main  sequence  (Fig.  10.9).  The  high-mass  stars  are  more  luminous  than  the  low- 
mass  stars  because  the  central  temperatures  of  the  former  are  higher,  to  support  the 
greater  mass,  and  their  nuclear-reaction  rates  are  faster,  producing  radiation  of 
much  greater  luminosity.  The  hot,  luminous  O  stars  can  have  masses  as  high  as 
150  times  that  of  the  Sun,  whereas  the  cool,  dim,  main-sequence  M  stars  might 
have  as  little  as  0.08  solar  masses. 

A11  of  these  main  sequence  stars  shine  by  converting  hydrogen  into  helium.  The 
effective  disk  temperature,  mass,  luminosity,  radius,  and  lifetime  of  main-sequence 
stars  of  different  spectral  types  are  listed  in  Table  10.7. 

Because  a  star  begins  shining  with  a  limited  supply  of  hydrogen,  it  can  remain 
on  the  main  sequence  for  only  a  limited  lifetime  -  that  is,  the  time  it  takes  to 
deplete  ah  of  the  hydrogen  fuel  in  its  hot  core.  Although  more  massive  stars 
certainly  contain  more  hydrogen  in  their  larger  core,  they  are  much  hotter  inside 
and  fuse  this  hydrogen  into  helium  at  a  faster  rate,  resulting  in  a  shorter  life.  As 
indicated  in  Table  10.7,  main- sequence  lifetimes  range  from  a  few  million  to 
100  billion  years  from  spectral  type  05  to  M5. 

A  more  massive  star  is  hotter  at  its  center  than  a  less  massive  star,  and  it  ought 
to  be  more  luminous.  It  turns  out  that  a  star’s  nuclear  energy  supply  is  proportional 
to  the  mass,  as  indicated  by  Einstein’s  famous  expression  E  =  Me  ,  where  E  is  the 
energy,  M  the  mass,  and  c  is  the  speed  of  light.  The  rate  at  which  energy  is  being 
radiated  away,  the  luminosity,  Ls ,  also  increases  with  the  mass,  but  as  the  fourth 
power  of  the  mass,  Ms.  So,  the  length  of  time,  t,  that  a  star  shines,  is  another 
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dramatic  function  of  the  mass;  these  times  were  also  given  in  Table  10.7  for  main- 
sequence  stars. 

How  do  we  determine  the  length  of  time  that  a  main- sequence  star  can  continue 
to  shine  by  converting  protons  into  helium  nuclei?  These  nuclear- fusion  reactions 
are  limited  to  the  hot,  dense  stellar  core.  Outside  of  the  core,  where  the  overlying 
weight  and  compression  are  less,  the  gas  is  cooler  and  thinner  so  nuclear  fusion 
cannot  exist.  For  instance,  the  energy-generating  core  of  the  Sun  extends  about  one 
quarter  of  the  distance  from  the  center  to  the  visible  solar  disk.  When  all  of  the 
hydrogen  within  the  core  has  been  converted  into  helium,  a  star  has  exhausted  its 
nuclear  fuel  supply  and  can  no  longer  reside  on  the  main  sequence. 

More  than  a  half-century  ago,  the  Brazilian  astrophysicist  Mario  Schonberg 
(1914-1990)  and  the  Indian- American  astrophysicist  Subrahmanyan 
Chandrasekhar  (1910-1995)  considered  stellar  models  in  which  hydrogen  is 
burned  inside  a  star’s  core,  or  in  a  thin  shell  between  the  burned-out  core  and  the 
overlying  material  (Schonberg  and  Chandrasekhar  1942).  They  found  that  it  was 
impossible  to  construct  models  in  which  more  than  12  %  of  the  mass  of  the  star  is 
included  in  the  exhausted  core.  This  meant  that  the  lifetime  of  a  star  on  the  main 
sequence  is  limited  to  the  time  it  takes  to  convert  12  %  of  its  hydrogen  into 
helium. 

The  energy,  A E,  released  in  the  conversion  of  four  protons  into  one  helium 

r\  r\ 

nucleus  by  hydrogen  burning  (Sect.  8.3)  is  A  E  =  A  me  =  0.007  (4 mPc  ),  and  the 
rest-mass-energy  conversion  process  is  just  0.007  or  0.7  %  efficient.  In  this 
expression,  Am  is  the  mass  difference  between  the  mass  of  four  protons  and  the 
mass  of  the  helium  nucleus,  mP  denotes  the  mass  of  a  proton,  and  c  is  the  speed  of 
light.  The  mass  difference  is  due  to  the  binding  energy  liberated  during  nuclear 
fusion  to  make  a  star  shine. 

If  we  convert  12  %,  or  0.12,  of  the  Sun’s  mass  into  energy  in  this  way,  then  the 
energy  released  is  E  =  0.12  x  0.007  M&c  where  the  mass  of  the  Sun 
M0  =  1.989  x  1030  kg  and  the  speed  of  light  c  =  2.9979  x  108  ms-1.  The 
Sun’s  main-sequence  lifetime,  Tms ,  required  to  convert  12  %  of  its  mass  into 
helium  is  therefore: 

Tms  =  (E\  =  0.12(0.007)  «  3.92  x  1017  s  w  1.24  x  1010  years, 

(10.46) 

yz  _ i 

where  the  luminosity  of  the  Sun  is  L0  =  3.828  x  10  J  s  ,  and 

rj 

1  year  =  3.156  x  10  s.  Assuming  that  the  luminosity  of  main-sequence  stars 
increases  with  the  3.5  power  of  the  mass,  the  main-sequence  lifetime  for  a  star  of 
mass,  Ms ,  is: 


Tms  =  3.90  x  1017 


'Af. 


0 


Ms 


2.5 


S. 


(10.47) 
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An  exceptionally  massive  star,  of  say  100  times  the  mass  of  the  Sun,  will 
survive  on  the  main  sequence  for  only  about  40  thousand  years;  in  contrast,  a  star 
of  moderate  mass,  say  10  times  the  mass  of  the  Sun,  will  survive  for  40  million 
years.  That  is  why  the  more  massive  stars  are  so  rare  and  hard  to  find. 

Stars  of  intermediate  mass,  such  as  the  Sun,  will  shine  for  10-20  billion  years. 
The  Sun  formed  about  4.6  billion  years  ago;  in  another  7.8  billion  years,  it  is 
expected  to  end  its  life  on  the  main  sequence. 

So,  the  position  of  a  star  on  the  main  sequence  depends  on  its  mass  -  the  most 
massive  stars  being  the  most  luminous  and  the  more  massive  a  star,  the  shorter  it 
lives  and  the  sooner  it  evolves  off  of  the  main  sequence.  Ninety  percent  of  all 
main-sequence  stars  have  a  mass  below  0.8  solar  mass,  and  they  have  not  yet  had 
time  to  perish.  They  have  been  on  the  main  sequence  ever  since  they  were  born, 
thereby  providing  us  with  no  information  about  stellar  evolution.  In  contrast,  some 
of  the  more  massive  stars,  which  were  born  long  ago,  have  had  enough  time  to 
burn  up  their  available  hydrogen  fuel  and  advance  to  the  next  stage  of  stellar  life. 
Thus,  to  understand  stellar  evolution  we  must  examine  the  upper  part  of  the  main 
sequence  in  the  H-R  diagram ,  which  applies  to  the  more  massive,  shorter-lived 
stars. 


10.2.4  The  Red  Giants  and  Supergiants 

After  the  low-mass,  main-sequence  stars,  the  most  common  type  of  star  is  the  red 
giant  found  in  the  upper  right  side  of  the  H-R  diagram  (Fig.  10.10).  These  low- 
temperature  stars  are  not  exceptionally  massive.  They  have  an  intermediate  mass 
of  roughly  1-10  times  that  of  the  Sun  and  are  in  a  late  state  of  stellar  evolution 
from  somewhat  hotter  main-sequence  stars.  Although  cooler  than  the  Sun,  the  red 
giants  are  about  100  times  more  luminous  due  to  their  much  larger  size,  about  50 
times  the  radius  of  the  Sun.  Prominent,  bright-red  giants  include  Aldebaran  and 
Arcturus. 

Because  they  are  so  luminous,  we  can  see  red  giant  stars  that  are  relatively 
distant  without  using  a  telescope.  However,  they  also  are  much  less  common  than 
main- sequence  stars  because  relatively  few  stars  have  entered  this  later  stage  of 
life.  The  red  giants  last  only  a  few  million  years,  which  is  a  brief  existence 
compared  to  the  billions  of  years  that  stars  of  roughly  solar  mass  spend  on  the 
main  sequence. 

The  giant  stars  are  enormously  distended  stars  with  a  low  mean  mass  density 
and  a  high  luminosity.  If  we  assume  that  the  inner  temperatures  of  giant  stars  are 
high  enough  to  generate  a  gas  pressure  sufficient  to  balance  gravitation,  then  their 
luminosity  would  greatly  exceed  that  which  is  actually  observed.  This  enigma  was 
resolved  almost  a  century  ago  when  the  great  English  astronomer  Arthur  Stanley 
Eddington  (1882-1944)  showed  that  radiation  pressure  must  stand  with  gravitation 
and  gas  pressure  as  the  third  major  factor  in  maintaining  the  equilibrium  of  a  star 
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Fig.  10.10  Giants,  supergiants,  and  white  dwarfs  The  majority  of  stars  occupy  the  main 
sequence  in  the  H-R  diagram.  Stars  with  a  mass  comparable  to  that  of  the  Sun  will  evolve  into 
helium-burning  giant  stars,  illustrated  by  Aldebaran  and  Arcturus  in  this  diagram  ( middle  right). 
These  red  giants  are  somewhat  cooler  than  the  Sun  but  about  100  times  more  luminous.  The 
luminosity  ( left  vertical  axis)  is  in  units  of  the  Sun’s  luminosity  Le  =  3.828  x  10  J  s  .  More 
massive  stars,  which  have  a  shorter  lifetime  on  the  main  sequence,  evolve  into  supergiant  stars 
that  are  between  10  thousand  and  1  million  times  as  luminous  as  the  Sun.  Antares  and  Betelgeuse 
(top  right)  illustrate  the  supergiant  stars  on  this  diagram.  After  depleting  all  of  their  helium  fuel, 
which  is  after  the  core  hydrogen  is  exhausted,  the  less-luminous  giant  stars  evolve  into  white 
dwarf  stars  of  very  low  luminosity  and  initially  hot  disk  temperatures.  They  are  illustrated  in  the 
bottom  left  of  this  diagram  by  Sirius  B,  40  Eridani  B,  and  Procyon  B.  (From  “The  Life  and  Death 
of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


(Eddington  1917).  The  radiation  pressure  is  the  pressure  exerted  by  electromag¬ 
netic  radiation. 

Although  the  outward  pressure  caused  by  the  motion  of  gas  particles,  or  the  gas 
pressure,  indeed  does  support  the  Sun  and  most  other  stars  against  the  inward  force 
of  their  immense  gravity,  it  is  insufficient  for  the  much  larger  giant  stars.  They  are 
also  supported  by  radiation  pressure,  which  increases  with  the  fourth  power  of  the 
temperature.  In  contrast,  gas  pressure  is  simply  proportional  to  the  temperature;  so, 
if  we  sufficiently  increase  the  central  temperature,  radiation  pressure  will  become 
much  larger  than  gas  pressure. 

Eddington  also  showed  how  some  of  the  giant  stars  that  are  in  radiative  equi¬ 
librium  could  pulsate,  with  outer  envelopes  that  move  in  and  out,  becoming 
alternately  ionized  and  neutral  during  the  course  of  pulsation  (Sect.  14.1,  Focus 
14.1,  Eddington  1918,  1919). 
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The  supergiants  are  very  massive,  evolving  from  main-sequence  stars  of  10  to 
100  solar  masses,  and  exceptionally  large,  with  radii  of  hundreds  of  times  that  of 
the  Sun.  They  are  also  10-100  times  more  luminous  than  the  red  giants.  Antares 
and  Betelgeuse  are  supergiant  stars. 

The  supergiants  are  so  exceedingly  rare  that  we  can  only  see  them  as  a  sparse 
sprinkling  across  the  top  edge  of  the  H-R  diagram.  They  are  even  less  common 
than  the  O  stars;  rarely  seen  in  any  given  part  of  the  night  sky,  but  so  intrinsically 
luminous  that  we  can  see  a  few  without  a  telescope. 


Example:  Properties  of  a  large  star 

Suppose  a  star  is  ten  thousand  times  more  luminous  than  the  Sun,  with  a 
stellar  luminosity  of  Ls  =  104  L©,  where  the  Sun’s  luminosity 

o/:  _ i 

L©  =  3.828  x  10"  J  s  ,  and  a  spectral  type  and  effective  temperature  the 
same  as  that  of  the  Sun,  whose  effective  temperature  is  Teff  «  5780  K.  We 
can  use  the  Stefan-Boltzmann  law  Ls  =  Ana  Rfliff,  where  n  «  3.14159  and 

_ O  _ O  _ A  _ 1 

the  Stefan-Boltzmann  constant  a  =  5.6704  x  10  J  m  “  K  s  ,  to  infer 
the  star’s  radius  Rs  ~  6.9  x  1010  m  ^  100  R&,  where  the  Sun’s  radius 

o 

rq  =  6.955  x  10  m.  The  mass-luminosity  relation  indicates  that  the 
luminosity  scales  roughly  as  the  3.5  power  of  the  mass,  so  the  mass  of  the 
star  is  about  Ms  =  (Ls/L&)1^3'5  M©  ^10  M©  where  the  Sun’s  mass 

on 

M©  =  1.989  x  10  kg.  The  main-sequence  lifetime  of  the  star  is  about 
Tms  «  1010  (. Mq/Ms )2'5  years  «  14  million  years. 

Instruments  aboard  HIPPARCOS  have  obtained  a  parallax  of 
nA  =  3.78  x  10“  "  for  the  blue  supergiant  star  Rigel,  to  give  a  distance  of 
D  =  1  / nA  «  264  pc  =  8.15  x  1018  m,  where  1  pc  =  3.0857  x  1016  m. 
Interferometer  measurements  indicate  that  Rigel  has  an  angular  diameter  of 
6  =  2.75  x  10-3  "  =  1.33  x  10-8  radian,  where  1  radian  =  2.06264  x 
105  ",  so  its  radius  is  R  =  6D/2  ^  5  x  1010  m  =  78  R&,  where  the  Sun’s 

o 

radius  RQ  =  6.955  x  10  m.  Rigel’ s  absolute  bolometric  magnitude  M  = 
—7.84,  so  its  luminosity  L  =  1004^4  83_M^L©  «  105LQ,  where  the  Sun’s 
absolute  magnitude  is  4.83  and  the  Sun’s  luminosity  L©  =  3.828  x 

oz  _ 1 

10“  J  s  .  We  can  determine  the  star’s  effective  temperature,  Teff,  from  the 
Stefan-Boltzmann  law  L  =  Ana  R2  where  n  =  3.14159  and  the  Stefan- 

_ O  _ O  _ A  _ 1 

Boltzmann  constant  a  =  5.6704  x  10  J  m  “  K  s  ,  obtaining 
Teff  x  104  K. 


Both  the  giant  and  the  supergiant  stars  are  so  large  that  their  atmospheres  are 
slowly  blowing  away  with  strong  winds  that  carry  their  outer  atmospheres  into 
surrounding  space.  For  example,  the  giant  star  Mira,  “The  Wonderful,”  is  pul¬ 
sating  and  also  losing  mass  at  about  one  millionth  of  a  solar  mass  per  year.  That  is 
more  than  1  million  times  the  mass  loss  rate  of  the  Sun’s  wind. 


10.2  Main-Sequence  and  Giant  Stars 


329 


Because  giants  and  supergiants  are  former  main-sequence  stars  that  have 
exhausted  their  core  supply  of  nuclear  hydrogen,  they  must  be  undergoing  other 
forms  of  nuclear  fusion.  We  now  consider  these  various  nuclear  burning  reactions. 


10.3  Nuclear  Reactions  Inside  Stars 

Clayton  (1984)  and  Rolfs  and  Rodney  (2005)  have  provided  textbooks  that  include 
stellar  nuclear  reactions  and  nucleosynthesis. 


10.3.1  The  Internal  Constitution  of  Stars 


Although  we  cannot  see  the  inside  of  a  star,  its  internal  structure  can  be  explained 
by  a  few  simple  concepts,  one  of  which  is  a  star’s  equilibrium.  Like  the  Sun, 
almost  every  star  we  see  is  neither  collapsing  nor  expanding,  and  it  remains  the 
same  size  throughout  most  of  its  long  life.  At  every  point  inside  such  a  star,  the 
inward  pull  of  its  gravity  is  balanced  precisely  by  the  outward  push  of  its  internal 
pressure. 

As  with  the  Sun,  all  of  the  other  main-sequence  stars  are  composed  mainly  of 
the  lightest  element,  hydrogen,  but  it  is  too  hot  for  whole  hydrogen  atoms  to  exist 
within  them.  These  atoms  are  fragmented  into  their  subatomic  constituents  by 
frequent  collisions.  The  material  in  these  stellar  interiors  therefore  is  in  the  plasma 
state,  composed  almost  entirely  of  hydrogen  nuclei,  protons,  and  free  electrons  no 
longer  attached  to  atoms.  Compressed  to  high  density,  protons  still  occupy  the  vast 
empty  spaces  of  former  atoms,  so  plasma  behaves  like  a  perfect  gas  with  a  pressure 
that  increases  with  the  temperature. 

The  central  temperature  of  any  main-sequence  star  can  be  estimated  by  assuming 
that  a  proton  at  the  center  is  hot  enough  and  moving  fast  enough  to  counteract  the 
gravitational  compression  on  the  proton  from  the  rest  of  the  star.  For  the  Sun,  this 
balance  is  achieved  at  a  central  temperature  of  15.6  million  K  (Sect.  8.2).  A  more 
massive  star  produces  greater  compression  at  its  center,  so  a  higher  central 
temperature  is  required  to  hold  it  up. 

The  temperature,  Tc,  at  the  center  of  a  star  like  the  Sun  can  be  estimated  by 
assuming  that  each  proton  down  there  is  hot  enough  and  moving  fast  enough  that 
the  thermal  energy  3kTc/2  counteracts  the  gravitational  compression  it  experi¬ 
ences  from  all  the  rest  of  the  star.  That  is: 


3 
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where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  ,  the  gravitational  con- 
stant  G  =  6.61  A  x  10  m  kg  s  ,  the  mass  of  the  proton  is  mP  =  1.6726  x 
10  "  kg,  and  Ms  and  Rs  respectively  denote  the  mass  and  radius  of  the  star. 
Solving  for  the  central  temperature  we  obtain: 


IGmpM s 
3  kRs 


1.54  x  107 


(10.49) 


In  the  numerical  approximation,  the  mass  and  radius  of  the  star  are  given  in 

on 

solar  units,  denoted  by  the  subscript  O,  where  M&  =  1.989  x  10  kg  and 

o 

Rq  =  6.955  x  10  m.  Thus,  the  temperature  at  the  center  of  the  Sun  is  about 
15  million  K. 

The  temperature  at  the  center  of  a  main-sequence  star  is  proportional  to  its 
mass,  so  a  star  that  is  10  times  more  massive  than  the  Sun  is  10  times  hotter  in  its 
center.  At  a  large  enough  mass,  the  star  becomes  so  hot  that  it  is  blown  apart;  this 
explains  why  there  are  no  known  stars  with  a  mass  greater  than  about  120  times 
the  mass  of  the  Sun. 

We  have  assumed  that  the  outward  gas  pressure  of  the  moving  protons  supports 
the  inward  pull  of  gravity,  a  condition  known  as  hydrostatic  equilibrium.  The  ideal 
gas  law  (Sect.  5.4)  gives  this  gas  pressure,  Pg : 

Pg=NkT ,  (10.50) 

which  is  equivalent  to: 


pkT  pkT 
m  prriH 


(10.51) 


for  a  gas  of  mass  density  p  and  mean  mass  per  particle  given  by: 


m  =  p/N  =  pnin  (10.52) 

for  mean  molecular  weight  p.  The  mass  of  the  hydrogen  atom  mH  =  1.00794 

r)'l 

u  «  1.67  x  10  ~  kg,  which  is  roughly  equal  to  the  atomic  mass  unit 
u  =  1.6605  x  10  ^  kg  and  good  enough  for  order  of  magnitude  estimates.  For  a 
fully  ionized  hydrogen  gas,  the  mass  per  particle  is  mP/2  or  half  the  proton  mass. 

The  gas  pressure  is  the  outward  pressure  caused  by  the  motion  of  gas  particles 
and  increases  with  their  temperature.  Gas  pressure  does  indeed  support  the  Sun  and 
most  other  stars  against  the  inward  force  of  their  immense  gravity,  but  this  does 
not  apply  for  much  larger,  and  relatively  rare,  giant  stars. 

Eddington  (1917)  showed  that  in  addition  to  the  kinetic  gas  pressure,  Pg9  the 
radiation  photons  in  a  giant  star  exert  an  additional  radiation  pressure,  Pn  given  by: 

Pr  =  a^~  (10.53) 

where  the  radiation  constant  a  =  7.5657  x  10  16  J  m  3  K  4.  For  radiative 
equilibrium  at  the  center  of  star  of  radius,  Rs ,  and  mass,  Ms : 
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where  the  gravitation  constant  G  =  6.61  A  x  10  m  kg  s  . 

The  total  pressure,  P,  inside  a  star  is  given  by  the  sum  of  the  gas  pressure  and 
the  radiation  pressure.  The  radiation  pressure  is  much  less  than  the  gas  pressure  at 
the  center  of  the  Sun,  but  it  can  compete  with  gas  pressure  in  supporting  giant 
stars.  The  complete  equation  of  state  for  the  pressure,  P,  is  then  given  by: 


P  —  Pg  +  Pr 
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1  , 
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(10.55) 


Example:  Supporting  a  star  by  gas  pressure  or  radiation  pressure 

At  the  center  of  a  star  of  mass  density  pc  and  temperature  Tc,  the  gas  pressure 
is  Pg  =  NkT  «  pckT / mp,  where  the  proton  number  density  N  «  pc/ mp ,  the 

_ no  _ i 

Boltzmann  constant  k  =  1.38065  x  10  “  J  K  and  the  proton  mass 

_ 0^7 

mp  =  1.6726  x  10  ^  kg.  The  radiation  pressure,  Pr,  is  given  by 
Pr  =  a7^/3,  where  the  radiation  constant  a  =  7.5657  x  1(T16  J  m'3  K-4. 
At  the  center  of  the  Sun  pc  x  1.5  x  105  kg  m  and  Tc  x  1.5  x  107  K, 

1  Z 

so  the  gas  pressure  Pg  ~  2  x  10  Pa  and  the  radiation  pressure 

i  o 

Pr  «  1.3  x  10  Pa,  which  is  1  thousand  times  less  than  the  gas  pressure.  In 
other  words,  the  gas  pressure  is  about  1  thousand  times  greater  than  the 
radiation  pressure  at  the  center  of  the  Sun. 

Suppose  a  main-sequence  star  is  one  hundred  times  as  massive  as  the  Sun, 
with  a  mass  Ms  =  100  M©,  where  the  Sun’s  mass  M©  =  1.989  x  1030  kg. 
The  mass-luminosity  relation  indicates  that  the  luminosity  scales  roughly  as 

^ 7 

the  3.5  power  of  the  mass,  so  the  star’s  luminosity  would  be  Ls  «  10  L0, 
an  exceptionally  luminous  star,  where  the  Sun’s  luminosity 
L©  =  3.828  x  10“  J  s  .If  the  effective  temperature  was  about  the  same  as 
that  of  the  Sun,  and  because  the  luminosity  varies  as  the  square  of  the  radius, 
the  star’s  radius  will  be  Rs  ~  10  RQ,  a  large  star,  where  the  Sun’s  radius 

o 

R&  =  6.955  x  10  m.  Assuming  that  the  star  is  entirely  composed  of  pro¬ 
tons,  then  the  central  temperature,  Tcs ,  of  the  star,  which  scales  as  Ms/Rs , 

1  '-J  Q 

will  be  10_  that  of  the  Sun.  The  gas  pressure,  PgS ,  varies  as  MsTCs/Rs  or 
as  Ml/Ri  so  it  will  be  10  11  that  of  the  Sun.  The  central  radiation  pressure, 
PrS,  varies  as  Tc,  or  as  ( Ms/Rs )4,  which  will  be  10-7  that  of  the  Sun,  and  the 
ratio  of  radiation  pressure  to  gas  pressure  in  the  star  center  will  be  ten 
thousand  times  greater  than  that  of  the  Sun.  For  this  more  massive  and 
luminous  star  the  central  radiation  pressure  is  estimated  to  be  10  times  the 
gas  pressure  at  the  star’s  center. 


Since  the  radiation  pressure  increases  with  the  fourth  power  of  the  temperature, 
and  the  temperature  has  to  increase  with  the  mass,  the  radiation  pressure  can 
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overcome  the  gravity  of  an  exceptionally  massive  star.  That  is,  a  giant  star  cannot 
remain  in  equilibrium  if  the  central  temperature  and  mass  become  too  high,  and  this 
occurs  for  masses  greater  than  about  120  solar  masses  (see  Focus  10.1,  Sect.  10.1). 

But  where  does  a  star’s  heat  come  from?  The  energy  released  by  nuclear  fusion 
in  the  stellar  core  heats  the  gas  and  generates  its  pressure.  That  is,  nuclear  reactions 
that  transform  a  light  element  into  a  heavier  one  liberate  subatomic  energy  that 
sustains  the  high  temperatures  within  a  star.  This  energy  also  makes  its  way  out  of 
the  star  to  provide  its  luminosity  and  keep  it  shining. 

Thus,  two  other  fundamental  concepts  in  understanding  a  star’s  interior  are:  (1) 
the  way  energy  is  generated  by  nuclear  reactions  near  its  center,  and  (2)  the 
methods  in  which  the  radiation  produced  by  these  reactions  works  its  way  out  to 
the  observed  stellar  disk,  its  photosphere.  The  energy  generation  depends  on  the 
nuclear  fuel,  as  well  as  the  mass  density  and  temperature  in  a  star’s  core.  The 
radiation-energy  transfer  depends  on  a  star’s  internal  opacity  to  radiation,  which 
prevents  some  of  the  radiation  from  escaping. 

After  arrival  on  the  main  sequence,  which  is  designated  the  zero  age ,  the 
internal  structure  of  a  star  can  be  determined  by  only  four  equations,  which 
describe  the  equilibrium,  energy  transport,  conservation  of  mass,  and  conservation 
of  energy  within  the  star.  The  crucial  equations,  given  in  Focus  10.3,  can  be  solved 
without  any  knowledge  of  the  properties  of  the  star  before  arrival  on  the  main 
sequence.  Kippenhahn  et  al.  (2012)  provide  a  good  textbook  of  stellar  structure 
and  evolution. 


Focus  10.3  The  equations  of  stellar  structure 

To  obtain  information  on  the  interior  constitution  of  the  stars,  astrophysicists 
have  to  integrate  basic  equations.  Pioneering  work  in  this  field  can  be  found 
in  the  books  of  Eddington  (1926a,  b)  and  Chandrasekhar  (1939).  The  four 
differential  equations  that  determine  a  stars  initial  position  on  the  main 
sequence  of  the  Hertz sprung-Rus sell  diagram  and  its  subsequent  evolu¬ 
tionary  history  are: 

The  equation  of  hydrostatic  equilibrium.  This  equation  states  that  the 
inward  force  of  gravity  caused  by  the  mass,  M(r),  within  a  distance,  r,  from  the 
stellar  center  is  just  balanced  by  the  outward  gas  pressure,  P(r),  at  radius,  r, 
so  that: 


or  equivalently 


dP(r ) 
dr 


dP{r) 

dM{r) 


(10.56) 


(10.57) 


10.3  Nuclear  Reactions  Inside  Stars 


333 


_ 11  o  _ i  _ o 

where  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  ,  the  mass 
density  is  denoted  by  p(r),  and  the  gas  pressure  is  given  by  the  ideal  gas  law 

(Sect.  5.4). 

The  equation  of  mass  continuity  or  the  equation  of  mass  conservation. 
This  equation  specifies  the  mass,  M(r),  contained  within  radius,  r,  in  terms  of 
the  mass  density,  p(r),  by: 


or  equivalently 


dM(r ) 
dr 


4nr2  p(r) 


dr  1 

dM(r)  4nr2p(r) 


(10.58) 


(10.59) 


This  equation  is  subject  to  the  boundary  conditions  of  zero  mass  at  zero 
radius,  or  M(r)  =  0  at  r  =  0,  and  a  mass  that  is  now  equal  to  the  total  mass 
of  the  star,  Ms ,  at  the  visible  stellar  radius,  Rs ,  or  M(RS)  =  Ms.  For  the  Sun, 
Ms  =  M&  =  1.989  x  1030  kg  and  the  radius  Rs  =  RQ  =  6.955  x  108  m. 

The  equation  of  energy  conservation.  This  equation  states  that  the  energy 
generated  per  unit  mass  per  unit  time  in  the  star’s  core,  denoted  by  e(r), 
supplies  the  energy  flux,  L(r),  carried  across  radius,  r,  or  that: 


or  equivalently 


dL{r) 

dr 


4nr2  p(r)r(r) , 


dL(r ) 
dM(r) 


(10.60) 


(10.61) 


The  energy  generation,  e(r),  is  a  function  of  the  initial  composition,  mass, 
density,  and  temperature.  This  equation  has  the  boundary  condition  provided 
by  the  current  luminosity,  Ls ,  for  a  star  of  total  mass,  Ms,  and  radius,  Rs.  For 
the  Sun,  we  have  Ls  =  L0  =  3.828  x  1026  J  s_1. 

The  equation  for  radiative  energy  transfer.  This  equation  relates  the 
temperature,  T(r),  at  radius,  r,  to  the  amount  of  energy  being  transferred  by 
radiation  to  that  distance.  It  is  related  to  the  opacity  to  radiation,  /c(r),  which 
measures  the  resistance  of  the  material  to  energy  transport  by  radiation.  The 
equation  is: 


or  equivalently 


dT(r)  3  K(r)p(r)L(r) 

dr  167 ir2ac[T(r)}3  ’ 


(10.62) 
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dT(r)  3/c(r)L(r) 


o  _ 1 

speed  of  light  c  =  2.9979  x  10  m  s  .  The  total  luminosity  of  a  star,  Ls ,  with 
a  radius,  Rs ,  is  given  by  the  Stefan- Boltzmann  law  Ls  =  4no  R2S  T4^  where 
the  Stefan-Boltzmann  constant  a  =  ac/4  =  5.6704  x  10-8  J  s-1  m-2  K-4 
and  Teffis  the  effective  temperature  of  the  visible  stellar  disk. 

The  chemical  composition  of  a  zero-age  main-sequence  star  is  assumed  to 
be  homogenous,  and  is  often  specified  by  X  =  pH/ p  the  fraction  by  mass  of 
material  in  the  form  of  hydrogen,  Y  =  pHe / p  the  fraction  by  mass  of 
material  in  the  form  of  helium,  and  Z  =  1  -  X  —  Y  =  pmetaisl P  the  fraction 
by  mass  of  material  heavier  than  helium.  The  mass  density  for  fully  ionized 
plasma  of  total  number  density,  A,  is: 


(10.64) 


_ ^)'~7 

where  the  mass  of  the  hydrogen  atom  is  mH  =  1.6726  x  10“"  kg. 


The  approximate  abundances  observed  in  the  disk  of  the  Sun  are 
X  =  0.71,  Y  =  0.27,  and  Z  =  0.02,  and  m  =  0.61  mH;  in  the  solar  core 
nuclear  fusion  reactions  have  converted  about  half  the  hydrogen  into  helium 
and  X  =  0.34,  Y  =  0.64  and  Z  =  0.02,  and  in  =  0.85  mH. 

At  any  given  time,  stars  of  the  same  composition  have  radii,  luminosities, 
effective  temperatures,  and  mean  densities  determined  solely  by  the  star’s  mass. 
The  German  astronomer  Heinrich  Vogt  (1875-1936)  demonstrated  this  concept  in 
1926,  and  the  American  astronomer  Henry  Norris  Russell  (1877-1957)  derived  it 
independently  the  following  year  in  his  textbook;  therefore,  it  is  known  as  the 
Vogt-Russell  theorem  (Vogt  1926;  Russell  et  al.  1927).  It  implies  that  a  star  of  a 
given  mass,  age,  and  chemical  composition  occupies  a  unique  position,  related  to 
the  star’s  evolutionary  history,  on  the  H-R  diagram.  The  mass,  age,  and  compo¬ 
sition  are  all  we  need  to  know  to  understand  the  life  history  of  a  star. 

A  star  will  continue  shining  with  a  luminosity  and  temperature  determined  by 
its  mass,  remaining  stable  and  fundamentally  unchanged  for  millions  to  billions  of 
years.  The  only  caveat  to  this  understanding  of  stellar  life  is  that  the  core  of  a  star 
is  the  only  place  hot  enough  for  nuclear  reactions  to  occur.  The  composition  of  the 
core  slowly  changes  as  the  result  of  these  reactions;  eventually,  there  is  no  more 
nuclear  energy  in  the  stellar  core  so  it  loses  its  equilibrium. 

It  appears  to  be  simple,  but  the  theory  is  complex  -  with  detailed  applications 
that  are  found  in  advanced  texts.  Moreover,  our  understanding  of  the  internal 
constitution  of  stars  includes  explanations  of  how  their  energy  is  generated,  by 
thermonuclear  reactions  in  the  stellar  cores,  beginning  with  the  fusion  of  hydrogen 
into  helium  in  main- sequence  stars. 
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10.3.2  Two  Ways  to  Bum  Hydrogen  in  Main-Sequence  Stars 

All  main-sequence  stars  generate  energy  by  the  thermonuclear  fusion  of  hydrogen 
nuclei,  the  protons,  into  helium  nuclei.  Because  the  hydrogen  is  “burned  up”  or 
consumed  to  fuel  the  nuclear  fires,  we  call  this  process  hydrogen  burning ,  although  it 
is  a  chain  of  nuclear-fusion  reactions  rather  than  the  combustion  of  an  ordinary  fire. 

There  are  two  methods  of  burning,  or  fusing,  hydrogen  into  the  heavier  element 
helium;  the  dominant  mechanism,  which  produces  the  most  power,  depends  on  the 
mass  of  a  star.  The  main  source  of  energy  for  main-sequence  stars  with  a  mass  less 
than  1.5  times  the  Sun’s  mass  is  the  proton-proton  chain  of  nuclear  reactions, 
abbreviated  as  the  p-p  chain.  A  different  sequence  of  nuclear  reactions  converts 
protons  into  helium  nuclei  inside  main-sequence  stars  more  massive  than  1.5  times 
the  mass  of  the  Sun.  This  is  known  as  the  carbon-nitrogen-oxygen  ( CNO )  cycle. 
As  the  name  suggests,  this  is  a  cyclic  set  of  nuclear  reactions. 

The  thermonuclear  process  responsible  for  the  energy  production  in  any  star  is 
not  limited  to  a  single  nuclear  transformation,  but  rather  consists  of  a  sequence  of 
linked  transformations  that  together  form  a  nuclear  chain  reaction.  The  p-p  chain 
is  linear,  with  only  one  direction,  whereas  the  CNO  cycle  occurs  in  a  closed 
circular  chain.  For  both  types  of  hydrogen  burning,  four  protons  combine  to  make 
one  helium  nucleus,  thereby  releasing  energy. 

In  the  lower  right-hand,  low-mass  side  of  the  main  sequence,  where  the  great 
majority  of  stars  are  located,  nuclear  energy  is  generated  as  a  result  of  the  proton- 
proton  chain  that  makes  the  Sun  shine.  In  1939,  the  German-born  American 
physicist  Hans  A.  Bethe  (1906-2005)  first  delineated  this  complete  nuclear 
transformation  of  four  protons  into  one  helium  nucleus  (Bethe  1939). 

Because  the  proton-proton  chain  was  discussed  in  detail  in  Sect.  8.3,  we  now 
limit  the  discussion  to  the  general  result.  Four  protons,  each  designated  by  :H  or  p , 
combine  to  form  a  helium  nucleus,  denoted  by  4He,  releasing  powerful  gamma  ray 
radiation,  designated  y,  positrons  with  the  symbol  e+ ,  electron  neutrinos,  denoted  by 

_ i  o 

the  symbol  ve,  and  an  energy  of  about  4x10  ~  J  per  reaction  chain.  We  can  use  the 

notation  of  nuclear  reactions  to  describe  the  p-p  chain,  with  nuclei  on  the  left  side  of 
a  reaction  designated  by  an  arrow,  -> ,  fusing  to  make  the  nuclei  and  other  particles  or 
radiation  on  the  right  side  of  the  arrow.  The  proton-proton  chain  is  described  using 
this  notation  in  Focus  10.4,  with  the  net  result  for  each  reaction  chain: 

4:H  ^  4He  +  6y  +  2V,.  (10.65) 


Focus  10.4  The  proton-proton  chain 

The  proton-proton  chain  of  reactions  begins  with  the  merger  of  two  protons, 
each  designated  by  1H,  in  the  reaction: 

lU  +  lU  -►  2D  +  e+  +  v  (10.66) 
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It  is  followed  by 

2D+  *H-»  3He  +  y, 

(10.67) 

with  a  last  step 

3 He  +  3 He  -►  4He  +  *H  +  *H. 

(10.68) 

In  the  process,  additional  gamma  rays  are  released  when  the  positrons 
combine  with  electrons,  denoted  e~,  during  pair  annihilation,  denoted  by 
e+  +  e~  ->  y  +  y,  and  the  net  result  of  the  p-p  chain  is: 

4*H  -►  4He  +  6y  +  2ve. 

(10.69) 

At  about  the  same  time  that  Bethe  delineated  the  proton-proton  chain,  the 
German  physicist  Carl  Friedrich  von  Weizsacker  (1912-2007)  examined  the 
thermonuclear  reactions  that  might  occur  in  stars,  publishing  his  results  in  two 
papers  that  appeared  in  the  Physikalische  Zeitschrift  (Weizsacker  1937,  1938). 
In  the  first  paper,  he  reasoned  that  the  merger  of  two  protons  must  have  started 
hydrogen  burning  in  the  Sun.  In  the  second  paper,  Weizsacker  proposed  that 
elements  that  are  heavier  than  hydrogen  already  were  created  before  the  formation 
of  stars,  as  we  know  them  now.  He  no  longer  was  limited  to  reactions  that  began 
with  the  lightest  element,  hydrogen,  and  this  led  him  to  the  important  discovery  of 
the  cyclic  CNO  chain  of  reactions  in  which  carbon  acts  as  a  catalyst  for  the 
synthesis  of  helium  from  hydrogen  (Focus  10.5).  The  overall  result  for  each  CNO 
reaction  chain  is  as  follows: 

12C  +  41  H  — >  12C  +  4He  +  5y  +  2ve,  (10.70) 

where  the  carbon  nucleus,  denoted  C,  is  forever  being  regenerated  and  acts  like  a 
catalyst.  That  is,  the  CNO  cycle  is  a  circular  reaction  chain  in  which  carbon  is 
destroyed  and  then  re-created.  Therefore  it  is  available  for  the  sequence  of  reac¬ 
tions  to  occur  repeatedly.  Like  the  non-circular,  linear  proton-proton  chain,  each 

_ i  o 

CNO  reaction  chain  releases  about  4  x  10  ~  J  of  energy. 


Focus  10.5  The  CNO  cycle 

1  9 

The  cyclic  CNO  chain  of  reactions  starts  when  a  carbon  nucleus,  C,  fuses 

1  i  o 

with  a  proton,  H,  to  produce  a  nucleus  of  nitrogen,  N,  and  gamma  ray 
radiation,  y.  The  nitrogen  decays  to  form  a  nucleus  of  heavier  carbon,  C;  a 
positron,  e+\  and  an  electron  neutrino,  ve.  These  beginning  nuclear-fusion 
reactions  are  as  follows: 
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and 

13N  -►  13C  +  e+  +  ve.  (10.72) 

The  cycle  then  continues  when  the  heavy  carbon  combines  with  a  proton 
to  form  heavier  nitrogen,  14N,  which  then  fuses  with  another  proton  to  form 
oxygen,  150.  The  oxygen  decays  to  make  15N,  which  then  combines  with  a 

1  SJ  A 

proton  to  make  the  original  carbon,  C,  together  with  a  helium  nucleus,  He. 
The  nuclear- fusion  reactions  are  as  follows: 


13C  +  ‘H^  14N  +  y 

(10.73) 

14N  +  ‘H  ->  150  +  y 

(10.74) 

150  -►  15N  +  e+  +  ve 

(10.75) 

and 

15N+'H^  12C  +  4He.  (10.76) 

The  positrons,  e+ ,  annihilate  with  the  electrons,  e~ ,  to  produce  energetic 
gamma  radiation  by  the  following  reaction: 

e++e~^2y.  (10.77) 

This  occurs  for  both  positrons  generated  during  the  each  CNO  reaction 
chain. 

Like  the  proton-proton  chain,  the  net  result  of  the  CNO  cycle  is  that  four 
protons  are  fused  together  to  form  one  helium  nucleus,  gamma  rays  and 
electron  neutrinos.  By  summing  the  left  and  right  sides  of  all  the  partici¬ 
pating  reactions,  we  obtain  the  following: 

12C  +  4‘H  -►  12C  +  4He  +  5y  +  2ve.  (10.78) 


The  circular  CNO  reaction  chain  is  induced  by  high  temperatures  in  the  cores  of 
massive  main-sequence  stars  and  becomes  self-sustaining  by  the  catalytic  action  of 
carbon.  This  cycle  also  could  begin  at  the  intermediate  stages  with  nitrogen  or 
oxygen,  so  the  entire  cycle  is  called  the  carbon-nitrogen-oxygen,  or  CNO,  cycle. 

It  was  Bethe  who  realized  that  the  proton-proton  reaction,  which  explains  the 
luminous  output  of  the  Sun,  fell  short  of  the  much  greater  luminosity  of  the  hotter 
and  more  massive  stars.  So,  he  systematically  examined  a  great  number  of  nuclear 
reactions  that  would  not  operate  within  stars  and  eliminated  them.  He  indepen¬ 
dently  found  that  the  CNO  cycle  would  generate  about  the  same  energy  as  the 
proton-proton  process  for  each  nuclear  reaction  chain.  Bethe  also  showed  that  the 
greater  rate  and  temperature  dependence  of  the  CNO  cycle  could  account  for  the 
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Temperature  (106K) 

Fig.  10.11  Energy  generation  by  two  hydrogen-burning  processes  The  energy  output  ( left 
vertical  axis),  in  units  of  power  per  kilogram,  or  J  s-1  kg-1,  as  a  function  of  core  temperature 
0 bottom  horizontal  axis )  in  millions,  or  106,  degrees  kelvin,  designated  K.  The  proton-proton 
chain,  denoted  PP,  dominates  the  hydrogen-burning  energy  production  for  the  Sun  and  less 
massive  stars  that  have  lower  core  temperatures.  At  the  center  of  the  Sun,  where  the  temperature 
is  15.6  x  106  K,  the  PP  chain  is  the  dominant  nuclear-reaction  chain  for  converting  hydrogen 
nuclei  into  helium  nuclei,  with  an  energy  output  of  0.016  J  s-1  kg-1,  or  51  million  MeV  g-1  s-1 
in  the  units  used  by  nuclear  astrophysicists.  In  more  massive  main-sequence  stars,  the  central 
temperature  is  higher  and  the  CNO  cycle  of  hydrogen  burning  is  the  most  efficient  process.  Main- 
sequence  stars  of  mass  less  than  1.5  solar  masses  shine  by  the  PP  chain  of  nuclear  reactions, 
whereas  the  main-sequence  stars  with  mass  greater  that  1.5  solar  masses  bum  hydrogen  by  the 
CNO  set  of  nuclear  reactions.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang, 
published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


high  luminosity  of  the  massive  stars.  Subsequently,  this  conclusion  was  placed  on 
a  firm  basis  when  William  A.  “Willy”  Fowler  (1911-1995)  and  his  colleagues  at 
the  California  Institute  of  Technology  made  laboratory  measurements  of  the 
nuclear  reaction  cross  sections  for  every  reaction  in  the  chain;  the  results  were 
published  in  a  series  of  papers  spanning  several  decades. 

Given  today’s  knowledge,  it  is  the  mass  and  therefore  the  central  temperature  of 
a  main- sequence  star  that  determine  which  hydrogen  burning  reaction  supplies 
most  of  its  power.  The  relevant  formula  for  the  energy  production  rate,  £pp,  of  the 
proton-proton  chain  is  given  by 


spp  =  0.24  pX2 


10 

~T 


61  2/3 


exp 


1/31 


J  s  1  kg  *, 


(10.79) 


where  p  is  the  mass  density  and  X  is  the  mass  fraction  of  hydrogen.  The  energy 
production  rate  for  the  CNO  cycle  is: 


£cno  —  8.7  x  1020 pXcnoX 


2/3 

exp 


152.3 


10 

Y 


6\  1/3' 


J  s  1  kg  1 , 


(10.80) 
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where  XCno  is  the  sum  of  the  mass  fractions  for  carbon,  nitrogen,  and  oxygen.  The 
variations  of  the  two  energy-producing  reactions  as  a  function  of  the  core  tem¬ 
perature  T  is  illustrated  in  Fig.  10.11  for  a  typical  stellar  composition. 

Both  the  proton-proton  chain  and  the  CNO  cycle  operate  within  main- sequence 
stars,  and  release  about  the  same  amount  of  energy  during  each  reaction  chain; 
however,  the  total  amount  of  energy  generation  differs  depending  on  the  mass  and 
central  temperature  of  a  star  (Fig.  10.11).  The  CNO  cycle  contributes  little  energy 
in  low-mass  stars  with  low  central  temperatures,  and  the  proton-proton  chain 
produces  almost  all  of  the  energy  radiated  by  main-sequence  stars  with  a  mass  less 
than  or  equal  to  the  Sun’s  mass  (Bahcall  et  al.  2003).  However,  the  CNO  cycle  is 
faster  and  generates  more  energy  in  massive  stars  that  have  high  central  temper¬ 
atures.  Stars  with  a  mass  of  about  2  solar  masses  or  above  generate  almost  their 
entire  energy  output  by  the  CNO  cycle. 

In  the  Sun,  with  a  central  temperature  of  15.6  million  K,  only  1.5  %  of  its 
energy  is  generated  by  the  CNO  cycle.  That  is  why  the  proton-proton  chain 
explains  the  Sun  so  well.  However,  with  increasingly  more  mass  and  an  increas¬ 
ingly  hotter  stellar  core,  the  CNO  cycle  becomes  the  dominant  energy  source  for  a 
main-sequence  star.  For  a  star  with  a  mass  of  1.5  times  the  mass  of  the  Sun,  where 
the  central  temperature  reaches  1 8  million  K,  each  method  of  burning  hydrogen 
produces  the  same  amount  of  total  energy  and  half  the  luminosity  of  a  star. 

The  role  of  radiation  and  convection  in  transporting  energy  out  of  the  stellar 
core  also  depends  on  the  mass  of  a  main-sequence  star  (Fig.  10.12).  When  the 
stellar  mass  is  comparable  to  that  of  the  Sun,  the  energy-generating  core  is  sur¬ 
rounded  by  a  radiative  zone  and  topped  by  a  convective  zone  (see  Sect.  8.5).  Main- 
sequence  stars  with  a  mass  of  more  than  2  solar  masses  have  a  convective  core.  In 
these  stars,  the  rate  of  energy  generation  by  the  CNO  cycle  is  sensitive  to  tem¬ 
perature,  so  the  fusion  is  highly  concentrated  in  the  core.  Consequently,  there  is  a 
high-temperature  gradient  in  the  core  region,  which  results  in  a  central  convection 
zone.  The  outer  regions  of  such  a  massive  star  transport  energy  by  radiation  with 
little  or  no  convection. 

All  stars  that  begin  their  relatively  long  and  placid  life  on  the  main  sequence  are 
composed  mainly  of  hydrogen.  These  stars  are  initially  uniform  balls  of  plasma 
with  the  same  composition  throughout.  As  time  passes,  the  central  stellar  core  is 
changed  slowly  from  hydrogen  to  helium,  so  the  inside  of  a  main- sequence  star 
eventually  becomes  different  from  the  outside.  Eventually,  the  core  is  all  used  up, 
exhausting  its  supply  of  hydrogen  by  converting  it  into  helium.  The  star  has  to 
leave  the  main  sequence  and  become  hot  enough  inside  to  burn  helium  -  the  ash  of 
its  former  hydrogen-burning  fires. 
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Fig.  10.12  Convection  inside  stars  of  ditferent  mass  Most  stars  have  convective  zones  in 
which  energy  is  transported  by  the  wheeling  motion  of  convection,  denoted  here  by  closed  curves 
with  arrows  for  stars  of  different  mass,  designated  by  M,  and  compared  to  the  Sun’s  mass  denoted 
Mq.  The  symbol  <  means  less  than  and  the  symbol  >  denotes  greater  than.  Low-mass  stars,  with 
less  than  half  a  solar  mass,  are  fully  convective  from  core  to  visible  disk  and  therefore  of  uniform 
composition.  Their  low  temperatures  result  in  a  high  opacity  to  radiation.  In  intermediate-mass 
stars,  such  as  the  Sun,  radiation  transport  dominates  convection  in  the  hot  central  regions,  which 
are  enveloped  by  a  cooler  convective  region.  The  visible  disks  of  these  stars  do  not  include  the 
nuclear-fusion  products  from  their  core  but  rather  retain  the  same  composition  as  the  interstellar 
medium  from  which  these  stars  were  formed.  High-mass  stars,  with  more  than  1.5  times  the  mass 
of  the  Sun,  have  a  large  radiative  zone  that  is  not  enveloped  by  a  convective  zone.  The 
temperature-sensitive  hydrogen-burning  reactions  of  the  CNO  cycle  cause  the  development  of  a 
convective  core  in  these  stars.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang, 
published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


10.3.3  Helium  Burning  in  Giant  Stars 

The  fact  that  giant  stars  are  connected  to  the  main  sequence  of  the  H-R  diagram 
suggested  that  the  giants  are  the  next  stage  of  stellar  evolution.  However,  because 
giant  stars  have  larger  luminosities  at  lower  disk  temperatures  than  main-sequence 
stars,  they  seemed  to  shine  by  a  different  and  unknown  process.  The  enigma  was 
resolved  partially  when  the  Estonian  astronomer  Ernst  Opik  (1893-1985),  working 
at  the  Armagh  Observatory  in  Northern  Ireland,  argued  that  the  inside  of  a  giant 
star  can  become  very  hot  and  dense  at  the  same  time  that  its  outer  parts  become 
cool  and  rarefied  (Opik  1938). 

The  hydrogen-burning  process  of  a  main-sequence  star  is  confined  to  the  central 
stellar  core,  which  is  surrounded  by  an  inert,  nonburning  envelope  in  which  no 
nuclear  reactions  take  place.  When  the  core  hydrogen  is  expended,  the  core  is 
forced  to  contract,  for  it  can  no  longer  support  itself  under  the  crush  of  gravity.  The 

o 

central  temperature  will  rise  to  about  100  million,  or  10  ,  K  once  gravitational 
forces  compress  the  core  to  a  smaller  volume  and  increase  the  mass  density  1 ,000 

_ Q 

fold,  to  about  10  million  kg  m  .  The  rapid  increase  in  core  temperature  causes 
the  surrounding  hydrogen  envelope  to  expand,  producing  a  vast,  cool  envelope  of 
low  mass  density  (Fig.  10.13).  These  spectacular  changes  in  both  the  inside  and 
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Fig.  10.13  Formation  of  a  giant  star  When  a  main-sequence  star  consumes  the  hydrogen  in  its 
core,  the  inside  of  the  star  contracts  and  heats  up,  causing  the  outside  to  expand  and  cool  down. 
Hydrogen  burning  resumes  in  a  shell  that  envelops  the  collapsing  core.  The  center  of  the  star 
eventually  heats  up  to  about  100  million  K,  which  is  hot  enough  to  bum  helium  and  stop  the  core 
collapse.  A  giant  star  then  has  been  created  with  a  luminosity  of  about  100  times  that  of  the  Sun 
and  a  radius  of  approximately  50  times  the  radius  of  the  Sun.  (From  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


outside  of  a  dying  main-sequence  star  account  for  the  observed  characteristics  of 
red  giant  stars. 

Opik  realized  that  the  large  increase  in  central  temperatures  and  densities  of 
giant  stars  open  up  a  new  source  of  energy  not  available  to  the  main- sequence 
stars,  and  he  proposed  that  the  helium  ash  produced  by  hydrogen  burning  would 
serve  as  the  nuclear  fuel  for  giant  stars. 

The  main  difficulty  with  this  scenario  was  that  there  is  no  stable  nucleus  of 
atomic  weight  5,  and  this  gap  seemed  to  provide  an  impenetrable  barrier  for  the 
synthesis  of  heavier  elements  from  helium  of  weight  4  using  protons  of  weight  1 . 
A  proton  could  not  be  attached  to  a  helium  nucleus  to  make  the  next  heavier 
substance. 

It  took  more  than  a  decade  to  resolve  the  difficulty,  which  was  explained  almost 
simultaneously  and  independently  by  Opik  and  the  American  astronomer  Edwin  E. 
Salpeter  (1924-2008),  at  Cornell  University  (Opik  1951;  Salpeter  1952).  When  the 
core  of  a  star  reaches  a  sufficiently  high  temperature,  of  about  100  million,  or 

o 

10  ,  K,  helium  nuclei  can  be  converted  to  carbon  nuclei  by  triple  collisions  of 
helium  nuclei,  thus  circumventing  the  mass  5  difficulties. 


342 


10  The  Sun  Amongst  the  Stars 


This  release  of  energy  by  fusing  helium  into  carbon  within  a  star  is  known  as 
helium  burning.  It  also  is  called  the  triple  alpha  process  because  the  helium 
nucleus  is  an  alpha  particle  and  a  triple  collision  is  required  to  make  a  nucleus  of 
the  carbon  atom. 

The  net  result  of  the  helium  burning,  triple  alpha  chain  reaction  is 

4He  +4He  +4He^12C  +y,  (10.81) 

A  1  9 

where  He  is  a  helium  nucleus  or  alpha  particle,  is  a  carbon  nucleus,  the  y 

_ i  o 

denotes  gamma  radiation.  About  7.275  MeV  or  1.165  x  10  ^  J  of  energy  is 
released  each  time  it  occurs.  The  triple  alpha  process  happens  so  frequently  at  the 
high  central  temperatures  of  giant  stars  that  it  can  power  their  intense  luminosity. 

The  difficulty  in  pushing  together  even  two  helium  nuclei  is  exacerbated  by 
their  electrical  charge.  Each  helium  nucleus  contains  two  protons;  therefore,  the 
electrical  repulsion  between  two  helium  nuclei  is  four  times  that  between  two 
protons. 


Example:  Hot  enough  to  burn  helium 

To  overcome  the  electrical,  or  Coulomb,  repulsion  between  two  helium  ions, 
of  charge  2e  =  2  x  1.6022  x  10-19  C  each,  the  kinetic  energy  mHeV2 / 2  of 
a  colliding  helium  ion,  of  mass  mHe  =  6.6445  x  10  ^  kg  and  velocity 

r\ 

V  must  be  equal  to  the  electrical  potential  energy  4e  / ( 4n£0RHe )  when  the 
two  ions  touch,  where  n  =  3.14159,  the  permittivity  of  free  space  is 
s0  =  10_9/(367i)  =  8.854  x  10“ 12  F  m  \  and  the  radius  of  the  helium  ion 
is  RHe  «  10  m.  Solving  for  the  velocity  V  ^  10  ms  ,  and  setting  it 

i  /o 

equal  to  the  thermal  velocity  V  =  Vtherma\  =  (3 kT/mHe)  ,  with  the  Boltz- 

_ oo  _ 1 

mann  constant  k  =  1.38065  x  10  J  K  ,  we  obtain  a  temperature  of 
T  x  1.6  x  1010  K  to  overcome  the  electrical  repulsion. 

This  is  an  impossibly  high  temperature,  about  4  times  hotter  than  that 
required  for  the  direct  fusion  of  two  protons,  but  quantum-mechanical 
tunneling  lowers  the  central  star  temperature  needed  to  about  five  billion,  or 
Tc  «  5  x  109  K.  The  increase  in  temperature  is  created  by  core  collapse  to 
a  radius  Rc.  By  equating  the  thermal  energy  3kTc/2  to  the  gravitational 
energy  GmHeMc/Rc ,  where  the  gravitational  constant  G  =  6.674 
x  10  m  kg  s  ,  and  assuming  a  core  mass  about  equal  to  that  of  the 

or)  7 

Sun  with  M0  =  1.989  x  10  kg,  we  can  infer  a  core  radius  of  Rc  «  10 

o 

m  «  0.014  Rq  for  the  Sun’s  radius  RQ  =  6.955  x  10  m  and  a  mass  den¬ 
sity  of  p  =  3  MQ/(4nRl)  «  0.5  x  109  kg  m-3. 

It  turns  out  that  certain  nuclear  resonances  also  enhance  the  reaction  rate 
of  the  triple  helium  burning  process,  and  modern  calculations  indicate  that  a 

o 

core  temperature  of  100  million,  or  10  K  might  suffice  to  initiate  the  helium 
fusion,  but  we  have  the  basic  idea.  The  central  core  must  collapse  to  increase 
the  temperature  and  enable  helium  fusion. 
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The  quantum-mechanical  tunneling  effect  will  help,  as  it  does  in  permitting  two 
protons  to  fuse  together  in  the  Sun  (see  Sect.  8.3).  The  rise  in  the  central  tem¬ 
perature  of  a  giant  star  is  needed  to  increase  the  number  of  helium  nuclei  moving 
fast  enough  to  penetrate  the  larger  electrical  barrier  with  the  aid  of  tunneling, 
which  incidentally  also  explains  why  helium  burning  doesn’t  occur  in  low-tem¬ 
perature,  main-sequence  stars.  Only  the  helium  nuclei  in  the  high-velocity  tail  of 
the  Maxwellian  speed  distribution  can  merge  together  in  a  giant  star,  which  means 
that  most  of  the  helium  nuclei  are  not  moving  fast  enough  to  merge  and  that  the 
helium-burning  reactions  occur  relatively  slowly. 

Under  most  circumstances,  helium  burning  still  would  be  exceedingly  unlikely 
because  it  involves  the  nearly  simultaneous  collision  of  not  two  but  rather  three 
helium  nuclei.  Such  a  triple  collision  is  favored  by  two  exceptionally  large  col¬ 
lision  cross  sections,  termed  resonance  reactions  in  the  parlance  of  nuclear 
physics.  The  bigger  the  cross  section  for  a  collision,  the  more  likely  it  will  occur. 

In  1952,  Salpeter,  who  was  unaware  of  Opik’s  work  the  previous  year,  pre¬ 
sented  some  of  these  more  detailed  explanations  for  the  formation  of  carbon  in  the 
triple  alpha,  helium  burning  process.  Two  helium  nuclei  can  combine  to  form  a 
beryllium  8  nucleus,  but  because  the  beryllium  is  unstable  only  a  tiny  fraction 
remains  at  any  instant.  As  Salpeter  noticed,  a  beryllium  nucleus  can  occasionally 
combine  with  a  third  helium  nucleus  to  form  carbon  12.  Nevertheless  because 
beryllium  is  so  extremely  rare,  it  must  have  a  large  cross  section  for  helium 
capture  if  any  substantial  amount  of  carbon  is  to  be  produced. 

Two  years  later,  Fred  Hoyle  (1915-2001)  showed  that  the  triple  alpha  process 
occurs  at  a  rapid  enough  rate  to  provide  the  luminosity  of  a  red  giant  star  if  the 
carbon  goes  through  an  excited  state  and  the  giant  core  temperature  reaches  about 
10  K  at  a  mass  density  of  about  10  kg  per  cubic  meter  (Hoyle  1954).  William 
Fowler  and  his  colleagues  subsequently  showed  that  the  required  excited  state  of 
carbon  does  in  fact  exist.  The  formation  of  carbon  from  helium  is  thus  enhanced 
enormously  by  two  facts:  the  existence  of  beryllium  8  (itself  a  kind  of  resonance) 
and  the  existence  of  the  excited  state  of  carbon. 

The  details  are  somewhat  complicated  but  suffice  it  to  say  that  the  mass  5 
barrier  can  be  overcome;  Fynbo  et  al.  (2004)  provide  relatively  recent  estimates  for 
the  rates  of  the  stellar  triple-alpha  process.  In  retrospect,  it  seems  almost  mirac¬ 
ulous  that  nature  has  conspired  in  this  way  to  make  helium  burning  and  giant  stars 
exist.  Our  understanding  of  the  evolution  of  main-sequence  stars  into  these  larger, 
more  luminous  counterparts  was  stimulated  by  investigations  of  star  clusters. 


10.4  Using  Star  Clusters  to  Watch  How  Stars  Evolve 

Entire  stars  have  had  a  beginning,  followed  by  a  long  period  of  growth  and 
inevitable  decay,  eventually  turning  into  something  else,  and  we  can  use  the  H-R 
diagrams  of  star  clusters  to  map  out  the  stages  of  stellar  transfiguration.  As  time 
elapses,  the  more  massive  stars  evolve  into  the  next  phase  of  stellar  life  and  the 
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Fig.  10.14  Open  star  clusters  in  the  H-R  diagram  Open  star  clusters  are  relatively  young,  and 
most  of  their  stars  have  not  yet  left  the  main  sequence  in  the  H-R  diagram.  The  youngest  clusters, 
such  as  the  Pleiades,  retain  all  but  the  topmost  part  of  the  main  sequence.  The  turnoff  point  of  the 
Pleiades  star  cluster  from  the  main  sequence  indicates  an  age  of  roughly  100  million  years.  The 
Hyades  star  cluster,  which  turns  off  about  halfway  down  the  main  sequence,  is  about  600  million 
years  old.  The  lowest  open  cluster  in  this  diagram,  M  67,  is  an  estimated  5  billion  years  old,  with 
a  main  sequence  that  stops  just  above  the  Sun.  One  globular  star  cluster,  M  3,  is  shown  for 
comparison.  [Adapted  from  Allan  Sandage  (1957).] 


main  sequence  disappears  from  the  top  down.  Very  massive  stars  at  the  upper  left 
of  the  main  sequence  become  supergiants;  those  with  intermediate  masses  com¬ 
parable  to  the  Sun  become  red  giants. 

Benacquista  (2013)  gives  a  fine  introduction  to  the  evolution  of  both  single  and 
binary  stars.  Gallart  et  al.  (2005)  have  provided  a  review  of  stellar  evolution 
models  and  color  magnitude  diagrams.  Iben  and  Renzini  (1983)  have  reviewed  the 
asymptotic  giant  branch  (AGB)  evolution  and  beyond,  and  Winckel  (2003)  has 
reviewed  the  post  AGB  stars.  Chiosi  and  Maeder  (1986)  have  reviewed  the  evo¬ 
lution  of  massive  stars  with  mass  loss;  Gilles  and  Baraffe  (2000)  have  reviewed  the 
theory  of  low-mass  stars  and  substellar  objects;  and  Maeder  and  Meynet  (2000) 
have  discussed  the  evolution  of  rotating  stars. 

Stars  within  a  star  cluster  are  all  of  the  same  approximate  age,  within  a  few 
million  years,  dating  back  to  the  formation  of  the  cluster.  They  also  began  with  the 
same  initial  composition  of  material  and  exhibit  a  full  range  of  stellar  mass. 
Because  the  stars  in  a  given  cluster  are  all  at  the  same  distance  from  the  Earth,  we 
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Fig.  10.15  How  old  is  a  globular  star  cluster?  A  plot  of  the  luminosity  ( left  vertical  axis),  in 

r\  s'  1 

units  of  the  Sun’s  luminosity  L0  =  3.828  x  10  J  s”  ,  against  the  color  index,  B-V  ( bottom 
horizontal  axis),  for  the  stars  in  the  southern  globular  star  cluster  47  Tucanae,  also  designated 
NGC  104.  The  absolute  visual  magnitudes,  Mv,  of  the  stars  also  are  shown  ( right  vertical  axis). 
Although  low-mass,  relatively  faint  stars  are  still  on  the  main  sequence  (« diagonal  line  from 
middle  left  to  bottom  right),  the  massive,  bright  stars  in  the  cluster  have  left  the  main  sequence 
and  are  evolving  into  giant  stars  ( top  right).  Theoretical  tracks,  called  isochrones,  show  the 
evolutionary  distributions  at  different  ages  of  10  billion,  12  billion,  14  billion  and  16  billion  years 
from  top  to  bottom  and  left  to  right.  The  best  fit  to  the  observed  data  corresponds  to  an  age 
between  12  billion  and  14  billion  years  for  this  star  cluster.  (Courtesy  of  James  E.  Hesser.) 


can  obtain  direct  observations  of  their  relative  luminosity  without  knowing  the 
distance. 

A  cluster  H-R  diagram  can  be  used  as  a  clock,  dating  the  age  of  the  cluster  and 
the  stars  in  it  by  the  place  of  their  turnoff  from  the  main  sequence  to  become 
supergiants  or  giants.  Stars  with  a  luminosity  and  temperature  greater  than  the 
turnoff  value  all  have  evolved  away  from  the  hydrogen-burning  state  of  stellar  life, 
and  the  age  of  the  cluster  is  equal  to  the  main-sequence  lifetime  of  stars  at  this 
turnoff  point.  The  lower  its  luminosity  or  temperature,  the  older  is  the  star  cluster. 

The  main-sequence  turnoffs  of  the  loosely  bound  open  star  clusters  (Fig.  10.14) 
indicate  that  they  are  approximately  100  million  years  old  (Mermilliod  1981; 
Sandage  1957).  Such  relatively  young  clusters  are  identified  by  the  membership  of 
O  and  B  stars,  which  would  leave  the  main  sequence  in  a  relatively  short  time. 
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The  main  sequence  of  the  H-R  diagram  for  globular  star  clusters  does  not 
contain  these  hot,  luminous  stars  (Fig.  10.15),  and  these  star  clusters  have  ages 
between  10  billion  and  14  billion  years  -  or  about  two  to  three  times  the  age  of  the 
Sun.  This  indicates  that  the  masses  of  the  stars  at  the  main-sequence  turnoff  point 
of  globular  clusters  are  less  than  the  Sun.  Soderblom  (2010)  provided  a  review  of 
the  age  of  stars,  whereas  Van  den  Berg  et  al.  (1996)  reviewed  the  age  of  the 
galactic  globular  cluster  system. 

Due  to  their  great  age  and  numerous  stars,  the  H-R  diagrams  of  these  dense 
stellar  concentrations  help  us  watch  how  stars  evolve  to  the  later  stages  of  stellar 
life.  Such  investigations  involve  theoretical  calculations  of  precisely  how  long  a 
main-sequence  star’s  central  fuel  supply  can  last  and  models  of  what  happens 
when  its  fuel  is  used  up.  Martin  Schwarzschild  (1912-1997),  the  son  of  German 
astronomer  Karl  Schwarzschild  (1873-1916),  was  one  of  the  first  to  examine  this 
phase  of  stellar  evolution.  After  emigrating  to  the  United  States,  Martin 
Schwarzschild  used  theoretical  models  and  primitive  computers,  developed  by  his 
Princeton  colleague  John  von  Neumann  (1903-1957),  to  chart  the  evolutionary 
trajectory  of  a  star  and  compare  it  to  the  various  kinks,  bends,  and  gaps  of  missing 
stars  on  the  H-R  diagrams  of  globular  star  clusters.  His  theoretical  evolutionary 
models  were  facilitated  by  the  fact  that  all  of  the  stars  in  globular  star  clusters  have 
the  same  initial  chemical  composition. 

Martin  teamed  with  Allan  Sandage  (1926-2010),  then  a  graduate  student  at  the 
California  Institute  of  Technology,  whose  H-R  diagrams  included  faint  stars  that 
connected  the  main  sequence  to  the  red-giant  branch  (Fig.  10.16).  When 
Sandage’ s  data  were  compared  to  the  model  results,  it  improved  our  understanding 
of  evolution  away  from  the  main  sequence  and  this  provided  a  sound  observational 
basis  for  stellar  aging  (Sandage  and  Schwarzschild  1952;  Sandage  1957; 
Schwarzschild  1958). 

When  the  hydrogen-burning  fires  are  quenched  in  a  star  of  roughly  solar  mass, 
the  shrinking  stellar  core  heats  up  and  causes  the  star  as  a  whole  to  swell  into  a 
bloated  red  giant.  The  gravitational  energy  released  by  the  collapsing,  nonburning 
core  is  then  spread  over  a  much  larger  area,  resulting  in  a  lower  disk  temperature 
and  a  shift  of  the  visible  starlight  into  the  red  part  of  the  spectrum.  This  accounts 
for  the  red-giant  branch  in  the  H-R  diagram  of  globular  star  clusters. 

The  rising  heat  from  the  collapsing  core  ignites  hydrogen  burning  in  an  internal 
shell  that  envelops  a  core  of  inactive  helium.  As  realized  by  Schwarzschild  and  the 
English  astronomer  Fred  Hoyle  (1915-2001),  the  giant  core  is  compressed  into  a 

o 

degenerate  state  and  eventually  heats  up  to  a  temperature  of  about  10  K,  when 
core  helium  burning  begins  (Hoyle  and  Schwarzschild  1955).  Once  the  star  fuses 
helium  into  carbon  within  its  core,  it  enters  the  horizontal  branch  of  the  cluster  H- 
R  diagram  and  is  technically  no  longer  considered  a  red  giant.  As  then  demon¬ 
strated  by  Schwarzschild  and  the  Richard  Harm  (1909-1996),  helium  begins  to 
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Fig.  10.16  Globular  star  cluster  in  the  H-R  diagram  The  Hertz sprung-Rus sell  diagram  for 
the  globular  star  cluster  M  5,  where  the  absolute  visual  magnitude  ( left  vertical  axis )  is  plotted  as 
a  function  of  color  index  (, bottom  horizontal  axis).  It  is  very  different  from  the  H-R  diagrams  for 
open  star  clusters  shown  in  Fig.  10.14.  The  high-mass  stars  in  this  globular  star  cluster  have  left 
the  main  sequence  (MS)  at  a  relatively  low  turnoff  point,  denoted  TO,  indicating  a  greater  age 
than  the  open  star  clusters.  This  diagram  illustrates  the  evolutionary  tracks  of  these  stars  into  the 
red-giant  branch,  designated  RGB  (top  right),  as  well  as  other  evolutionary  stages  such  as  the 
subgiant  branch,  denoted  SGB;  the  asymptotic  giant  branch,  designated  AGB;  and  the  horizontal 
branch,  denoted  HB,  that  extends  to  the  left.  The  gap  of  missing  stars  in  the  horizontal  branch  for 
the  globular  star  cluster  M  5  shows  the  instability  strip  of  pulsating  stars,  known  as  RR  Lyrae 
stars.  [Adapted  from  Arp  (1962).] 


burn  abruptly  in  the  core,  releasing  a  flash  of  intense  energy  that  rejuvenates  a 
star’ s  luminous  output  and  initiates  core  helium  fusion  into  carbon,  which  can  last 

o 

about  10  years  (Schwarzschild  and  Harm  1962).  Harm  was  born  and  educated  in 
Estonia,  but  moved  to  the  United  States  during  World  War  II  (1939-1945), 
spending  his  entire  subsequent  career  at  Princeton  University. 

When  the  core  helium  is  exhausted,  the  carbon  and  oxygen  core  collapses, 
helium  is  burned  in  a  surrounding  shell,  and  the  star  briefly  rises  to  giant  status  a 
second  time,  along  the  asymptotic  branch  of  the  H-R  diagram.  A  short  period  of 
instability  then  begins,  where  the  star  can  pulsate,  but  it  is  approaching  its  end. 

An  even  briefer  stellar  life,  with  more  violent  winds,  is  expected  for  the  more 
massive  main-sequence  stars  that  become  supergiants.  They  help  explain  where 
most  of  the  heavier  elements  originate. 
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10.5  Where  did  the  Chemical  Elements  Come  From? 

10.5.1  Advanced  Nuclear  Burning  Stages  in  Massive 
Supergiant  Stars 

A  supergiant  star  has  a  much  greater  mass,  interior  compression,  central  temper¬ 
ature,  and  luminosity  than  a  giant  star,  which  is  why  we  call  them  “super.” 
Supergiants  pass  through  the  same  early  stages  of  stellar  life  as  giants  but  at  a 
faster  rate.  Unlike  their  counterparts  of  lesser  mass,  massive  stars  with  a  mass 
above  10  solar  masses  quickly  consume  the  hydrogen  in  their  cores  and  transform 
into  colossal  supergiants  that  bum  the  next  available  nuclear  fuel:  helium.  A  star 
with  a  mass  of  25  times  that  of  the  Sun,  for  example,  will  complete  hydrogen 
burning  by  the  CNO  cycle  in  about  7  million  years  and  helium  burning  in  a  mere 
660,000  years.  In  contrast,  the  Sun  will  take  roughly  10  billion  years  to  complete 
hydrogen  burning  by  the  proton-proton  chain  and  another  100  million  years  to 
consume  the  helium  within  its  core. 

When  the  core  helium  is  consumed,  the  evolutionary  paths  of  high-mass  and 
moderate-mass  stars  diverge.  The  cores  of  supergiant  stars  are  so  massive  that  they 
can  contract  and  heat  up  enough  to  burn  carbon,  thereby  stopping  the  core  collapse 
and  shining  with  renewed  vigor.  Carbon  nuclei  have  an  electrical  charge  equiv¬ 
alent  to  six  protons;  therefore,  a  formidable  electrical  repulsion  separates  them, 
and  collisions  at  great  speed  are  required  for  its  penetration.  This  can  happen  when 
the  temperature  rises  to  about  a  billion  K,  or  109  K. 

In  contrast,  the  giant  stars  are  not  sufficiently  massive  to  burn  anything  heavier 
than  helium  because  they  never  get  hot  enough  inside.  In  technical  terms, 
degeneracy  pressure  halts  the  contraction  of  the  inert,  nonburning  carbon  core  in  a 
giant  star  before  it  can  become  hot  enough  for  carbon  fusion.  These  stars  expel 
their  outer  layers  into  surrounding  space  and  collapse  inside  to  an  Earth-sized 
white  dwarf  star. 

A  supergiant  is  so  massive  that  it  can  enter  progressively  more  advanced 
nuclear-burning  stages  (Salpeter  1957;  Weaver  et  al.  1978;  Heger  et  al.  2003).  The 
core  helium  is  converted  into  carbon,  and  some  of  the  newly  formed  carbon  nuclei 
can  fuse  with  helium  nuclei  to  make  oxygen.  When  the  helium  is  gone,  the  core 
contracts  until  it  becomes  hot  enough  to  bum  carbon  into  neon,  which  temporarily 
re- stabilizes  the  core.  Each  time  the  core  depletes  the  elements  that  it  is  fusing,  or 
“burning,”  it  shrinks  and  heats  up  until  it  becomes  hot  enough  for  fusion  reactions 
of  the  nuclear  ash,  continuing  up  the  chain  of  successively  heavier  abundant 
elements.  At  the  same  time,  nuclear  fusion  of  the  earlier  fuel  continues  in  over¬ 
lapping  shells  at  lower  temperatures.  Layer  upon  layer  of  nuclear  burning  shells 
are  created  deep  down  inside  a  supergiant. 

The  aging  of  a  supergiant  star  accelerates  rapidly,  consuming  its  internal  fuel 
sources  at  ever-increasing  central  temperatures  and  rates  (Table  10.8).  Due  to  the 
higher  temperatures  needed  for  these  nuclear  reactions  to  occur,  they  also  proceed 
at  a  much  more  rapid  rate  than  hydrogen  burning,  and  the  thermonuclear  lifetime 
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Table  10.8  Nuclear  fusion  processes  in  a  supergiant  star  of  25  solar  massesa 


Core  fusion  process 

Central  temperature 
(K) 

Central  density 

(kg  m-3) 

Duration 

(years) 

Hydrogen  burning  (H 

-►  He) 

3.7  x  107 

3.8  x  103 

7,300,000 

Helium  burning  (He  - 

->  C  and 

1.8  x  108 

6.2  x  105 

660,000 

O) 

Carbon  burning  (C  — >• 

Ne) 

7.2  x  108 

6.4  x  108 

165 

Neon  burning  (Ne  -> 

Mg  and 

1.4  x  109 

3.7  x  109 

1.2 

Si) 

Oxygen  burning  (0  - 

>  Si) 

1.8  x  109 

1.3  x  1010 

0.5 

Silicon  burning  (Si  -> 

Fe) 

3.4  x  109 

1.1  x  10“ 

0.004 

a  Adapted  from  Weaver  et  al.  (1978) 


of  the  supergiant  stars  therefore  is  much  shorter  than  those  of  even  the  giant  stars. 
This  is  one  reason  that  so  few  supergiant  stars  are  observed. 

In  the  terminal  stages  of  a  supergiant’s  life,  its  inner  core  is  converting  silicon 
into  iron,  and  onion-like  overlying  layers  are  burning  lighter  elements,  such  as 
magnesium,  oxygen,  neon,  carbon,  and  helium.  However,  when  the  iron  nuclei  in 
the  core  of  such  a  star  are  pushed  together,  no  energy  is  released.  The  iron  does  not 
burn,  regardless  of  how  hot  the  star’s  core  becomes.  The  nuclear  fires  are  extin¬ 
guished;  and  the  star  has  reached  the  end  of  its  life.  It  can  never  again  shine  by  any 
slow  nuclear-fusion  process,  and  there  is  no  energy  left  to  support  the  core.  The 
star  has  become  bankrupt,  having  completely  spent  all  of  its  internal  resources. 
There  is  nothing  left  to  do  but  collapse. 

The  inert  core  collapses  in  less  than  1  s,  bounces  and  then  explodes  as  a  supernova 
with  the  light  of  1  billion  Suns.  The  shattered  star  and  all  of  the  elements  made  inside 
it  are  then  dispersed  into  surrounding  space.  This  material  provides  the  seeds  for 
future  planets  and  stars,  which  explains  where  most  of  the  heavy  elements  that  are 
now  found  in  the  Earth  and  the  Sun  came  from. 


10.5.2  Origin  of  the  Material  World 

What  accounts  for  the  origin  of  the  chemical  elements  that  make  up  our  everyday 
world?  The  English  astronomer  Arthur  Stanley  Eddington  (1882-1944)  was  one  of 
the  first  to  propose  that  the  light  elements  are  compounded  into  more  complex 
elements  within  stars  (Eddington  1920).  Then,  within  a  decade,  the  great  stellar 
abundance  of  hydrogen  had  been  established,  and  Robert  d’Escourt  Atkinson 
(1898-1982)  showed  how  element  synthesis  in  stars,  starting  with  the  proton- 
proton  reaction,  might  account  for  both  stellar  energy  and  the  origin  of  the 
elements  (Atkinson  1931).  In  his  view,  the  observed  relative  abundance  of  the 
elements  could  be  explained  by  the  creation  of  less  abundant,  heavy  nuclei  from 
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more  abundant,  lighter  nuclei,  particularly  hydrogen  and  helium.  The  formation  of 
heavy  nuclei  from  the  nuclear  reactions  of  lighter  nuclei  is  termed  nucleosynthesis. 

Baron  Carl  Friedrich  von  Weizsacker  (1912-2007)  proposed  two  mechanisms 
for  nucleosynthesis:  either  within  stars  or  in  the  primeval  “fireball”  explosion,  out 
of  which  the  expanding  universe  arose  (Weizsacker  1937,  1938).  Today,  we  know 
that  our  material  world  indeed  was  synthesized  in  both  places.  Most  of  the  heavy 
elements  that  now  are  found  in  the  universe  were  created  within  former  stars  by 
various  nuclear  reactions  at  different  times  and  under  different  physical  conditions. 
However,  ah  of  the  hydrogen  and  most  of  the  helium  now  observed  in  the  universe 
was  synthesized  in  the  big-bang  “fireball”  explosion.  Alpher  and  Herman  (1950), 
Trimble  (1975),  Penzias  (1979),  and  Fowler  (1984)  have  reviewed  the  early  his¬ 
tory  of  ideas  concerning  element  formation  in  stars  and  the  big  bang.  Our 
understanding  of  the  details  of  these  processes  is  intimately  related  to  the  observed 
element  abundance  in  the  Sun. 


10.5.3  The  Observed  Abundance  of  the  Elements 

An  important  key  to  understanding  how  stars  synthesize  the  elements  is  obtained 
from  their  relative  abundances,  initially  studied  by  chemists  rather  than  astrono¬ 
mers.  The  American  chemist  William  D.  Harkins  (1873-1951),  for  example,  found 
an  important  clue  to  the  mystery  of  the  origin  of  the  elements  when  he  noticed  that 
elements  of  low  atomic  weight  are  more  abundant  than  those  of  high  atomic 
weight  and  that,  on  average,  the  elements  with  even  atomic  numbers  are  about  10 
times  more  abundant  than  those  with  odd  atomic  numbers  of  about  the  same 
number.  These  features  led  Harkins  to  conjecture  that  the  relative  abundances  of 
the  elements  depend  on  nuclear  rather  than  chemical  properties  and  that  heavy 
elements  must  have  been  synthesized  from  lighter  ones  (Harkins  1917,  1931). 

Decades  later,  astronomers  showed  that  stars  are  the  crucibles  in  which  ah  but 
the  lightest  elements  are  formed  by  nuclear  reactions  that  convert  light  elements 
into  heavier  ones,  and  that  the  systematic  decline  in  the  abundance  of  heavier 
elements  can  be  attributed  to  the  relative  scarcity  of  stars  that  have  evolved  to  the 
stage  that  creates  them. 

Two  other  American  chemists,  Hans  E.  Suess  (1909-1993)  and  Harold  Clayton 
Urey  (1893-1981),  provided  a  detailed  discussion  of  the  elemental  and  isotopic 
abundances  of  the  Sun  and  similar  stars,  calling  attention  to  the  many  fluctuations 
that  appear  in  the  general  trend  of  an  exponential  decline  of  abundance  with 
increasing  atomic  weight  (Suess  and  Urey  1956).  This  discussion  served  as  a  major 
stimulus  for  modern  ideas  concerning  stellar  nucleosynthesis  (Fig.  10.17). 

Asplund  et  al.  (2009)  have  reviewed  observations  of  the  chemical  composition 
of  the  Sun;  Wilson  and  Rood  (1994)  provided  an  early  summary  of  abundances  in 
the  interstellar  medium;  and  Savage  and  Sembach  (1996)  reviewed  interstellar 
abundances  from  absorption-line  observations  with  the  Hubble  Space  Telescope. 
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Fig.  10.17  Abundance  and  origin  of  the  elements  in  the  Sun  The  relative  abundance  of 

elements  in  the  solar  photosphere,  plotted  as  a  function  of  their  atomic  number  Z,  which  is  the 

number  of  protons  in  an  atom’s  nucleus  and  roughly  half  the  atomic  weight.  The  abundance  is 

12 

plotted  on  a  logarithmic  scale  and  normalized  to  a  value  of  1  million  million,  or  1.0  x  10  ,  for 
hydrogen.  Hydrogen,  the  lightest  and  most  abundant  element  in  the  Sun,  was  formed  about 
14  billion  years  ago  in  the  immediate  aftermath  of  the  big  bang  that  led  to  the  expanding 
universe.  Most  of  the  helium  now  in  the  Sun  was  also  created  then.  All  of  the  elements  heavier 
than  helium  were  synthesized  in  the  interiors  of  massive  stars  that  were  then  wafted  or  blasted 
into  interstellar  space  where  the  Sun  subsequently  originated.  Carbon,  nitrogen,  oxygen,  and  iron 
were  created  over  long  time  intervals  during  successive  nuclear  burning  stages  in  former  massive 
stars  as  well  as  during  their  explosive  death.  Elements  heavier  than  iron  were  produced  by 
neutron-capture  reactions  during  the  supernova  explosions  of  stars  that  lived  and  died  before  the 
Sun  was  bom.  The  light  elements  boron,  beryllium  and  most  lithium  are  believed  to  originate 
from  heavier  cosmic -ray  particles  that  were  stripped  of  some  of  their  ingredients  by  collisions,  in 
a  process  called  spallation.  The  exponential  decline  of  abundance  with  increasing  atomic  number 
and  weight  can  be  explained  by  the  rarity  of  stars  that  evolved  to  later  stages  of  life.  (Data 
courtesy  of  Nicolas  Grevesse.) 


10.5.4  Synthesis  of  the  Elements  Inside  Stars 

The  English  astronomer  Fred  Hoyle  (1915-2001)  introduced  the  grand  concept  of 
nucleosynthesis  in  stars  in  the  mid  twentieth  century  (Hoyle  1946,  1954).  He 
showed  that  both  theoretical  and  experimental  considerations  were  required,  and 
placed  the  concept  of  stellar  nuclear  reactions  within  the  framework  of  stellar 
structure  and  evolution,  using  the  then-known  nuclear  data. 

By  1957  the  detailed  abundance  data  of  Suess  and  Urey  had  served  as  an 
inspiration  for  a  comprehensive  review  of  the  major  element- producing  reactions 
in  stars  by  Geoffrey  R.  Burbidge  (1925-2010),  E.  Margaret  Burbidge  (1919-  ), 
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Fred  Hoyle  and  William  A.  “Willy”  Fowler  (1911-1995).  Their  seminal  publi¬ 
cation,  entitled  “Synthesis  of  the  Elements  in  Stars”  was  published  in  the  Reviews 
of  Modern  Physics  (Burbidge  et  al.  1957);  it  is  known  as  the  B“FH  paper,  after  the 
surnames  of  the  four  authors.  It  provided  the  fundamental  framework  on  which 
most  subsequent  studies  of  stellar  nucleosynthesis  are  based  (Fig.  10.17). 

As  a  star  ages,  it  experiences  successive  core-contraction  and  stable-core 
burning  stages,  in  which  heavier  elements  are  synthesized  from  lighter  ones,  and 
the  nuclear  ash  of  one  stage  becomes  the  fuel  of  the  next  one.  These  core  nuclear- 
burning  reactions  proceed  at  a  progressively  hotter,  denser,  and  faster  pace,  such  as 
successive  hydrogen,  helium,  carbon,  neon,  oxygen,  and  silicon  burning.  Elements 
with  an  even  number  of  protons  in  their  nuclei,  such  as  carbon  and  oxygen,  are 
more  abundant  because  they  are  formed  by  nuclear  reactions  with  helium,  which 
contains  two  protons  rather  than  one  in  its  nucleus. 

As  realized  by  B“FH,  the  temperature  is  not  the  same  everywhere  inside  a  star; 
therefore,  its  nuclear  evolution  is  most  advanced  in  the  central  regions  and  least  or 
not  at  all  advanced  near  its  outer  regions.  Thus,  the  composition  of  an  aging  star  is 
not  uniform  throughout.  At  the  end  of  its  life,  a  massive  star  would  be  layered  with 
successively  thinner  shells  of  helium,  carbon,  oxygen,  and  silicon  surrounding  an 
inert,  nonburning  iron  core  that  explodes  as  a  supernova,  casting  out  the  heavy 
elements  from  the  overlying  layers  of  the  star. 

Although  the  general  flow  of  the  observed  relative  abundance  of  the  more 
abundant  elements  can  be  explained  by  successive  static,  or  nonexplosive,  burning 
stages  within  stars,  elements  heavier  than  iron  -  as  well  as  many  of  the  detailed  ups 
and  downs  of  the  abundance  curve  for  lighter  elements  -  were  attributed  to  fast 
nuclear  reactions  during  the  explosive,  supernova  death  of  massive  stars.  The 
American  hydrogen-bomb  tests  in  the  1950s  had  indicated  that  heavy  elements  are 
created  by  rapid  neutron  bombardment  during  the  explosions,  and  Hoyle  and 
Fowler  knew  that  similar  processes  occur  in  supernovae  (Hoyle  and  Fowler  1960). 

B~FH  used  slow,  s,  neutron  capture;  rapid,  r,  neutron  capture;  and  proton,  p, 
capture  to  explain  some  of  the  details  of  the  abundance  curve.  Some  of  the 
nuclear-burning  reactions  produced  free  neutrons,  residing  outside  any  atomic 
nucleus.  These  neutrons  can  slowly  fuse  with  abundant  heavy  nuclei  to  produce 
the  relatively  rare  ones,  encountering  no  electrical  obstacle  because  the  neutron  is 
electrically  uncharged.  Unhindered  by  electrical  repulsion,  the  neutrons  permit  the 
extension  of  stellar  nucleosynthesis  from  iron  all  the  way  to  uranium.  A  free 
neutron  also  can  decay  into  a  proton,  and  the  rapid  capture  of  both  neutrons  and 
protons  during  supernova  explosion  helps  to  forge  the  heavier  elements. 

Busso  et  al.  (1999)  reviewed  nucleosynthesis  in  asymptotic  giant  branch  (AGB) 
stars.  Arnett  (1973,  1995)  reviewed  explosive  nucleosynthesis  in  stars,  whereas 
Meyer  (1994)  gave  us  a  review  of  the  r-,  s-  and  p-  processes  in  nucleosynthesis. 
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10.5.5  Big-Bang  Nucleosynthesis 

The  synthesis  of  elements  inside  stars  is  an  incomplete  scenario  for  it  does  not 
explain  the  origin  of  any  of  the  hydrogen  and  most  of  the  helium  in  the  observable 
universe.  Moreover,  because  deuterium  is  destroyed  rapidly  inside  stars,  there  also 
must  be  another  explanation  for  its  cosmic  existence.  As  proposed  by  the  eclectic 
George  Gamow  (1904-1968)  and  his  colleagues,  these  elements  must  have  been 
produced  during  the  exceptionally  hot  and  dense  first  moments  of  the  big-bang 
explosion  that  gave  rise  to  the  expansion  of  the  universe.  In  fact,  it  once  was 
thought  that  all  of  the  elements  found  today  might  have  been  created  back  then. 

Working  with  his  young  colleague  Ralph  A.  Alpher  (1921-2007),  Gamow 
proposed  that  all  of  the  elements  were  produced  in  a  chain  of  nuclear  reactions 
during  the  earliest  stages  of  the  expanding  universe.  They  supposed  that  the  ori¬ 
ginal  substance  of  the  material  universe,  the  cosmic  “ylem,”  consisted  solely  of 
neutrons  at  high  temperature.  Some  of  these  neutrons  decayed  into  protons,  and 
successive  captures  of  neutrons  by  protons  led  to  the  formation  of  the  elements. 

This  novel  idea  was  published  in  1948,  in  a  paper  titled  “The  Origin  of  the 
Chemical  Elements,”  with  Hans  A.  Bethe  (1906-2005)  added  as  an  author,  even 
though  he  contributed  nothing  to  the  research  or  the  writing.  This  was  done  to  make  a 
pun  on  the  first  letters  of  the  Greek  alphabet  -  alpha,  beta,  and  gamma  or  a,  /?,  and  y 
(Alpher  et  al.  1948). 

Two  years  after  the  ot-fi-y  paper,  the  Japanese  astrophysicist  Chushiro  Hayashi 
(1920-2010)  showed  that  in  the  first  moments  of  the  expansion,  the  temperature 
was  hot  enough  to  create  particles  such  as  neutrinos  and  positrons,  the  anti-matter 
particles  of  electrons  (Hayashi  1950).  The  mutual  interaction  of  all  of  the  sub¬ 
atomic  particles  present  in  the  first  moments  of  the  big  bang  establishes  the  relative 
number  of  neutrons  and  protons,  which  in  turn  determine  the  amount  of  helium 
produced. 

With  this  correction,  modern  computations  by  Robert  V.  Wagoner  (1938-  ), 
David  Schramm  (1945-1997)  and  their  colleagues  conclusively  demonstrated  that 
all  of  the  hydrogen  and  most  of  the  helium  found  in  the  cosmos  today  were 
synthesized  in  the  immediate  aftermath  of  the  big  bang  (Wagoner  et  al.  1967; 
Peebles  et  al.  1991). 

These  processes  are  known  as  big-bang  nucleosynthesis ,  and  they  tell  us  how 
much  matter  is  now  in  the  universe  in  both  visible  and  invisible  forms.  Boesgaard 
and  Steiman  (1985)  have  reviewed  the  theory  and  observations  of  big-bang 
nucleosynthesis. 

So  Gamow  was  partly  right.  The  lightest  and  most  abundant  element,  hydrogen  - 
and  therefore  the  majority  of  atoms  that  we  see  today  -  indeed  were  formed  even 
before  the  stars  existed,  in  the  immediate  aftermath  of  the  big  bang  that  produced  the 
expanding  universe.  All  of  the  hydrogen  that  is  now  found  in  stars,  interstellar  space, 
and  the  rest  the  universe  was  created  then,  about  14  billion  years  ago,  and  so  was 
most  of  the  helium,  the  second  most  abundant  element. 
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Why  weren’t  all  of  the  heavier  elements  produced  in  the  early  stages  of  the  big 
bang?  By  the  time  that  the  expanding  universe  became  sufficiently  cool  to  allow 
nucleosynthesis  to  occur,  at  a  temperature  of  about  1  billion  K,  it  had  expanded 
into  a  greater  volume  and  become  less  dense.  The  rate  of  helium  burning  by  the 
triple  alpha  process  is  proportional  to  the  square  of  the  density  at  a  given  tem¬ 
perature,  and  at  1  billion  degrees  the  density  of  the  expanding  universe  is  too  low 
by  many  orders  of  magnitude  for  appreciable  operation  of  the  triple  alpha  process. 

In  simpler  terms,  the  expanding  universe  was  not  and  is  not  in  equilibrium,  so  it 
rapidly  cooled  down  and  thinned  out  to  low  density,  making  the  simultaneous 
collision  of  three  helium  nuclei  -  alpha  particles  -  nearly  impossible.  The  dense 
cores  of  giant  stars,  however,  are  in  equilibrium  for  the  long  intervals  of  time 
needed  to  accrue  noticeable  amounts  of  carbon  by  the  triple  alpha  process.  Only 
the  very  light  elements,  such  as  deuterium  and  helium  could  be  produced  by  two- 
body  reactions  during  the  big  bang,  whereas  carbon  had  to  be  synthesized  by  three- 
body  reactions  in  the  interior  of  stars. 

This  completes  our  account  of  the  origin  of  the  elements  inside  stars  and  during 
the  big  bang,  with  one  oversight:  the  under-abundant,  light  elements  with  atomic 
weights  between  hydrogen  and  helium.  These  light  elements  -  boron,  beryllium, 
and  most  of  the  lithium  -  probably  were  produced  by  spallation  reactions  in  which 
energetic  charged  particles,  known  as  cosmic  rays,  strip  off  the  components  of 
heavy  nuclei  to  form  light  nuclei  (Reeves  et  al.  1973;  Reeves  1994).  Wallerstein 
and  Conti  (1969)  reviewed  lithium  and  beryllium  in  stars.  For  references  to 
research  papers  on  the  observations  and  explanations  of  the  abundance  of  light 
elements  see  Lang  (1999). 


10.5.6  The  First  and  Second  Generation  of  Stars 

Because  big-bang  nucleosynthesis  produced  no  elements  heavier  than  helium,  the 
earliest  stars  had  to  be  composed  of  the  lightest  abundant  elements:  hydrogen  and 
helium.  This  first  generation  of  stars  is  known  as  Population  I  stars.  Then,  as  a 
result  of  ongoing  stellar  alchemy,  the  most  massive  first- generation  stars  forged 
heavier  elements  in  their  cores,  scattering  them  into  space  by  winds  or  explosions 
as  they  died.  These  heavy  elements,  known  as  metals  to  astronomers,  then  were 
recycled  and  incorporated  into  second  generation  stars.  Because  some  of  their 
material  came  from  previous  stars,  these  Population  II  stars  are  polluted  somewhat 
by  the  heavier  elements,  the  metals. 

Baade  (1952)  proposed  the  existence  of  two  stellar  Populations,  finding  that 
pulsating  variable  stars  of  the  two  types  have  different  period-luminosity  relations. 
Ivezic  et  al.  (2012)  have  provided  a  recent  review  of  galactic  stellar  Populations. 
The  first  stars  to  be  formed  in  the  universe  have  been  designated  Population  III. 

Observations  of  stellar  spectra  confirm  this  scenario.  Very  old  Population  I 
stars,  which  formed  when  the  universe  was  young,  have  less  than  0.1  %  of  their 
mass  in  elements  heavier  than  hydrogen  or  helium.  We  see  these  survivors  of  the 
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earliest  times  in  the  oldest  globular  star  clusters.  Astronomers  have  not  yet  found 
completely  pure  stars  with  absolutely  no  metals,  but  they  are  confident  that  the 
second-generation  Population  II  stars  contain  a  greater  proportion  of  heavy  ele¬ 
ments,  at  2-3  %  of  their  mass. 

Audouze  and  Tinsley  (1976)  provided  an  early  discussion  of  the  chemical 
evolution  of  galaxies;  Rana  (1991)  subsequently  reviewed  the  chemical  evolution 
of  our  Galaxy;  and  McWilliam  (1997)  reviewed  abundance  ratios  and  galactic 
chemical  evolution. 

Beers  and  Christlieb  (2005)  reviewed  the  discovery  and  analysis  of  very  metal- 
poor  stars  in  the  Galaxy,  whereas  Sandage  (1986)  reviewed  the  population  concept 
and  related  topics  such  as  the  collapse  of  our  Galaxy.  Gratton  et  al.  (2004) 
reviewed  abundance  variations  within  globular  star  clusters;  Wheeler  et  al.  (1989) 
have  reviewed  the  abundance  ratios  as  a  function  of  metallicity. 


10.5.7  Cosmic  Implications  of  the  Origin  of  the  Elements 

During  the  billions  of  years  before  the  Sun  was  bom,  massive  stars  reworked  the 
chemical  elements,  fusing  lighter  elements  into  heavier  ones  within  their  nuclear 
furnaces.  Carbon,  oxygen,  nitrogen,  silicon,  iron,  and  most  of  the  other  heavy 
elements  were  created  this  way.  The  enriched  stellar  material  then  was  cast  out 
into  interstellar  space  by  the  short-lived  massive  stars,  gently  blowing  out  in  their 
stellar  winds  or  explosively  ejected  within  supernova  remnants. 

The  Sun  and  its  retinue  of  planets  condensed  from  this  material  about 
4.6  billion  years  ago.  They  are  composed  partly  of  heavy  elements  that  were 
synthesized  in  stars  that  lived  and  died  before  the  Sun  and  planets  were  born.  The 
Earth  and  everything  on  it  spawned  from  this  recycled  material. 

Moreover,  the  first  stars  could  not  have  had  rocky  planets  like  the  Earth  because 
there  initially  was  nothing  but  hydrogen  and  helium.  The  only  possible  planets 
would  have  been  icy  balls  of  frozen  gas.  Without  carbon,  life  as  we  know  it  could 
not  evolve  on  these  planets. 

Perhaps  the  most  fascinating  aspect  of  stellar  alchemy  is  its  implications  for  life 
on  the  Earth.  Most  of  the  chemical  elements  in  our  bodies,  from  the  calcium  in  our 
teeth  to  the  iron  that  makes  our  blood  red,  were  created  billions  of  years  ago  in  the 
hot  interiors  of  long-vanished  stars.  Therefore,  we  are  all  made  of  “star  stuff”.  If 
the  universe  were  not  very,  very  old,  there  would  not  have  been  enough  time  to 
forge  the  necessary  elements  of  life  in  the  ancient  stars.  The  lightest  element, 
hydrogen,  needed  for  the  water  in  our  bodies,  was  synthesized  when  the  obser¬ 
vable  universe  was  very  young,  in  the  first  instants  of  the  big  bang.  Therefore,  we 
are  the  offspring  of  both  the  star  and  the  big  bang  -  true  children  of  the  cosmos. 


Chapter  11 

The  Material  Between  the  Stars 


11.1  Gaseous  Emission  Nebulae 

The  space  between  the  stars,  which  looks  like  an  empty  black  void  is  filled  with 
cold  atoms  of  hydrogen.  Stars  form  out  of  this  supposed  emptiness  and  eventually 
return  to  it.  We  see  some  of  this  interstellar  material  when  the  brightest  stars 
illuminate  nearby  regions  (Fig.  11.1).  The  energetic  starlight  heats,  ionizes,  and 
lights  up  the  surrounding  material.  Such  regions  are  known  as  emission  nebulae.  In 
the  parlance  of  modern  astronomers,  they  also  are  known  as  H  II  regions ,  pro¬ 
nounced  “H  two  regions,”  because  they  contain  ionized  hydrogen,  where  H 
denotes  hydrogen  and  II  indicates  that  the  atom  is  missing  its  one  electron. 

Bright,  extended  regions  in  the  cosmos  have  been  noticed  ever  since  telescopes 
were  first  trained  on  the  night  sky.  Since  they  looked  like  large,  diffuse  places  with 
a  definite  size,  unlike  the  unresolved  stars,  early  astronomers  called  these  places 
nebulae ,  the  Latin  word  for  “clouds.”  In  the  late  eighteenth,  century,  the  French 
astronomer  Charles  Messier  (1730-1817)  published  a  famous  catalogue  that  listed 
the  brightest  nebulae  as  well  as  star  clusters  (Messier  1781);  they  are  now  des¬ 
ignated  by  M  followed  by  the  number  in  his  list.  Some  of  these  regions  include  the 
emission  nebulae  (Table  11.1). 

In  1864  the  English  astronomer  William  Huggins  (1824-1910)  discovered 
emission  lines  in  the  visible-light  spectra  of  these  objects  (Huggins  1864),  which 
explains  why  the  word  emission  is  part  of  their  name.  Lines  of  ionized  hydrogen, 
ionized  nitrogen,  and  a  then-unidentified  substance  named  nebulium  were  found. 

As  it  turned  out,  an  emission  nebula,  or  H  II  region,  is  nothing  more  than  a  great 
ball  of  ionized  gas  with  an  exceptionally  hot,  massive  star  at  its  center.  The 
ultraviolet  radiation  of  a  luminous  blue  star,  of  spectral  class  05,  ionizes  and  heats 
the  surrounding  gas  to  temperatures  of  about  10,000  K  (Eddington  1926a),  and  the 
hydrogen  emission  lines  are  emitted  when  the  electrons  freed  by  ionization 
recombine  with  the  protons  to  make  hydrogen  atoms  (Zanstra  1927,  1928). 
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Example:  Ionization  wavelength  for  hydrogen  and  oxygen 

The  radiation  photon  energy  hv  =  he /X  required  for  ionization  of  an  atom  to 
a  given  state  of  ionization  is  given  by: 

/!V=y  =  £;,  (11.1) 

or 

.  he  1.9864  x  10-25 

E~ - Et - m’  (1L2) 

where  v  is  the  radiation  frequency,  X  is  the  radiation  wavelength,  the  Planck 
constant  h  =  6.6261  x  10-34  Js,  the  speed  of  light  c  =  2.9979  x  108  ms-1, 
and  Et  is  the  ionization  potential  for  the  stage  of  ionization  in  the  joule  unit  of 
energy.  The  ionization  potential  is  often  given  in  electron  volts  (eV),  where 
1  eV  =  1.6022  x  10  19  J.  The  number  of  electrons  freed  from  the  atom  is 
equal  to  the  ion’s  identifying  Roman  numeral  number  minus  one. 

The  H  II  ion  is  a  hydrogen  atom  missing  its  one  electron  with 
Et  =  13.5  eV  and  O  III  is  an  oxygen  atom  missing  two  electrons  with 
Et  =  35.116  eV.  The  radiation  wavelength  2  at  which  the  photon  energy  is 
just  equal  to  the  ionization  energy  is  given  by  2  =  ( hc)/Ei  = 

r\  ^ 

1.9864  x  10  /Et  =  91.8  nm  and  35.3  nm  for  ionized  hydrogen  and 
doubly  ionized  oxygen,  respectively,  where  1  nm  =  10-9  m.  These  wave¬ 
lengths  lie  in  the  ultraviolet  range,  on  the  short- wavelength  side  of  blue  light, 
or  roughly  from  10  to  400  nm. 


As  shown  by  the  American  astronomer  Ira  S.  Bowen  (1898-1973),  spectral 
lines  of  emission  nebulae  also  arise  from  ionized  atoms  of  oxygen  and  nitrogen 
undergoing  “forbidden”  transitions  in  the  nebulae;  such  transitions  are  improbable 
in  the  laboratory  situation.  In  a  remarkable  piece  of  detective  work,  Bowen  was 
able  to  match  previously  unidentified  nebular  lines  with  [O  II],  [N  II],  [O  III]  and 
[N  III],  where  O  and  N  denote  oxygen  and  nitrogen,  respectively;  II  and  III  denote 
that  the  atom  is  missing  one  or  two  electrons,  respectively;  and  the  square  brackets 
[]  denote  a  forbidden  line  (Bowen  1927,  1928).  Thus,  in  addition  to  hydrogen,  the 
dominant  spectral  lines  of  emission  nebulae  are  those  of  cosmically  abundant 
oxygen  and  nitrogen  (Table  11.2),  as  well  as  sulfur,  S  II. 

As  a  luminous  star’ s  ultraviolet  radiation  moves  out  into  surrounding  space,  it  is 
absorbed  and  consumed  by  interstellar  hydrogen  atoms  and  other  less  abundant 
atoms  (Fig.  11.2).  Eventually,  all  of  the  available  ultraviolet  rays  are  used  up  in 
ionizing  atoms  close  to  the  star.  As  a  result,  the  rays  cannot  travel  farther  than  the 
immediate  vicinity  of  the  star.  This  is  one  of  the  reasons  why  the  night  sky  remains 
black  outside  the  periphery  of  emission  nebulae;  the  other  reasons  are  the  low 
temperature  and  density  of  the  interstellar  material. 
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Fig.  11.1  Rosette  Nebula  This  large,  circular  emission  nebula,  or  H  II  region,  known  as  the 
Rosette  Nebula,  lies  at  a  distance  of  about  1.6  kpc,  or  5,200  light-years  from  the  Earth  and  is 
about  40  pc  or  130  light-years  in  diameter.  Parts  of  this  region  include  nebulae  designated  as 
NGC  2237,  NGC  2238,  and  NGC  2239,  as  well  as  the  open  star  cluster  NGC  2244.  The  mass  of 
the  Rosette  Nebula  is  estimated  to  be  10,000  solar  masses,  or  104  M0.  Hot  O  and  B  stars  in  the 
core  of  the  Rosette  Nebula  exert  pressure  on  the  nearby  interstellar  material,  triggering  star 
formation,  and  heat  the  surrounding  gas  to  a  temperature  of  about  6  million  K,  causing  it  to  emit 
x-rays  observed  from  the  Chandra  x-ray  Observatory.  (Courtesy  of  KPNO/CTIA.) 


Table  11.1  Bright  named  emission  nebulae 

RA  (2000)  December  (2000)  9a  Distanceb(light-years) 

Name  h  m  °  '  'x' 

Lagoon  Nebula,  M  8  18  03.6  -24  23  40  x  90  4,100 

(Also  known  as  NGC  6523,  central  region  Hourglass  Nebula  and  star  cluster  NGC  6530) 
Omega  Nebula,  M  17  18  20.4  -16  11  37  x  47  5,000 

(Also  known  as  NGC  6618,  Horseshoe  Nebula,  and  Swan  Nebula) 

Orion  Nebula,  M  42  05  35.3  -05  24  60  x  65  1,344 

(Also  known  as  NGC  1976,  northwestern  part  is  M  43,  NGC  1982) 

Rosette  Nebula,  NGC  2237  06  33.8  +05  00  78  x  78  5,200 

(Nebulous  region  includes  NGC  2238,  NGC  2239,  NGC  2246  and  cluster  NGC  2244) 

Trihd  Nebula,  M  20  18  02.4  -23  02  27  x  29  7,600 

(Also  known  as  NGC  6514) 

a  Angular  diameter  9  in  minutes  of  arc,  designated  ' 

b  Distance  in  light-years,  where  1  light-year  =  9.46  x  1015  m.  For  comparison,  1 
pc  =  1  parsec  =  3.0857  x  1016  m 
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Table  11.2  Intense  spectral 
lines  of  emission  nebulae 


Element 

Wavelength  (nm) 

[OH] 

372.62 

[0 II] 

372.89 

[O  III] 

436.32 

HjS 

486.1332 

[0  III] 

495.891 

[O  III] 

500.684 

[N  II] 

654.81 

Ha 

656.28 

[N  II] 

658.36 

[S  II] 

671.6440 

[S  II] 

673.0816 

A  central  blue  star  of  spectral  class  05  has  a  main- sequence  lifetime  of  only 
about  3.7  million  years,  and  these  massive,  relatively  young  stars  are  still 
embedded  within  the  material  from  which  they  formed.  Radiation  pressure  of  the 
hot,  luminous  star  will  drive  most  of  the  surrounding  gas  away  in  a  few  million 
years.  So  an  emission  nebula,  or  H  II  region,  is  no  older  than  its  central  star,  which 
is  enveloped  by  dense  material  that  coalesced  to  form  the  central  star. 


Fig.  11.2  Spheres  of  ionization  Ultraviolet  radiation,  denoted  by  UV,  from  a  hot  star  ionizes 
hydrogen  and  other  atoms  in  its  immediate  vicinity,  creating  a  nebulous  region  that  radiates 
emission  lines  and  contains  abundant  ionized  hydrogen,  denoted  H  II.  They  are  known  as 
emission  nebulae  or  H  II  regions.  In  this  figure,  the  size  of  the  atoms,  ions,  electrons,  and  protons 
are  exaggerated  greatly.  The  ionization  by  the  UV  creates  numerous  free  electrons  and  protons 
that  are  not  attached  to  atoms.  They  subsequently  recombine  to  make  atoms  in  a  process  of 
continued  ionization  and  recombination.  The  free  electrons  can  emit  two  types  of  radiation, 
illustrated  in  Fig.  11.3.  At  large  distances  from  the  star,  its  ultraviolet  rays  are  all  absorbed  and 
can  travel  no  farther  into  surrounding  space.  This  limits  the  radius  of  the  emission  nebula,  or  H  II 
region,  to  the  Stromgren  radius  at  about  30  light-years.  (From  “The  Life  and  Death  of  Stars”  by 
Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 
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Table  11.3  Physical 
properties  of  emission 
nebulae  (H  II  regions) 


n  iq  _ o 

N  =  number  density  =  Ne  =  electron  density  =10-10  m 
Te  =  electron  temperature  =  10,000  K  =  104  K 
Rs  =  Stromgren  radius  =  0.65-326  light-years  =  0.2-100 
pc  ss  (0.6-300)  X  1016  m 

Teff  =  effective  temperature  of  ionizing  star,  spectral  class  09- 
05  =  (3  -  5)  x  104  K 


Because  hydrogen  is  by  far  the  most  abundant  element  in  interstellar  space,  as  it 
is  in  the  stars,  the  emission  nebulae  contain  large  amounts  of  ionized  hydrogen. 
They  contain  electrons  that  were  freed  from  their  atomic  bonds  during  ionization. 
An  electron  number  density,  Ne,  and  electron  temperature,  Te,  characterize  these 
free  electrons.  A  large,  extended  emission  nebula  might  have  an  electron  density 
of  Ne  =10  m  ,  while  a  compact  one  could  have  Ne  =  10  m  or  more 
(Table  11.3).  These  number  densities  can  be  significantly  greater  than  the  typical 

Z  _ Q 

density  of  interstellar  hydrogen  atoms  at  about  10  m  .  The  electron  temperature 
of  an  emission  nebula  is  Te  =  104  K,  the  temperature  required  to  ionize  its  con¬ 
tents  and  roughly  equal  to  the  disk  temperature  of  its  central  star. 

We  can  infer  the  density  of  an  emission  nebula,  or  H  II  region,  once  we 
understand  how  it  shines.  When  the  ultraviolet  radiation  of  a  hot  star  ionizes  a 
nearby  hydrogen  atom,  an  electron  breaks  free  of  its  former  atomic  bond  with  a 
proton.  The  electron  moves  away  at  high  speed  due  to  its  relatively  low  mass 
compared  to  that  of  the  proton,  but  when  the  free  electron  encounters  another 
proton,  the  two  particles  attract  one  another  by  their  opposite  electrical  charge,  and 
one  of  two  things  can  happen:  The  electron  can  continue  moving  into  surrounding 
space,  deflected  from  a  straight  path;  or  the  electron  can  be  captured,  joining 
together  with  the  proton  to  again  form  a  hydrogen  atom.  The  electron  emits 
radiation  in  either  situation  (Fig.  11.3),  and  the  intensity  of  the  radiation  increases 
with  the  number  of  free  electrons  participating  in  the  process.  When  a  free  electron 
is  captured  by  a  proton  to  make  a  hydrogen  atom,  the  electron  cascades  down 
through  the  ladder  of  possible  orbital  energies,  emitting  radiation  in  the  process. 
This  radiation  is  known  as  recombination  radiation ,  because  the  two  recombine  to 
make  an  atom;  it  also  is  called  free-bound  radiation  because  the  former  free 
electron  is  bound  once  again. 

We  have  encountered  the  allowed  electron  orbits  when  discussing  the  Bohr 
atom  and  the  Balmer  series  of  hydrogen  lines  (Sect.  6.2),  and  the  recombination 
transitions  are  between  those  electron  orbits,  designated  by  integer  n  =  1,  2,  3... 
and  m  =  1,  2,  3,....  The  alpha  transition,  designated  a,  is  for  m  —  n  =  1.  At 
visible  wavelengths,  the  most  intense  hydrogen  emission  line  is  the  Balmer  a 
transition  at  a  red  wavelength  of  656.28  nm  while  the  ultraviolet  a  transition  is 
known  as  Lyman  a  at  121.567  nm. 

Recombination  lines  are  also  detected  from  emission  nebulae,  or  H  II  regions, 
at  radio  frequencies,  for  the  cosmically  abundant  elements  of  hydrogen,  helium 
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Proton 


Fig.  11.3  Radiative  interactions  between  electrons  and  protons  When  an  electron  moves 
rapidly  and  freely  outside  an  atom,  it  inevitably  passes  near  a  proton  in  the  ambient  gas.  There  is 
an  electrical  attraction  between  the  electron  and  proton  because  they  have  equal  and  opposite 
charge  and  this  pulls  the  electron  toward  the  proton.  If  the  interaction  is  distant,  it  bends  the 
electron’s  trajectory  and  alters  its  speed.  The  electron  then  emits  electromagnetic  radiation  known 
as  bremsstrahlung  from  the  German  for  “braking  radiation”;  this  also  is  called  free-free 
radiation  because  the  electron  remains  free  and  unattached  to  the  proton.  In  a  close  encounter,  the 
electron  goes  into  orbit  around  the  proton,  forming  a  hydrogen  atom  and  cascading  down  through 
allowed  orbital  energies.  In  this  case,  the  electron  emits  recombination  radiation,  also  known  as 
free-bound  radiation.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by 
Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


and  carbon  (Fig.  11.4).  Initial  suggestions  and  pioneering  observations  of  radio 
recombination  lines  are  given  in  Lang  (1999). 

The  frequency  vmn  of  the  m  —  n  transition  when  recombining  to  an  ion  is  given  by: 


1 

Vmn  —  T 

h 


Em—E„  =  cR 


jn 


1  1 


YlA 


nr 


(11.3) 


where  denotes  the  absolute  value,  Em  designates  the  energy  of  the  mth  electron 
orbit,  and  the  wavelength  of  the  m  —  n  transition  is  given  by  fmn  =  c/vmn.  Here 

o  _ 1 

the  speed  of  light  c  =  2.9979  x  10  m  s  ,  the  Planck  constant  h  =  6.6261  x 

_ rlA 

10  J  s,  and  the  Rydberg  constant  R,  for  an  ion  is: 


Ri  =  Roo 


(11.4) 


where  Mt  is  the  mass  of  the  ion  and  the  Rydberg  constant,  is: 
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Fig.  11.4  Recombination  lines  Radio-frequency  spectrum  of  the  Orion  Nebula,  also  designated 
as  M  42,  showing  the  hydrogen,  H,  and  helium,  He,  recombination  lines  from  the  a  (m  —  n  =  1) 
and  p  (m  —  n  =  2)  transitions  at  high  quantum  numbers  n  =  109  and  n  =  137.  The  hydrogen 
lines  are  more  intense  than  the  helium  or  carbon  lines  because  of  the  greater  abundance  of 
hydrogen.  (Adapted  from  Lang  1979.) 


2n2mee4 

cs^h? 


1.09737316  x  107m_1, 


(11.5) 


_ Q  1 

for  an  electron  of  mass  me  =  9.1094  x  10  kg  and  elementary  charge 
e  =  1.6022  x  10“ 19  C,  where  £0  =  8.8542  x  10“ 12  F  m  \  the  electric  constant. 

For  hydrogen,  the  ion  is  a  proton  with  a  mass  Mt  =  mp  =  1,836.153  me,  and  the 
frequency  of  the  m  -  n  transition  is  given  by: 


vmn  «  3.28805  x  1015 


(11.6) 


Example:  What  is  the  recombination  line  frequency  of  the  H  109a 
transition? 

For  the  quantum  number  n  =  109  and  the  a  transition  with  m  —  n  =  1,  the 
frequency,  v,  of  the  hydrogen  recombination  line,  denoted  H  109a,  is: 


V  =  3.28805  x  1015 


«  5.0089  x  109  Hz  ss  5008.9MHz. 


(11.7) 


This  line  has  been  detected  from  many  emission  nebulae,  or  H  II  regions, 
such  as  the  Orion  Nebula  (Fig.  11.4). 


When  the  electron  continues  on  its  way,  it  has  changed  speed  during  the 
encounter  with  the  proton,  and  emitted  electromagnetic  radiation  in  the  process. 
This  radiation  is  known  as  bremsstrahlung ,  from  the  German  word  for  “braking 
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radiation”;  it  also  is  called  free-free  radiation  because  the  electron  begins  and 
remains  free.  It  is  emitted  at  all  wavelengths,  but  at  different  intensities,  and  is 
therefore  known  as  continuum  radiation. 

The  free-free  radiation,  or  thermal  bremsstrahlung,  from  emission  nebulae,  or 
H  II  regions,  is  also  detected  at  radio  frequencies.  The  ionized  gas  within  the  H  II 
region  is  in  thermal  equilibrium,  characterized  by  a  single  temperature  and  a 
blackbody  brightness  distribution  given  by  the  Rayleigh-Jeans  law  at  radio  fre¬ 
quencies  (Sect.  2.4).  The  ratio  of  the  emission  and  absorption  coefficients  is  equal 
to  this  brightness  (Sect.  2.7).  When  the  absorption  coefficient  is  integrated  over  the 
line  of  sight  through  the  H  II  region  we  obtain  an  optical  depth,  tv,  given  by: 

tv  «  3.28  x  10 -7t;135v~2AEM  (11.8) 

where  the  electron  temperature,  Te,  is  in  units  of  104  K,  the  frequency,  v,  is  in  units 
of  GHz  or  10  Hz,  and  EM  denotes  the  emission  measure  in  parsecs  cm  .  It  is  the 
integral  of  the  square  of  the  electron  density,  Ne ,  over  the  line  of  sight,  with 
EM  &  2 Ne  Rs  for  a  region  of  radius  Rs.  The  optical  depth  becomes  unity  at  a 
frequency  v  of  about  109  Hz.  For  lower  frequencies  the  optical  depth  is  large,  and 
the  brightness  depends  only  on  the  electron  temperature.  At  higher  frequencies,  the 
brightness  also  depends  on  the  emission  measure,  or  on  the  square  of  the  electron 
density  and  the  size  of  the  H  II  region. 

The  detailed  calculations  are  complex  and  have  a  long  history.  The  relevant 
formulae  and  pioneering  references  for  the  emission  coefficients  and  absorption 
coefficients  for  recombination  radiation  and  thermal  bremsstrahlung  are  given  in 
Lang  (1999).  In  our  introductory  treatment,  we  only  need  to  realize  that  the  total 
emission  depends  on  the  square  of  the  electron  density  and  the  size  of  the  H  II 
region,  and  this  brings  us  to  estimates  for  their  size. 

As  shown  by  the  Danish  astronomer  Bengt  Stromgren  (1908-1987),  the  central 
star  creates  a  sphere  of  ionized  gas  that  envelops  it  (Stromgren  1939).  He  found 
that  although  the  interstellar  hydrogen  usually  is  electrically  neutral,  or  un-ionized, 
the  very  hot  O  and  B  stars  could  generate  enormous  but  sharply  bounded  spheres 
of  ionization  (Fig.  11.2).  For  some  years,  these  regions  were  referred  to  as 
“Stromgren  spheres,”  and  their  size  still  is  designated  as  the  Stromgren  radius, 
which  depends  on  the  temperature  of  the  star  and  the  density  of  the  surrounding 
material.  The  hotter  and  more  luminous  the  exciting  star,  the  larger  the  Stromgren 
radius  becomes,  and  the  denser  the  surrounding  interstellar  hydrogen  gas,  the 
smaller  the  Stromgren  radius. 

For  typical  interstellar  hydrogen  densities  of  10  million-20  billion  atoms  per 
cubic  meter,  the  Stromgren  radius  ranges  from  30  to  300  light-years,  or  roughly 
1-100  parsecs,  for  central  stars  with  disk  temperatures  between  26,000  and 
48,000  K. 

The  Stromgren  radius,  Rs,  depends  on  the  rate,  S ,  of  emission  of  the  ionizing 
radiation  from  the  exciting  star  and  the  atomic  number  density,  A,  of  the  gas 
surrounding  it.  Here  S  is  the  total  number  of  ionizing  photons  that  leave  the  star. 
An  ionizing  photon  is  one  with  a  wavelength  short  enough  to  ionize  hydrogen,  or 
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at  a  wavelength  less  than  91.8  nm.  Under  steady  state  conditions,  the  total  number 
of  hydrogen  recombinations  per  unit  time  inside  the  Stromgren  sphere  must  equal 
the  total  number  of  photoionizations  per  unit  time,  which  means  that 

S  =  ^nR3soc(T)N2 ,  (11.9) 

or: 

/  3  V  X1/3 

k-=(^w)  ■ 

where  we  assume  the  surrounding  hydrogen  is  fully  ionized  within  the  Stromgren 
sphere,  where  N  =  Ne,  the  electron  density,  for  a  fully  ionized  gas,  and  the  tem¬ 
perature-dependent  recombination  coefficient,  a(7),  of  hydrogen  to  all  its  excited 
states  except  the  ground  state  can  be  calculated  from  quantum  mechanics  and  has 
the  value  of: 


a(r)  «  2.6  x  lo- 16re“a75m3  s— i,  (li.ii) 

for  T  =  Te  the  electron  temperature.  For  a  typical  equilibrium  temperature  of 
Te  =  104  K  of  an  emission  nebula  (Osterbrock  1965),  we  have 
a(T)  =  2.6  x  10  19  m3  s"1,  and  S  =  3  x  1049  s-1  for  an  05  V  star  with  an 
effective  temperature  of  48,000  K. 

The  number  density  of  the  fully  ionized  gas  surrounding  the  emission  nebula  is 

7  _ Q  i  m  _ Q 

N  =  Ne  =  Np  =10  m  -10  m  ,  with  a  respective  Stromgren  radius 
Rs  =  21  —  0.2  pc,  where  1  pc  =  1  parsec  =  3.085678  x  1016  m.  A  lower  den¬ 
sity  corresponds  to  a  larger  Stromgren  radius,  since  the  ionizing  radiation  can 
travel  farther  from  the  central  star  before  becoming  fully  absorbed.  The  mass,  M, 
of  the  emission  nebula,  or  H  II  region,  is  given  by: 

A-tc 

M  =  —mPNeR3s,  (11.12) 

_ O  -7 

where  the  proton  mass  mP  =  1.6726  x  10  kg  and  the  proton  density  is  equal  to 
the  electron  density.  For  a  Stromgren  radius  of  Rs  =  1  pc  and  an  electron  density 
of  Ne  =  109  m-3,  the  mass  is  M  «  2  x  1032  kg  «  100  M0,  where  the  Sun’s 
mass  Mq  =  1.989  x  103°  kg. 

Example:  Temperature,  extent,  and  mass  of  an  emission  nebula  or  H  II 
region 

A  star  of  spectral  class  05  V  has  an  effective  Teff  ^  0.5  x  105  K  for  its 

visible  disk,  and  it  can  ionize  the  surrounding  hydrogen  to  a  temperature  of 

T  =  Te  ~  104  K,  where  Te  denotes  the  electron  temperature,  emitting 

S  =  3  x  1049  ionizing  photons  every  second.  If  the  number  density  of  the 

surrounding  gas  N  =  Ne  =  10  m  ,  where  Ne  denotes  the  electron  density, 

1  8 

the  05  V  star  would  have  the  Stromgren  radius  of  Rs  =  0.6  x  10iO  m 
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=  21  pc  =  68.7  light  years,  where  1  pc  =  3.085  x  1016  m  =  3.26  light- 
years  is  the  typical  spacing  between  adjacent  stars.  For  an  electron  density  of 

10  Q 

Ne  =  10  u  m  ,  we  obtain  a  Stromgren  radius  of  Rs  ~  0.617  x 
1016  m  «  0.2  pc. 

The  mass,  M,  of  the  H  II  region  is  given  by  M  =  AnmPNeRs /3,  where  the 
proton  mass  mp  =  1.6726  x  10  “  kg,  the  proton  density  is  equal  to  the 
electron  density  Ne ,  and  the  protons  dominate  the  mass  rather  than  lighter 
electrons.  For  Ne  =  107  m-3  we  obtain  M  «  1.92  x  1034  kg  «  104  M0, 
where  the  Sun’s  mass  M0  =  1.989  x  1030  kg,  and  for  Ne  =  1010  m-3  the 
mass  of  the  H  II  regions  is  M  «  1.6  x  10  kg  ^  8  M0.  The  higher  the 
electron  density,  the  lower  the  total  mass,  which  is  in  inverse  proportion  to  that 
density. 


11.2  Solid  Dust  Particles  in  Interstellar  Space 

Vast,  dark  regions  provide  contrast  to  the  bright  light  of  luminous  stars  (Fig.  11.5). 
The  blackness  initially  suggested  emptiness  -  literally,  holes  in  the  sky.  The  dark 
regions  are  instead  filled  with  interstellar  dust  particles  that  absorb  and  scatter  the 
light  from  stars  that  lie  behind  them. 

Salpeter  (1977)  has  reviewed  the  formation  and  destruction  of  dust  grains.  Savage 
and  Mathis  (1979)  provided  an  early  review  of  the  properties  of  interstellar  dust,  and 
Mathis  (1990)  has  discussed  interstellar  dust  and  extinction.  Draine  (2003)  provided 
a  review  of  interstellar  dust  grains;  Mann  (2010)  reviewed  interstellar  dust  in  the 
solar  system;  and  Stein  and  Soifer  (1983)  described  dust  in  galaxies. 

It  was  the  American  astronomer  Edward  E.  Barnard  (1857-1923)  who  con¬ 
tributed  most  to  our  early  awareness  of  these  dark  and  bright  regions  of  the  Milky 
Way.  His  systematic  photographic  survey  spanned  30  years,  first  at  the  Lick 
Observatory  on  remote  Mount  Hamilton  in  California  and  then  at  the  Yerkes 
Observatory  near  Chicago,  culminating  in  two  stunning  catalogues  of  the  regions, 
which  he  noncommittally  called  dark  markings  (Barnard  1919,  1927).  Their 
nebulous  form  and  shape  suggested  to  Barnard  that  the  dark  places  were  not 
empty;  in  some  cases,  they  even  seemed  to  interact  with  the  bright  regions  that 
enfolded  them. 

Whereas  Barnard  was  interested  chiefly  in  the  peculiar  shapes  of  the  dark 
regions,  the  German  astronomer  Maximillian  Wolf  (1863-1932)  was  concerned 
with  measuring  their  distances  and  absorbing  powers.  By  counting  stars  in  an 
obscured  and  an  adjacent  unobscured  region,  Wolf  was  able  to  demonstrate  that 
the  dark  areas  absorb  the  light  of  distant  stars  (Wolf  1923). 

Because  Wolf  could  not  detect  any  substantial  difference  in  the  colors  of  stars 
that  lie  outside  of  and  behind  the  dark  nebulae,  he  concluded  that  the  dark  regions 
must  be  composed  of  solid  dust  particles;  their  scattering  properties  depend  weakly 
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Fig.  11.5  Dark  dust  and 
bright  gas  Interstellar  dust 
blocks  the  light  of  distant 
stars,  while  bright  young  stars 
illuminate  nearby  gas  in  this 
part  of  the  North  America 
Nebula,  designated  NGC 
7,000.  The  infrared  detectors 
aboard  the  Spitzer  Space 
Telescope  have  penetrated  the 
dark  clouds  pictured  here  in 
optically  visible  light, 
viewing  young  stars  in  many 
stages  of  formation,  including 
gas  and  dust  cocoons,  disks, 
and  jets.  The  North  America 
Nebula  is  about  1,500  light- 
years  from  Earth  and  spans 
about  50  light-years. 
(Courtesy  of  Karl- 
Schwarzschild 

Observatorium,  Tautenburg.) 


on  wavelength,  unlike  gas  atoms  that  scatter  light  much  more  effectively  at  shorter 
wavelengths  (Sect.  2.7). 

The  interstellar  absorption  of  starlight  was  demonstrated  by  Robert  J.  Trumpler’ s 
(1886-1956)  ten-year  study  of  open  star  clusters  at  the  Lick  Observatory  (Trumpler 
1930).  The  Swiss-born  astronomer  used  the  colors  of  the  brightest  stars  in  each  open 
cluster  to  infer  their  luminosity,  or  absolute  magnitude,  which  he  combined  with  their 
observed  brightness,  or  apparent  magnitude,  to  determine  their  distances.  However, 
when  these  distance  estimates  were  combined  with  the  measured  angular  diameters 
of  the  open  clusters  to  infer  their  size,  he  found  that  the  linear  diameters  increase  with 
distance  -  and  the  difference  was  not  trivial.  In  whatever  direction  he  looked,  more 
remote  clusters  seemed  to  be  about  twice  as  large  as  the  closer  ones.  Moreover,  the 
effect  was  systematic.  The  farther  away  a  cluster  was,  the  larger  it  appeared  to  be. 

Concluding  that  this  pervasive,  systematic  change  in  physical  size  was 
impossible,  Trumpler  instead  assumed  that  all  open  clusters  actually  have  the  same 
linear  diameters  or  physical  extent.  This  meant  that  the  initial  distance  estimates 
were  overestimated  due  to  the  absorption  of  starlight  by  an  amount  that  increases 
with  distance.  The  greater  the  distance,  the  more  the  absorption,  making  the 
remoter  clusters  look  systematically  fainter  and  even  farther  away  than  their  actual 
distance. 
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The  removal  of  short-wavelength,  blue-colored  light  that  occurs  when  starlight 
passes  through  dust  is  known  as  reddening ,  because  it  makes  the  observed  star 
redder  than  it  would  be  without  the  intervening  dust  (Schild  1977).  When  radiation 
passes  through  a  longer  path  containing  a  greater  density  of  interstellar  dust,  the 
amount  of  reddening  increases,  making  more  distant  stars  appear  redder. 

The  unfolding  story  of  interstellar  dust  took  an  unexpected  turn  when  two 
American  astronomers,  William  A.  Hiltner  (1914-1991)  and  John  Scoville  Hall 
(1908-1991),  independently  observed  highly  polarized  light  from  reddened  stars 
(Hiltner  1949;  Hall  1949).  The  polarization  was  oriented  along  a  common  plane 
due  to  an  alignment  of  the  elongated  dust  particles  by  the  interstellar  magnetic 
field  (Davis  and  Greenstein  1951)  -  (Fig.  11.6).  A  magnetic  field  strength  of 
roughly  10“ 10  tesla  is  required  to  align  the  interstellar  dust,  and  the  Zeeman  effect 
from  such  an  interstellar  magnetic  field  was  eventually  observed. 

What  is  this  interstellar  dust?  The  particles  must  be  smaller  than  1  /  10,000th  of 
a  meter,  or  10-4  m,  across  or  they  would  completely  block  starlight  and  not  scatter 
it;  also  they  must  be  larger  than  gas  molecules,  the  scattering  of  which  depends 
strongly  on  wavelength  (Oort  and  Van  de  Hulst  1946;  Van  de  Hulst  1949).  To 
weakly  absorb  and  scatter  starlight,  interstellar  dust  particles  must  be  comparable 
in  size  to  the  wavelength  of  visible  light,  or  roughly  10  to  10  m  across,  which 
is  about  the  same  size  as  the  particles  of  cigarette  smoke  or  the  dust  motes  seen 
when  we  look  back  toward  a  movie  projector. 

Cosmic  dust  absorbs  and  scatters  starlight,  which  means  that  interstellar  space 
is  not  transparent,  and  the  combined  effects  of  scattering  and  absorption  of  star¬ 
light  by  interstellar  dust  is  called  extinction.  The  observed  intensity  of  radiation,  A, 
at  wavelength,  2,  is  given  by: 


h  =  ho  exp(-TA)  (11.13) 

where  LAo  is  the  intensity  that  would  be  received  at  the  Earth  in  the  absence  of 
interstellar  extinction  and  ta  is  the  optical  depth  at  the  observed  wavelength.  The 
approximate  interstellar  extinction  in  the  optical,  or  visible  wavelength,  region 

varies  as  2_1  with  an  optical  depth: 

T  X=J,  (11.14) 

where  the  constant  C  depends  on  the  star  (Whitford  1958;  Savage  and  Mathis  1979). 

The  shape  of  the  curve  of  starlight  extinction  as  a  function  of  wavelength 
provides  information  on  the  size  and  composition  of  the  interstellar  dust  grains; 
when  observations  were  extended  to  ultraviolet  and  infrared  wavelengths,  more 
mysteries  arose.  The  amount  of  extinction  increases  in  a  “bump”  centered  at  an 
ultraviolet  wavelength  of  about  217.5  nm;  it  may  be  caused  by  graphite. 

Ice,  carbon,  and  silicates,  or  some  combination  of  these,  are  most  likely  the 
principal  ingredients  of  interstellar  dust.  Most  of  the  solid  silicate  dust  found  in 
interstellar  space  probably  came  from  the  cool  expanding  outer  envelopes  of 
evolved  giant  stars  that  are  rich  in  oxygen  or  carbon;  the  outer  atmospheres  of 
these  stars  exhibit  the  absorption  bands  of  silicates. 


11.2  Solid  Dust  Particles  in  Interstellar  Space 
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Fig.  11.6  Polarized  starlight  The  light  from  nearly  7,000  stars  is  polarized,  with  the  strength 
and  direction  of  polarization  designated  by  the  short  lines.  The  observations  are  plotted  in  galactic 
coordinates  where  the  galactic  plane,  or  the  Milky  Way,  runs  horizontally  across  the  middle  of 
the  figure.  The  starlight  polarization  has  been  attributed  to  dust  grains  elongated  along  the 
interstellar  magnetic  field.  (Courtesy  of  D.  S.  Mathewson.) 


Dust  produces  strong  infrared  emission  that  is  studied  using  instruments  aboard 
the  Spitzer  Space  Telescope.  Interstellar  dust  is  measured  from  the  emission  at 
long  infrared  wavelengths,  while  dust  that  is  being  formed  in  old,  evolved  stars  is 
observed  using  medium-infrared  wavelengths. 


11.3  Radio  Emission  from  the  Milky  Way 

The  American  radio  engineer  Karl  Jansky  (1905-1950)  inadvertently  discovered 
radio  noise  of  cosmic  origin  in  the  early  1930s,  when  radio  waves  were  being  used 
extensively  for  global  communications.  At  that  time,  the  Bell  Telephone  Labo¬ 
ratories  assigned  Jansky  the  task  of  tracking  down  and  identifying  natural  sources 
of  radio  noise  that  were  interfering  with  ship-to-shore  radio  communications  at  a 
wavelength  of  14.6  m.  He  constructed  a  rotating  antenna  that  pointed  sideways  at 
the  horizon  and  enabled  identification  of  the  interference,  including  the  radio  static 
produced  by  lightning  discharges  from  distant  thunderstorms. 

Fortunately,  the  antenna’ s  wide  field  of  view,  of  25°,  also  pointed  part  way  up  into 
the  sky;  therefore  it  could  detect  a  persistent  extraterrestrial  hiss  of  unknown  origin 
that  was  comparable  in  intensity  to  terrestrial  lightning.  By  observing  the  variation  of 
its  intensity  as  a  function  of  direction  and  time  of  arrival,  Jansky  established  that  the 
radio  source  must  lie  outside  the  solar  system  (Jansky  1933a,  b,  1935). 

The  astronomical  community  almost  completely  ignored  Jansky’s  results,  most 
likely  because  he  did  not  publish  them  in  an  astronomical  journal  and  his  radio 
techniques  were  outside  the  conventional  methods  of  traditional  astronomy.  It  was 
not  until  a  decade  later  that  amateur  astronomer  and  radio  engineer  Grote  Reber 
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Fig.  11.7  Spectrum  of  thermal  and  nonthermal  radiation  When  the  intensity  of  thermal 
radiation  is  spread  out  as  a  function  of  wavelength,  the  resultant  spectrum  is  most  intense  in  a 
band  of  wavelengths  that  depends  on  the  temperature.  This  peak  occurs  at  visible  wavelengths 
when  the  temperature  is  about  6,000  K.  Such  a  hot  gas  emits  radiation  at  longer  wavelengths  but 
at  a  lower  intensity.  In  contrast,  nonthermal  radiation  is  more  intense  at  longer  radio  wavelengths. 
High-speed  electrons  emit  nonthermal  synchrotron  radiation  in  the  presence  of  a  magnetic  held 
(see  Fig.  11.8) 


(1911-2002)  confirmed  and  extended  Jansky’s  investigations;  he  published  these 
new  results  on  the  “cosmic  static”  in  the  Astrophysical  Journal  (Reber  1944). 

Because  the  radio  emission  of  even  the  nearby  Sun  could  not  be  detected  with 
Jansky’s  antenna,  normal  stars  were  ruled  out  as  a  possible  source  of  the  cosmic 
radio  emission.  Their  thermal  radio  emission  would  be  exceptionally  faint,  when 
compared  to  that  at  visible  wavelengths,  and  further  diluted  by  the  greater  dis¬ 
tance.  Some  nonthermal  source  of  radiation  was  required  that  emitted  most 
intensely  at  the  longer  radio  wavelengths  (Fig.  11.7). 

As  it  turned  out,  the  extraterrestrial  radio  signals  were  not  coming  from  stars  but 
instead  from  interstellar  space.  The  cosmic  emission  is  attributed  to  energetic 
electrons  traveling  at  a  speed  close  to  that  of  light,  and  spiraling  about  the  inter¬ 
stellar  magnetic  field.  The  energy  of  the  high-speed  electrons  is  comparable  to  that 
of  the  cosmic-ray  electrons  entering  the  Earth’s  atmosphere  (Sect.  7.5),  with  an 
energy  E  109  eV  with  1  eV  =  1.6022  x  10  19  J.  The  electrons  are  known  as 
relativistic  electrons  because  the  equations  of  relativity  apply  at  such  high  speeds. 
The  speed  of  the  electrons  is  so  great  that  their  kinetic  energy  of  motion  cannot  be 
attributed  to  the  thermal  energy  of  a  hot  gas  at  any  plausible  temperature. 

Example:  Nonthermal  velocities 

The  thermal  velocity,  Vthermab  of  an  electron  of  mass,  me ,  at  temperature,  Te,  is 

obtained  by  equating  its  kinetic  energy  meV  / 2  to  the  thermal  energy  3kTe/2, 

_ Q  1 

where  the  electron  mass  me  =  9.1094  x  10  kg  and  the  Boltzmann  constant 
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OQ  1 

k  =  1.38065  x  10  ^  J  K  .If  the  electron  is  moving  at  the  speed  of  light, 

o  _ 1 

with  Vthermal  =  V  =  c  =  2.9979  x  10  m  s  ,  then  the  suggested  electron 

o  o 

temperature  would  be  Te  =  mec  / ( 3k )  ^2  x  10  K.  And  a  proton  moving  at 
that  speed  would  imply  a  proton  temperature  Tp  =  mpTe/me  «  3.6  x  10  K, 
where  mp/me  =  1836  is  the  ratio  of  the  proton  mass  to  the  electron  mass. 
Cosmic  objects  in  thermal  equilibrium,  even  at  the  center  of  the  Sun  where  the 

^ 7 

temperature  is  1.5  x  10  K,  cannot  plausibly  obtain  such  high  temperatures. 
The  radiation  emitted  by  electrons  moving  at  nearly  the  speed  of  light  is 
therefore  known  as  nonthermal  radiation. 


A  moving  charged  particle  cannot  move  straight  across  a  magnetic  field,  but 
instead  gyrates  around  it  (Focus  11.1),  so  the  high-speed  electrons  move  away 
from,  or  around,  the  magnetism. 


Focus  11.1  Charged  particles  gyrate  around  magnetic  fields 

If  a  charged  particle  approaches  a  magnetic  field  in  the  perpendicular 
direction,  a  magnetic  force  pulls  it  into  a  circular  motion  about  the  magnetic 
field  line.  Since  the  particle  can  move  freely  in  the  direction  of  the  magnetic 
field,  it  spirals  around  it  with  a  helical  trajectory.  The  English  scientist 
Oliver  Heaviside  (1850-1925)  first  derived  the  radius,  Rg ,  of  this  circular 
gyration  (Heaviside  1904).  It  is  given  by: 


~m~ 

v± 

IZel 

B 

(11.15) 


where  V±,  is  the  velocity  of  the  particle  in  the  perpendicular  direction,  B  is 
the  magnetic  field  strength,  and  m  and  Ze  respectively  denote  the  mass  and 
charge  of  the  particle.  Thus,  a  stronger  magnetic  field  tightens  the  gyration 
into  smaller  coils,  and  faster  particles  will  gyrate  in  larger  circles. 

_ Q  1 

For  an  electron  with  mass  me  =  9.1094  x  10  kg  and  charge 
e  =  1.602  x  10  19  C,  with  Z  =  1.0,  the  corresponding  gyration  radius  is: 


Rg (electron)  «  5.7  x  10  n[V±/B]  m, 


(11.16) 


where  is  in  m  s  1  and  B  is  in  tesla,  and  for  a  proton  it  is: 


Rg  (proton)  «  1.05  x  10  8[V±/B 


m. 


(11.17) 


where  the  mass  of  the  protons  is  mp  =  1,836  me,  and  the  charge  of  the  proton 
is  the  same  as  that  of  the  electron,  just  opposite  in  sign. 

The  angular  gyrofrequency,  denoted  oog  and  also  called  the  angular 
cyclotron  frequency,  is  given  by: 


372 


1 1  The  Material  Between  the  Stars 


V±  _  ZeB 
8  2nRg  2nm  ’ 


(11.18) 


and  the  gyrofrequency ,  vg,  also  called  the  cyclotron  frequency,  is  given  by 
vg  =  C0g/(2n).  For  an  electron  the  gyrofrequency  is: 


Vo  =  — —  = - «  2.8  x  1010S  Hz,  (11.19) 

g  2  %  2  mne  v  ’ 

with  a  magnetic  field  strength  B  in  tesla. 

These  expressions  only  apply  if  the  velocity  is  not  close  to  the  velocity  of 
light,  c.  At  high  particle  velocities  approaching  that  of  light,  the  radius 
equation  is  multiplied  by  a  Lorentz  factor  y  =  [1  —  (V/c)2]-1^2  and  the 
gyrofrequency  expression  is  divided  by  y,  owing  to  the  increase  in  mass  from 
a  rest  mass  m  to  ym ,  which  becomes  unimportant  at  low  velocities  when 

7=1- 

The  frequency,  vs,  of  synchrotron  radiation  from  high-speed  electrons  is 
amplified  by  an  additional  factor  of  y3  so  vs  =  y1  vg  (see  subsequent  text). 


High-speed  electrons  spiral  about  the  interstellar  magnetic  field  and  emit 
nonthermal  synchrotron  radiation  at  radio  wavelengths  (Fig.  11.8).  The  name  is 
derived  from  General  Electric  Company’s  synchrotron  particle  accelerator  where 
the  linearly  polarized  light  was  first  seen  (Elder  et  al.  1947).  Theoretical  studies  by 
Schott  (1912)  indicated  that  a  charged  particle  moving  in  the  presence  of  a 
magnetic  field  would  emit  radiation,  and  the  discovery  of  synchrotron  radiation  in 
a  terrestrial  particle  accelerator  stimulated  further  theoretical  treatments  of  this 
type  of  non- thermal  emission  (Schwinger  1949). 

Ginzburg  (1956)  and  Ginzburg  and  Syrovatskii  (1965)  have  demonstrated  how 
synchrotron  radiation  can  explain  the  radio  emission  of  supernova  remnants  and 
galaxies.  Nonthermal  synchrotron  radiation  is  more  intense  at  longer  wavelengths. 
More  energetic  electrons  have  a  shorter  lifetime,  expending  their  energy  by  syn¬ 
chrotron  radiation  at  a  greater  rate  than  less  energetic  electrons.  High-energy 
electrons  also  emit  synchrotron  radiation  at  a  higher  frequency  or  shorter  wave¬ 
length  than  low-energy  electrons.  This  accounts  for  the  nonthermal  radiation 
spectrum  in  which  the  source  of  synchrotron  radiation  is  most  intense  at  longer 
rather  than  the  shorter  wavelengths,  since  the  high-energy  electrons,  that  emit 
short- wavelength  radiation,  will  be  the  first  to  radiate  their  energy  away. 

Synchrotron  radiation  is  the  non-thermal  emission  of  a  high-speed,  relativistic 
electron,  of  energy,  E ,  spiraling  about  a  magnetic  field  of  strength,  B.  By  rela- 

o  _ 1 

tivistic,  we  mean  moving  near  the  speed  of  light,  c  =  2.9979  x  10  m  s  .  Such 
nonthermal  radiation  is  more  intense  at  longer  wavelengths,  2.  When  the  electron 
moves  at  high  velocities  near  the  speed  of  light,  the  radiation  of  a  slow-moving, 
non-relativistic  electron  is  modified  by  the  Lorentz  factor,  y,  given  by: 
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Fig.  11.8  Synchrotron  radiation  Electrons  moving  at  velocities  near  that  of  light  emit  a  narrow 
beam  of  synchrotron  radiation  as  they  spiral  around  a  magnetic  held.  This  emission  sometimes  is 
called  nonthermal  radiation  because  the  electron  speeds  are  much  greater  than  those  of  thermal 
motion  at  any  plausible  temperature.  The  name  synchrotron  refers  to  the  manmade  ring-shaped 
synchrotron  particle  accelerator  where  this  type  of  radiation  was  first  observed;  a  synchronous 
mechanism  keeps  the  particles  in  step  with  the  acceleration  as  they  circulate  in  the  ring 


(11.20) 


for  an  electron  moving  at  a  velocity,  V,  with  energy,  E,  and  an  electron  mass  m  =  yme 

_ Q  1  O 

with  a  rest-mass  me  =  9.1094  x  10  kg,  or  non-moving,  energy  mec  given  by 

mec 2  =  8.187  x  10~ 14  J  =  5.11  x  10"4  GeV  =  0.51 1  MeV,  (11.21) 

where  1  GeV  =  1.6022  x  10— 10  J  =  103  MeV.  High-speed  electrons  produce 
synchrotron  radiation  at  a  frequency  given  by: 


V±y2  eBy 2 

2nRg  2nme 


2.8  x  1010  Z?  y2  Hz, 


(11.22) 


where  vg  is  the  gyrofrequency,  V±  is  the  velocity  of  the  electron  in  the  perpen¬ 
dicular  to  the  magnetic  field  direction,  Rg  is  the  gyroradius,  the  magnetic  field 
strength,  B ,  is  in  tesla,  and  the  gyrofrequency,  vg,  of  a  non-relativistic,  slow-speed 
electron  moving  in  a  circle  about  a  magnetic  field  of  strength  B  is  given  by 
Vg  =  eB/(2nme )  =  2.8  x  1010  B  Hz  with  electron  charge  e  =  1.6022  x  10“ 19 
coulombs  and  electron  mass  me  =  9.109  x  10-31  kg. 

The  average  energy  loss  rate,  or  luminosity  Ls ,  of  a  single  electron  by  syn¬ 
chrotron  radiation  is  given  by: 


4  (  eA  \  B2 

3  \6m^cAm2e)  \cy  2 /i0 


•> 


(11.23) 
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or  equivalently 


4 

-cjtcUb 


(11.24) 


where  the  electron  velocity  V  «  c,  the  Thomson  scattering  cross  section,  aT,  is 
given  by  (Sect.  2.7): 

871  9  871  /  e 2  \2  _9Q  9  .  . 

«r  =  Tr,=T(^5-i—jj  «  6.65246  x  10-  m,  (11.25) 

and  the  energy  density  of  the  magnetic  field  is: 

UB=^~,  (11.26) 

z/t0 

_ s  _ 1 

where  the  magnetic  constant  /i0  =  1.2566  x  10"°  N  A  . 

Collecting  terms,  evaluating  the  constants,  and  noting  V/c  «  1,  the 
luminosity,  Ls ,  of  the  synchrotron  radiation  of  a  single  electron  is  given  by: 


2d2 


Ls  = 


4aTcy  B 
6^o 


10“1452y2 


(11.27) 


The  lifetime,  tv,  for  a  relativistic  electron  against  synchrotron  radiation  is 
given  by: 


E  ymec 2  8.187 

To  =  -  =  -  =  - T -  S, 

Ls  Ls  B2y 


(11.28) 


2  _ i/L 

where  the  rest  mass  energy  of  the  electron  is  mec  =  8.187  x  10  J.  Electrons 
which  have  a  higher  Lorentz  factor  y,  have  a  shorter  lifetime,  and  because  such 
electrons  radiate  at  higher  frequencies  and  shorter  wavelengths  than  those  with 
lower  y,  it  means  that  there  is  less  synchrotron  emission  at  shorter  wavelengths  and 
more  at  longer  wavelengths,  which  explains  its  nonthermal  spectrum. 

The  total  synchrotron  luminosity,  Ltot,  for  a  source  of  radius,  R ,  and  relativistic 
(high-speed)  electron  density,  Ne,  is: 

L,0,  =Lr3N(,Ls.  (11.29) 


Example:  Synchrotron  radiation  of  high-speed  electrons  in  the  Milky 
Way 

Cosmic  ray  electrons  that  enter  the  Earth’s  atmosphere  have  an  energy 
E  «  1  GeV,  where  1  GeV  =  1.6022  x  10-10  J.  The  cosmic-ray  electrons 
have  a  lower  flux  than  cosmic  ray  protons,  with  comparable  energies,  but  it  is 
the  electrons  that  give  rise  to  synchrotron  radiation,  not  the  protons.  The 
Lorentz  factor  for  an  electron  with  this  energy  is  y  =  E/mec  =  1.96  x  10  , 
where  the  rest  mass  energy  of  the  electron  is  mec  =  0.511  MeV.  If  an 
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electron  with  an  energy  similar  to  those  of  cosmic  rays  encountered  an  inter¬ 
stellar  magnetic  field,  of  strength  B  =  10“ 10  tesla,  the  frequency  of  the  syn¬ 
chrotron  radiation  from  the  electron  will  be  =  2.8  x  1010  B  y  107 
Hz  «  10  MHz,  and  the  wavelength  of  the  radiation  is  2  =  c/vs  ~  30  m, 

o  _ 1 

where  the  speed  of  light  c  =  2.9979  x  10  m  s  .  So  these  electrons  give  rise 
to  synchrotron  radiation  at  radio  wavelengths.  For  one  electron,  the  synchro¬ 
tron  luminosity  =  10-14  B2y2  J  s-1  ~  3.84  x  10-28  J  s_1,  and  its  lifetime 
for  expending  all  its  energy  by  synchrotron  radiation  is 
ts  =  E / Ls  ~  0.42  x  10  s  ^  13  billion  years,  which  is  comparable  to  the 

^ 7 

age  of  the  observable  universe,  where  1  year  =  3.1557  x  10  s.  However, 
electrons  that  emit  synchrotron  radiation  at  higher  frequencies  and  shorter 
wavelengths  have  a  much  shorter  lifetime.  To  produce  a  synchrotron  lumi¬ 
nosity  at  radio  wavelengths  comparable  to  the  solar  luminosity  in  visible  light 

oz  _ 1 

L0  =  3.828  x  10^  J  s  in  a  volume  whose  radius  R  is  comparable  to  the 
distance  between  adjacent  stars,  or  R  =  1  pc  =  3.0857  x  10  m,  the  number 
density  Ne  of  the  electrons  would  be  obtained  from  L©  =  4nR  NeLs/3,  or 
Ne  =  8.15  x  10  m  and,  for  example,  about  100  times  less  than  the  density 
of  interstellar  hydrogen  atoms. 


11.4  Interstellar  Hydrogen  Atoms 

Only  rare,  hot,  luminous  stars  can  ionize  nearby  hydrogen,  and  we  wonder  what  is 
in  the  vast  dark  places  outside  them,  other  than  the  dust  already  discussed.  Stars 
still  are  forming  out  of  something  and  because  the  stars  are  composed  mostly  of 
hydrogen,  there  should  be  hydrogen  atoms  within  interstellar  space.  However,  it  is 
so  cold  out  there  that  we  cannot  detect  any  atoms  at  visual  wavelengths.  None  of 
the  atoms  are  ionized  and  the  electrons  are  all  in  their  lowest  energy  ground  state, 
unable  to  change  their  atomic  orbits. 

But  the  spin,  or  rotation,  of  the  electron  and  proton  in  the  hydrogen  atom  may 
be  oriented  in  the  same  direction  or  opposite  directions,  or  in  parallel  and  anti¬ 
parallel  configurations.  In  technical  terms,  the  lowest  orbital  energy  state  of  atomic 
hydrogen  has  a  hyperfine  splitting  arising  from  the  spins  of  the  proton  and  electron 
changing  from  a  parallel  to  anti-parallel  configuration.  This  transition  is  highly 
forbidden  with  an  extremely  small  probability,  but  when  it  occurs,  radiation  is 
released  with  a  photon  energy  equal  to  the  energy  difference  between  these  two 

_ z 

levels,  about  6  x  10  eV,  which  corresponds  with  photons  with  a  frequency  of 
v  =  1.420  x  109  Hz  or  wavelength  2  =  21.1  cm. 

Hendrik  C.  “Henk”  Van  de  Hulst  (1918-2000)  predicted  that  a  radio  wave¬ 
length  spectral  line  might  be  detected  from  interstellar  regions  of  electrically 
neutral,  or  un-ionized,  hydrogen  atoms  (Van  de  Hulst  1945).  He  realized  that  these 
regions,  now  designated  HI  regions  and  pronounced  “H  one  regions”,  would  be 
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cold  and  that  most  of  the  atoms  would  be  in  their  lowest  energy  state.  The  electron 
of  the  hydrogen  atom  in  this  state  has  two  possibilities  in  the  direction  of  its  spin, 
or  rotation;  a  rare  collision  between  two  atoms  could  result  in  a  change  of  the  spin 
direction  of  an  electron  in  one  of  the  atoms.  The  atom  then  is  in  an  unstable 
configuration,  so  its  electron  will  soon  return  to  the  original  spin  direction.  This 
releases  a  small  amount  of  energy  and  produces  radiation  at  a  wavelength 
of  21  cm. 

The  prediction  was  published  in  an  obscure  Dutch  journal,  Nederlands  ti- 
jdschrift  voor  natuurkunde ,  but  the  Soviet  theorist  Iosif  S.  Shklovskii  (1916-1985) 
confirmed  the  prediction  4  years  later,  with  greater  detail,  in  the  Russian  language 
(Shklovskii  1949).  At  about  this  time,  Harold  I.  “Doc”  Ewen  (1922),  a  graduate 
student  at  Harvard  University,  became  interested  in  radio  astronomy.  His  advisor, 
Edward  M.  Purcell  (1912-1991),  asked  his  wife,  Beth,  who  was  good  with  lan¬ 
guages,  to  translate  Shklovskii’ s  paper  about  the  21  cm  line.  After  reading  it, 
Purcell  encouraged  Ewen  to  build  a  radio  receiver  to  search  for  the  transition.  They 
also  had  a  copy  of  Van  de  Hulst’s  original  work,  translated  from  the  Dutch. 

Ewen  constructed  a  radiometer  to  detect  the  hypothetical  radiation  using 
electronic  components  scavenged  from  other  places  or  bought  with  a  $500  grant 
and  $300  from  Purcell.  Much  of  the  equipment  was  borrowed  every  Friday  and 
returned  each  Monday  from  Harvard’s  Cyclotron  Laboratory,  using  a  wheel¬ 
barrow.  Accurate  measurements  of  the  expected  hydrogen-line  wavelength,  using 
terrestrial  atomic  hydrogen  in  a  laboratory  at  Columbia  University,  enabled  Ewen 
to  tune  his  receiver  to  the  precise  wavelength  of  21.106  cm.  At  Purcell’s  sug¬ 
gestion,  the  receiver  was  switched  between  the  wavelength  of  the  expected  signal 
and  an  adjacent  one,  with  a  difference  that  might  contain  the  expected  21  cm  line. 
Such  a  wavelength-switched,  or  frequency-switched,  receiver  has  since  been 
widely  adapted  by  radio  astronomers  to  remove  unwanted  noise  from  an  observed 
signal  containing  a  spectral  line. 

The  receiver  was  connected  to  a  simple  horn  antenna  constructed  of  plywood, 
lined  with  copper  sheeting,  and  mounted  on  a  ledge  outside  a  window  in  the 
university’s  physics  building.  Shortly  past  midnight,  when  the  Earth’s  rotation 
brought  the  plane  of  the  Milky  Way  through  the  beam  of  the  horn  antenna,  Ewen 
succeeded  in  detecting  the  21  cm  transition  with  the  novel  receiver, 

As  it  turned  out,  Van  de  Hulst  was  in  Cambridge,  Massachusetts,  as  a  visiting 
professor  at  Harvard  College  Observatory  and  the  Australian  radio  astronomer 
Frank  Kerr  (1918-2000)  was  visiting  Harvard  on  a  Fulbright  grant.  So  Ewen  and 
Purcell  invited  them  over  to  describe  the  discovery  of  the  21  cm  line  and  urge 
them  to  have  their  people  confirm  the  result. 

At  the  meeting,  the  Harvard  team  learned  for  the  first  time  that  the  Dutch  group 
at  Leiden  had  been  actively  trying  to  detect  the  radio  transition  for  several  years. 
A  description  of  the  wavelength- switched  receiver  was  provided  to  Van  de  Hulst, 
leading  to  the  conversion  of  the  Dutch  system  and,  in  a  gracious  move  Purcell 
insisted  that  publication  of  their  discovery  be  delayed  until  the  Dutch  group 
confirmed  it.  Their  discovery  was  published  when  it  was  confirmed  by  the  Dutch 
and  Australian  radio  astronomers.  A  coordinated  report  from  all  three  centers  then 
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was  published  in  the  journal  Nature  (Ewen  and  Purcell  1951,  Muller  and 
Oort  1951). 

The  detection  of  the  21  cm  line  revolutionized  studies  of  the  interstellar 
medium.  Snow  and  McCall  (2006)  reviewed  diffuse  atomic  and  molecular  clouds, 
whereas  Kalberla  and  Kerp  (2009)  have  reviewed  the  H  I  distribution  of  the  Milky 
Way  -  Kerr  (1969)  previously  reviewed  the  large-scale  distribution  of  hydrogen  in 
our  Galaxy.  Frisch  et  al.  (2011)  have  reviewed  the  interstellar  medium  surrounding 
the  Sun.  Cox  and  Reynolds  (1987)  reviewed  our  knowledge  of  the  local  interstellar 
medium;  McCray  and  Snow  (1979)  discussed  the  violent  interstellar  medium;  and 
Spitzer  (1990)  reviewed  theories  of  the  hot  interstellar  gas. 

The  H  I  regions  do  not  emit  visible  light;  hence  they  are  invisible  at  optical 
wavelengths,  but  they  do  emit  radio  waves  that  are  21  cm  long,  with  typical 
parameters  given  in  Table  11.4. 

As  Van  de  Hulst  pointed  out,  these  spin  transitions  occur  rarely  in  the  tenuous 
interstellar  gas;  however,  an  observer  might  well  detect  them  when  looking 
through  the  vast  extent  of  interstellar  space.  The  probability,  A10,  of  a  single  atom 
undergoing  such  a  change  in  its  electron  spin  is: 

Aio  =  2.8689  x  10“15  s_1.  (11.30) 

The  radiative  half-life  of  this  hyperfine  transition  is  ti/2  = 
l/Amn  «  11.1  million  years,  where  1  year  =  3.1557  x  10  s.  At  an  average 

z  _ o 

interstellar  number  density  of  NH  =  10  m  ,  one  21  cm  transition  of  a  hydrogen 
atom  would  occur  every  second  when  observing  an  H  I  region  with  a  radius  of 
about  440  m,  and  when  looking  through  thousands  of  parsecs,  with  1  par- 

1  Z  1  ^1 

sec  =  3.0856  x  10  m,  one  could  detect  about  10  transitions  every  second. 

An  interesting  aspect  of  the  first  observations  of  the  21  cm  transition  was  that 
the  detected  line  was  seen  in  emission.  Collisions  between  atoms  determined  the 
line  excitation  temperature,  which  is  equal  to  the  kinetic  temperature  of  the 
interstellar  gas.  This  can  be  inferred  from  the  Doppler  broadening,  A2,  at 
2  =  21  cm,  with  a  velocity  V/c  ~  A 2/2,  where  the  speed  of  light 

o  1 

c  =  2.9979  x  10  ms  and  the  kinetic  temperature,  TK ,  of  hydrogen  atoms  of 
mass,  mH  1.67  x  10  ^  kg  moving  at  velocity,  V,  obtained  from  equating 
thermal  energy  to  kinetic  energy.  It  is  given  by: 


Kinetic  temperature  =  TK 


tuhV" 

3k 


(11.31) 


_ OQ  _  1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  ~  J  K  .  The  excitation 
temperatures  of  the  observed  21  cm  lines  confirmed  theoretical  expectations  that 
the  neutral  hydrogen  in  interstellar  space  has  a  temperature  of  about  100  K  (see 
Spitzer  1948;  Spitzer  and  Savedoff  1950). 
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Table  11.4  Physical  properties  of  atomic  hydrogen  (H  I)  regions 

/r  o  _ o 

Nh  =  density  of  hydrogen  atoms  =10-10  m 
Tk  —  kinetic  temperature  =  10-100  K 

R  =  radius  =  3-33  light-years  =  1-10  pc  «  (3-30)  x  1016  m 

v10  =  frequency  of  hyperfme  spin  transition  =  1.420  405  751  768  x  109  Hz  «  1,420  MHz 
210  =  wavelength  of  hyperhne  spin  transition  =  21.106  114  054  13  cm  «  21.1  cm 
A10  =  probability  of  hyperhne  spin  transition  =  2.8689  x  10-15  s_1 


Example:  Column  density  of  interstellar  hydrogen  atoms 

Radio  astronomers  use  observations  of  the  21  cm  line  to  infer  the  line- 
integrated  column  density  NCh  of  hydrogen  atoms  along  the  line  of  sight.  For 
a  spin  excitation  temperature,  Ts,  and  optical  depth  t,  the  column  density 
inferred  from  a  21  cm  line  of  width  AvL  is: 

Nch  =  1.82  x  1022  tTsAvl  m“2,  (11.32) 

where  the  numerical  coefficient  gives  the  number  of  hydrogen  atoms  per  square 
meter  when  the  full- width  to  half  maximum  of  the  line  is  in  units  of  km  s-1. 

For  the  optically  thin  case,  the  measured  brightness  temperature,  TB , 
which  is  equal  to  tTs,  will  be  proportional  to  the  column  density  per  unit 
velocity.  For  a  line  width  of  about  1  km  s-1  and  a  brightness  temperature  of 
Tb  10  K,  we  would  have  NCh  ~  2  x  10"  m  ,  and  since  the  number 

X  _ Q 

density  of  interstellar  hydrogen  is  NH  ^  10  m  ,  you  would  have  to  look 
through  a  length  L  —  NCh/ Nh  10  m  to  detect  such  a  line,  which  is  just  a 
few  parsecs. 

The  kinetic  temperature  can  be  inferred  from  the  Doppler  broadening  of 
the  line,  and  used  to  infer  the  spin  temperature.  If  the  spin  temperature  is 
equal  to  the  kinetic  temperature  of  TK  =  100  K,  then  the  thermal  velocity  is 
Vth  =  (3kTK/  mH)112  «  1.57  x  103  ms-1  =  1.57  km  s  \  where  the  Boltz- 

_ oo  _ 1 

mann  constant  k  =  1.38065  x  10  ^  J  K  and  the  mass  of  the  hydrogen 

_ 0^7 

atom  mH  1.67  x  10  kg. 


11.5  Interstellar  Molecules 

Soon  after  the  discovery  of  interstellar  atomic  hydrogen  at  radio  wavelengths, 
astronomers  began  to  speculate  about  the  possibility  of  detecting  molecules  with 
radio  waves,  which  are  sensitive  to  the  coldest  clouds  of  interstellar  matter  where 
molecules  might  survive  (Shklovskii  1953;  Townes  1957).  Observations  of  any 
molecule  first  required  accurate  measurements  of  the  wavelength  of  its  radio 
spectral  features  in  the  terrestrial  laboratory.  Furthermore,  radio  telescopes  with 
surfaces  accurate  to  a  few  centimeters  or  better  had  to  be  constructed  for  receiving 
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the  short-wavelength  emission  of  molecules,  and  new  methods  of  spectral  analysis 
needed  to  be  developed. 

Charles  H.  Townes  (1915-  )  and  his  colleagues  at  Columbia  University  made 
the  first  precision  laboratory  measurements  of  the  radio-frequency  transitions  of 
the  hydroxyl  (OH)  molecule,  composed  of  an  atom  of  oxygen,  O,  and  an  atom  of 
hydrogen,  H.  At  an  international  symposium  of  radio  astronomy  in  1955,  he 
presented  laboratory  measurements  of  the  rotational  transitions  of  other  molecules 
that  might  be  detected,  including  carbon  monoxide  and  water  (Townes  1957). 

Alan  Barrett  (1927-1991)  and  his  colleagues  at  the  Massachusetts  Institute  of 
Technology  built  the  sophisticated  equipment  needed  to  obtain  the  first  observa¬ 
tions  of  interstellar  OH,  at  the  wavelength  of  18.005  cm  specified  by  Townes 
(Weinreb  et  al.  1963).  The  discovery  was  followed  closely  by  intensive  searches 
for  OH  and  eventually  resulted  in  the  realization  that  some  sources  are  as  small  as 
stars  and  act  like  cosmic  masers  -  maser  is  an  acronym  for  “microwave  ampli¬ 
fication  by  stimulated  emission  of  radiation.”  Reid  and  Morgan  (1981)  have 
reviewed  cosmic  masers;  many  pioneering  research  papers  for  radio  detection  of 
interstellar  molecules  are  mentioned  in  Lang  (1999). 

Zuckerman  and  Palmer  (1974)  have  given  us  a  review  of  radio  radiation  from 
interstellar  molecules.  Ho  and  Townes  (1983)  have  described  interstellar  ammo¬ 
nia;  Combes  (1991)  has  reviewed  the  distribution  of  CO  in  the  Milky  Way  -  also 
see  Gordon  and  Burton  (1976).  Herbst  and  Van  Dishoeck  (2009)  have  reviewed 
complex  organic  interstellar  molecules;  Ehrenfreund  and  Chamley  (2000) 
reviewed  organic  molecules  in  the  interstellar  medium,  comets  and  meteorites. 
Snow  and  McCall  (2006)  have  discussed  both  diffuse  atomic  and  molecular 
clouds;  Lada  and  Lada  (2003)  have  reviewed  embedded  clusters  in  molecular 
clouds;  Lukui  and  Kawamura  (2010)  have  reviewed  molecular  clouds  in  nearby 
galaxies;  and  Solomon  and  Vanden  Bout  (2005)  have  discussed  molecular  gas  at 
high  redshift.  The  dense  molecular  clouds  are  the  sites  of  star  formation,  which  we 
next  discuss. 

Townes  moved  to  the  University  of  California  at  Berkeley  and,  within  a  year, 
he  and  his  graduate  students  discovered  ammonia  and  water  in  interstellar  space 
(Cheung  et  al.  1968).  This  was  followed  soon  by  the  detection  of  the  embalming 
fluid  formaldehyde  (Snyder  et  al.  1969),  as  well  as  carbon  monoxide  (Wilson  et  al. 

1970). 

As  might  be  suspected,  the  interstellar  molecules  are  composed  of  the  most 
abundant  atoms  in  the  universe,  beginning  with  hydrogen,  but  also  carbon  (C), 
oxygen  (O),  and  nitrogen  (N).  They  combine  to  form  molecules  such  as  ammonia, 
NH3;  carbon  monoxide,  CO;  hydrogen  cyanide,  HCN;  water,  H20;  and  formal¬ 
dehyde,  H2CO  (Table  11.5). 

The  early  findings  triggered  an  avalanche  of  molecular  searches  in  which 
groups  of  young  radio  astronomers  engaged  in  an  exciting  pursuit  of  previously 
unseen  interstellar  molecules.  The  net  result  was  the  discovery  of  hundreds  of 
interstellar  molecules,  including  complex  organic  molecules  such  as  ethyl  alcohol, 
or  ethanol  -  the  substance  that  gives  beer,  wine,  and  liquor  their  intoxicating 
power.  Although  interstellar  matter  is  generally  sparse  and  tenuous,  the  molecules 
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Table  11.5  Abundant  interstellar  moleculesa 


Chemical  symbol 

Name  of  molecule 

Year  of  discovery 

Frequencya 

Wavelengtha  (cm) 

OH 

Hydroxyl 

1963 

1665.4  MHz 

18.0054 

CO 

Carbon  monoxide 

1970 

115.27  GHz 

0.25911 

h2o 

Water 

1968 

22.235  GHz 

1.3483 

HCN 

Hydrogen  cyanide 

1970 

88.632  GHz 

0.33824 

nh3 

Ammonia 

1968 

23.694  GHz 

1.2653 

h2co 

Formaldehyde 

1969 

4829.7  MHz 

6.2072 

a  The  transition  frequencies  are  given  in  MegaHertz,  abbreviated  MHz,  or  GigaHertz,  abbrevi¬ 
ated  GHz.  One  MHz  =  106  Hz  or  a  million  Hertz  and  1  GHz  =  109  Hz  or  a  billion  Hertz.  The 
transition  wavelengths  are  given  in  units  of  centimeters,  cm,  where  1  cm  =  0.01  m  =  0.01  m 


are  concentrated  within  dark  dust  clouds  that  can  be  1  million  times  as  dense  as  a 
typical  region  of  interstellar  hydrogen  atoms. 

The  transition  frequencies  of  the  numerous  interstellar  molecules  that  have  been 
detected  are  given  online  at  Splatalogue.net.  For  example,  the  CO  line  has  a 
frequency  of  v  =  115.271208  GHz,  where  1  GHz  =  109  Hz.  This  transition  has  a 
spontaneous  emission  coefficient  of  A10  =  7.202  x  10-8  s-1  «  2.3  yr-1.  The 
minimum  temperature  required  to  excite  this  transition  is  given  by  equating 
the  photon  energy  hv  to  the  thermal  energy  3kT/2,  to  obtain  f  ^  3.7  K,  using  the 

_ oq 

Planck  constant  h  =  6.6261  x  10  J  s  and  the  Boltzmann  constant 
k  =  1.38065  x  1(T23  J  K-1. 

The  first  discoveries  of  interstellar  molecules  had  not  been  anticipated  even  a 
decade  earlier  because  astronomers  had  overlooked  the  importance  of  interstellar 
dust  grains  in  shielding  molecules  from  destructive  ultraviolet  starlight  and  acting 
as  a  catalyst  in  forming  complex  molecules;  for  reviews  of  pioneering  work  see 
Lang  (1999).  Nevertheless,  it  had  long  been  known  that  molecules  could  be  found 
in  the  cool  outer  atmospheres  of  some  stars  (Tsuji  1986). 

The  most  abundant  cosmic  molecule  is  molecular  hydrogen,  denoted  H2,  which 
consists  of  two  atoms  of  hydrogen,  designated  H.  This  molecule  cannot  be 
observed  directly  at  radio  wavelengths;  however,  the  second  most  abundant 
interstellar  molecule,  carbon  monoxide,  maps  out  its  distribution.  Field, 
Somerville  and  Dressier  (1966)  discussed  hydrogen  molecules  in  astronomy,  and 
Shull  and  Beckwith  (1982)  reviewed  interstellar  molecular  hydrogen. 


Chapter  12 

Formation  of  the  Stars  and  Their  Planets 


12.1  How  the  Solar  System  Came  into  Being 

12.1.1  The  Nebular  Hypothesis 

Where  did  the  Sun  and  its  attendant  planets  come  from?  How  and  when  did  they 
form?  The  most  likely  explanation  is  provided  by  the  nebular  hypothesis ,  which 
states  that  the  Sun  and  planets  formed  together,  as  a  result  of  the  gravitational 
collapse  of  an  interstellar  cloud  of  gas  and  dust  also  known  as  the  solar  nebula 
(Fig.  12.1).  This  theory  accounts  for  the  orderly,  aligned  motions  of  the  major 
planets.  They  all  move  in  a  narrow  band  across  the  sky,  implying  that  their  orbits 
all  lie  in  nearly  the  same  plane,  which  nearly  coincides  with  the  Sun’s  equatorial 
plane.  All  of  the  planets  move  in  the  same  direction  within  their  Sun-centered 
orbits,  and  both  the  Sun  and  most  of  the  major  planets  rotate  in  this  direction  - 
Venus  and  Uranus  are  the  exceptions. 

The  orbits  of  most  of  the  planetary  moons,  or  natural  satellites,  imitate  those  of 
the  planets  in  being  confined  to  the  planet’s  equatorial  plane  and  revolving  about 
the  planet  in  the  same  direction  that  the  planet  rotates.  It  is  exceedingly  unlikely 
that  the  major  planets  and  large  moons  became  aligned  by  chance. 

Although  Newton’s  laws  and  Kepler’s  laws  describe  the  present  motions  of  the 
solar  system,  they  cannot  explain  the  remarkable  arrangement  of  its  planets  and 
satellites.  Additional  constraints  are  required,  which  describe  the  situation  before 
the  planets  were  formed  and  set  in  motion.  These  initial  conditions  are  provided  by 
the  nebular  hypothesis. 

The  German  philosopher  Immanuel  Kant  (1724-1804)  introduced  the  basic 
idea  (Kant  1755).  He  pictured  an  early  universe  filled  with  thin  gas  that  collected 
into  dense,  rotating  gaseous  clumps.  One  of  these  primordial  concentrations  was 
the  spinning  solar  nebula.  The  Sun  formed  at  the  center  of  the  solar  nebula,  and  the 
planets  formed  from  swirling  condensations  in  a  flattened  disk  revolving  around 
the  Sun. 


K.  R.  Lang,  Essential  Astrophysics,  Undergraduate  Lecture  Notes  in  Physics, 
DOI:  10.1007/978-3-642-35963-7_12,  ©  Springer- Verlag  Berlin  Heidelberg  2013 
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Fig.  12.1  Formation  of  the  solar  system  An  artist’s  impression  of  the  nebular  hypothesis,  in 
which  the  Sun  and  planets  were  formed  at  the  same  time  during  the  collapse  of  a  rotating 
interstellar  cloud  of  gas  and  dust  that  is  called  the  solar  nebula.  The  center  collapsed  to  ignite 
nuclear  reactions  in  the  nascent  Sun,  and  the  surrounding  material  was  whirled  into  a  spinning 
disk  where  the  planets  coalesced.  (Courtesy  of  Helmut  K.  Wimmer,  Hayden  Planetarium, 
American  Museum  of  Natural  History.) 


According  to  another  version  of  the  nebular  hypothesis,  suggested  by  Pierre 
Simon  Laplace  (1749-1827),  the  shrinking  Sun  shed  a  succession  of  gaseous  rings, 
and  each  ring  condensed  into  a  planet  (Laplace  1796).  Then,  each  planet,  in  turn, 
became  a  small  rotating  nebula  in  which  its  own  family  of  rings  and  satellites  was 
born. 

There  is  now  so  much  evidence  for  the  nebular  hypothesis  that  it  has  acquired 
the  status  of  a  theory,  whose  basic  tenets  are  still  valid.  The  spinning  solar  nebula, 
attracted  by  its  own  gravity,  fell  in  on  itself  4.6  billion  years  ago,  becoming 
increasingly  dense,  until  the  middle  became  so  packed,  tight,  and  hot  that  the  Sun 
began  to  shine.  The  planets  formed  at  the  same  time,  within  a  flattened  rotating 
disk  centered  on  the  contracting  proto-Sun. 

This  is  the  essence  of  the  original  nebular  theory,  which  explains  qualitatively 
the  fact  that  the  major  planets  and  their  large  moons  all  revolve  in  the  same 
direction  within  the  plane  that  coincides  with  the  equator  of  the  rotating  Sun.  This 
regular,  aligned  pattern  of  motion  is  a  natural  consequence  of  the  rotation  and 
collapse  of  a  solar  nebula  composed  of  gas  and  dust  from  which  the  Sun  and 
planets  were  produced. 


12.1.2  Composition  of  the  Planets 

If  the  nebular  theory  is  correct,  we  might  expect  that  all  of  the  planets  would  have 
the  same  composition  as  the  Sun  because  they  all  formed  from  the  same  inter¬ 
stellar  nebula.  After  all,  they  should  have  the  same  ingredients  as  the  material  from 
which  they  formed  -  they  do,  but  with  a  varying  mix. 
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The  abundance  of  elements  in  the  giant  planet  Jupiter  does  indeed  mimic  that  of 
the  Sun,  with  a  predominance  of  the  lightest  element,  hydrogen.  Unlike  the  Sun, 
the  Earth  is  mainly  composed  of  heavier  elements,  and  this  difference  must  be 
explained.  It  is  related  to  the  fact  that  there  are  two  main  types  of  major  planets  - 
the  terrestrial  planets  and  giant  planets  -  that  differ  in  size,  composition,  and 
distance  from  the  Sun. 

The  four  planets  closest  to  the  Sun  -  Mercury,  Venus,  Earth  and  Mars  -  are 
known  as  terrestrial  planets  because  they  are  similar  to  the  Earth.  These  inner 
planets  are  rocky  and  relatively  compact  and  dense.  In  contrast,  the  four  giant 
planets  -  Jupiter,  Saturn,  Uranus  and  Neptune  -  which  reside  in  the  outer  parts  of 
the  planetary  system,  are  big,  gaseous,  planets  that  have  relatively  low  mean  mass 
densities.  Unlike  the  inner  terrestrial  planets,  rings  and  numerous  satellites  encircle 
the  outer  giant  planets. 

The  radius  of  a  planet  is  determined  from  its  distance  and  angular  size,  and  the 
mass  of  a  planet  can  be  inferred  from  the  orbital  motions  of  its  large  satellites  or 
moons.  The  radius  of  Jupiter,  for  example,  is  1 1.2  times  the  radius  of  the  Earth  and 
its  mass  is  3 1 8  times  that  of  the  Earth.  If  a  planet  has  no  moon,  like  Mercury  and 
Venus,  its  mass  can  be  obtained  from  detailed  observations  of  its  gravitational 
effects  on  spacecraft  that  pass  or  orbit  near  it.  The  mean,  or  average,  mass  density 
of  planet  can  be  computed  by  dividing  its  mass,  in  kilograms,  by  its  volume  in 
cubic  meters.  The  volume  with  a  planet  with  radius  R  is  4tzR  / 3,  so  the  mean  mass 
density  of  a  planet  of  mass  M  is  3M/(4nR  ).  For  instance,  the  mean  mass  density 
of  Jupiter  is  1,330  kilograms  per  cubic  meter,  abbreviated  1,330  kg  m-3.  This  is 
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comparable  to  the  Sun’s  mean  mass  density  of  1,409  kg  m  ,  but  it  is  almost  four 

_ o 

times  lower  than  that  of  the  Earth  at  5,513  kg  m  . 

A  clue  to  these  differences  comes  from  the  locations  of  the  two  types  of  planets 
when  they  originated.  The  terrestrial  planets  formed  in  the  warm  regions  of  the 
flattened  solar  nebula,  close  to  the  bright,  young  Sun,  whereas  the  giant  planets 
formed  farther  from  the  Sun  in  the  colder  outer  regions  of  the  solar  nebula  (Lewis 
1974,  2004). 

Modem  observations  of  the  interstellar  medium  indicate  that  it  consists  of  7 1  % 
hydrogen,  27  %  helium,  and  2  %  heavier  elements,  where  the  percentage  numbers 
are  by  mass.  In  terms  of  the  number  of  atoms,  about  92  %  are  hydrogen  atoms, 
nearly  8  %  are  helium  atoms,  and  all  the  heavier  elements  make  up  less  than  1  %, 
which  is  roughly  consistent  with  the  composition  of  the  Sun. 

When  the  temperature  was  low  enough,  some  of  the  materials  condensed  out  of 
the  gas  and  dust  of  the  solar  nebula,  but  only  a  very  modest  fraction  of  the  nebular 
material  outside  the  Sun  ever  condensed  into  the  planets.  The  substances  that  did 
contribute  to  the  formation  of  planets  can  be  divided  into  three  categories,  totaling 
up  to  no  more  than  2  %  by  mass  of  the  original  solar  nebula.  They  are  the  metals 
(0.2  %),  the  rocks  (0.4  %)  and  the  ices  (1.4  %),  and  they  condensed  at  different 
temperatures  and  distances  from  the  young  Sun. 

Metals,  like  iron,  condensed  from  gaseous  into  solid  form  at  the  highest  tem¬ 
peratures  close  to  the  Sun,  rock  condensed  at  moderate  temperatures  of  about 
1,000  K  and  water,  methane,  and  ammonia  condensed  into  ices  at  temperatures 
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below  150  K.  The  asteroid  belt,  located  between  Mars  and  Jupiter,  is  located  at  the 
distance  from  the  Sun  where  it  is  cold  enough  for  ices  to  condense,  marking  the 
transition  between  the  warm  inner  regions  where  the  terrestrial  planets  formed  and 
the  cold  outer  domain  of  the  giant  planets. 

In  the  inner  regions  of  the  solar  nebula,  the  higher  temperatures  vaporized  icy 
material  that  could  not  condense,  leaving  only  rocky  substances  of  relatively  high 
mass  density  to  coalesce  and  merge  to  form  the  terrestrial  planets.  Also,  the  low 
total  mass  and  high  initial  temperature  of  these  planets,  as  well  as  their  proximity 
to  the  Sun,  did  not  allow  them  to  capture  and  retain  the  abundant  lighter  gases  - 
hydrogen  and  helium  -  directly  from  the  solar  nebula. 

The  rocky  terrestrial  planets  were  so  hot  in  their  formative  stages,  beginning 
about  4.6  billion  years  ago,  that  their  interior  rock  and  metal  melted  and  gravity 
separated  them  by  density.  In  a  process  known  as  differentiation ,  the  denser 
material  (iron)  sank  toward  the  center,  whereas  the  less  dense  rocks  (the  silicates) 
remained  closer  to  the  surface.  The  planets  then  cooled  from  the  outside  in  as  time 
elapsed,  so  we  can  now  walk  across  the  Earth’s  solid  surface.  Our  planet  still  has  a 
molten  core  due  to  heat  generated  by  radioactive  elements  inside  it. 

At  larger  distances  from  the  Sun,  where  the  solar  nebula  was  colder,  icy  sub¬ 
stances  condensed  to  form  the  cores  of  giant  planets.  These  cores  became  suffi¬ 
ciently  massive  to  gravitationally  capture  some  of  the  surrounding  hydrogen  and 
helium,  which  was  pulled  into  the  giant  planets.  The  low  temperatures  at  remote 
distances  from  the  Sun  thus  enabled  the  giant  planets  to  retain  the  abundant  light 
gases  and  grow  even  bigger,  with  large  masses  and  low  mass  densities.  A ’Hearn 
(2011)  reviewed  comets  as  building  blocks  of  the  cores  of  giant  planets. 

Jupiter’s  low  mass  density,  for  example,  indicates  that  it  is  composed  largely  of 
hydrogen  and  helium,  just  as  the  Sun  is.  Under  the  enormous  pressures  inside 
massive  Jupiter,  the  abundant  hydrogen  is  compressed  into  liquid  molecular  form, 
and  its  central  temperature  of  about  17,000  K  is  leftover  heat  from  its  formation. 
Since  Jupiter  is  not  as  massive  as  the  Sun,  nuclear  fusion  reactions  cannot  occur 
within  the  giant  planet. 

When  the  masses  of  the  Sun  and  planets  are  determined,  we  find  that  the  Sun 
does  not  only  lie  at  the  heart  of  our  solar  system;  it  also  dominates  it,  which  means 
that  most  of  the  nebular  mass  outside  the  Sun  never  became  part  of  any  planet. 
Some  99.866  %  of  all  of  the  matter  between  the  Sun  and  halfway  to  the  nearest 
star  is  contained  in  the  Sun.  All  of  the  objects  that  orbit  the  Sun  -  the  planets  and 
their  satellites,  the  comets,  and  the  asteroids  -  add  up  to  only  0.134  %  of  the  mass 
of  our  solar  system.  Relative  to  the  Sun,  the  planets  are  insignificant  specks,  left 
over  from  its  formation  and  held  captive  by  its  massive  gravity. 

Almost  all  of  the  hydrogen  and  helium  gas  that  enveloped  the  newly  formed 
Sun  must  have  disappeared.  The  powerful  winds  of  the  young  Sun  apparently 
cleaned  out  the  solar  system,  blasting  away  all  of  the  remaining  gases  that  had  not 
condensed  to  make  planets.  Some  of  the  leftover  rocky  material  not  found  in  the 
terrestrial  planets  is  located  in  the  asteroid  belt,  and  some  of  the  remaining  ice  is 
located  in  the  distant  precincts  of  the  solar  system  where  the  comets  reside.  Most 
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of  the  hydrogen  and  helium  that  was  not  near  the  giant  planets  or  in  the  Sun  also 
was  blown  away  in  the  formative  stages  of  the  solar  system. 


12.1.3  Mass  and  Angular  Momentum  in  the  Solar  System 

The  nebular  hypothesis  must  also  be  adjusted  to  explain  the  current  distribution  of 
angular  momentum  in  the  solar  system.  Most  of  it  is  concentrated  in  the  orbital 
angular  momentum  of  Jupiter,  and  the  rotating  Sun  has  less  than  one  percent  of  the 
amount  of  angular  momentum  carried  by  this  giant  planet.  In  other  words,  a  very 
small  fraction  of  the  mass  of  the  solar  system  has  significant  angular  momentum, 
while  most  of  the  mass  has  relatively  little  angular  momentum. 

According  to  the  law  of  conservation  of  angular  momentum,  the  rotation  of  a 
shrinking  object  will  speed  up  as  the  radius  decreases  (Focus  12.1).  The  young  Sun 
should  have  therefore  been  rotating  very  rapidly  when  it  formed,  with  a  rotation 
period  of  just  a  few  hours  or  less,  but  the  Sun  rotates  quite  slowly  today,  with  each 
full  rotation  taking  25.7  days  at  the  solar  equator.  That  corresponds  to  a  rotation 
velocity  of  about  2,000  ms-1.  The  spinning  Sun  must  have  slowed  down  as  it 
aged. 


Focus  12.1  How  fast  was  the  young  Sun  rotating? 

According  to  the  nebular  hypothesis,  the  Sun  and  planets  formed  together  as 
the  result  of  the  gravitational  collapse  of  a  rotating  interstellar  cloud,  called 
the  solar  nebula.  Before  the  formation  of  the  solar  system,  the  solar  nebula 
might  have  been  rotating  at  a  leisurely  rate,  but  collapse  would  inevitably 
increase  its  rotation  speed.  This  is  a  consequence  of  the  conservation  of 
angular  momentum  during  gravitational  collapse.  For  a  body  of  mass,  M, 
rotation  velocity,  V,  and  radius,  R ,  the  angular  momentum  is  M  x  V  x  R, 
and  since  V  =  2nR/P  for  a  rotation  period,  P,  the  conservation  law  means 
that: 

InMR2 

MVR  = - =  Constant.  (12.1) 

P  v  y 

Since  the  Sun  contains  99.87  %  of  the  mass  of  the  solar  system,  we  can 
assume  that  the  mass  remains  constant  during  the  collapse  of  the  solar  nebula 
to  form  the  Sun  and  planets. 

The  nearest  star,  other  than  the  Sun,  is  Proxima  Centauri,  located  at  a 
distance  of  D  =  4.23  light-years  =  1.30  parsecs  =  268,000  AU.  We  might 
assume  the  solar  nebula  initially  extended  to  half  this  distance  or  roughly  to 
a  radius  of  134,000  AU,  where  1  AU  =  1.496  x  1011  m. 
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In  fact,  some  comets  in  our  solar  system  have  orbits  at  about  a  distance  of 
a  =  100,000  AU  from  the  Sun.  Kepler’s  third  law  gives  the  orbital  period 
P  of  such  a  comet  as: 


(12.2) 


where  n  «  3.14159,  the  gravitational  constant  G  =  6.674  x 
10-11  m3  kg-1  s-2  and  the  Sun’s  mass  M0  =  1.989  x  1030kg.  Solving  for 
the  period  of  the  comet  we  obtain  P  ^  10  s  ^  3  x  10  years,  and  an 
orbital  velocity  of  V  =  2na/P  10  m  s  . 

If  we  assume  that  the  solar  nebula  began  at  the  distance  and  orbital  period 
of  a  comet,  with  a  radius  RSn  =  100,000  AU  and  rotation  period  of 
PSN  =  10 15  s,  then  by  the  time  the  Sun  collapsed  to  its  present  size  of 
Rq  =  6.955  x  108  m,  its  rotation  period,  P0,  should  have  been: 


(12.3) 


which  gives  P0^2.1s  and  a  rotation  velocity  of  V0  = 
2tzRq/Pq  ^2.2  x  109ms_1  if  the  angular  momentum  is  perfectly  con¬ 
served  during  gravitational  collapse.  But  this  is  an  impossibly  fast  rotation, 
for  the  outward  centrifugal  force  of  rotation  would  stop  the  collapse  long 
before  this  occurred.  The  stars  with  the  fastest  rotation,  have  rotation  periods 
of  about  10  h  or  36,000  s,  and  rotation  velocities  of  about 
300  km  s_1  =  3  x  105  m  s"1  (Sect.  4.3). 

This  suggests  that  a  star  is  not  formed  by  gravitational  collapse  alone; 
some  other  phenomenon  must  slow  the  spin,  perhaps  through  mass  loss  from 
powerful  winds  or  by  magnetic  fields  that  link  the  star  to  the  surrounding 
material. 

However,  the  Sun  now  has  an  equatorial  rotation  period  of 
Pq  =  25.7  days,  or  2.22  x  10  s,  much  longer  than  expected,  and  a  rotation 
velocity  of  V0  «  1,971  ms-1.  Even  in  this  situation,  we  have  to  conclude 
that  some  process  other  than  gravitational  contraction  must  have  slowed  the 
Sun’s  spin  after  its  birth. 


A  powerful  solar  wind  and  intense  magnetic  field  during  the  Sun’s  youth  may 
have  conspired  to  produce  the  Sun’s  current  slow  rotation.  A  strong  magnetic  field, 
generated  by  the  fast  solar  rotation  in  the  Sun’s  early  epochs,  might  have  con¬ 
nected  the  Sun  to  the  distant,  slowly  rotating  material  in  the  surrounding  disk, 
acting  as  a  magnetic  brake  to  solar  rotation  (Mestel  1968).  The  rapidly  rotating 
Sun  swept  the  magnetic  field  by  the  slow-moving  charged  particles  in  the  outer 
solar  nebula,  producing  a  drag  that  slowed  down  the  Sun  (Fig.  12.2). 


12.1  How  the  Solar  System  Came  into  Being 


387 


Fig.  12.2  Magnetic  brakes 

The  central,  rapidly  rotating 
Sun  is  connected  to  an 
ionized,  slowly  rotating  disk 
by  magnetic  fields  ( side 
view).  The  magnetic  field  is 
twisted  into  a  spiral  shape 
( top  view )  and  acts  as  a  brake 
on  the  Sun’s  rotation, 
transferring  angular 
momentum  from  the  Sun  to 
the  proto-planetary  disk 


Top  View 


Magnetic  Field 


The  solar  wind  may  have  been  more  intense  during  the  Sun’s  youth,  carrying 
greater  amounts  of  mass  away  from  the  Sun  and  perhaps  slowing  its  rotation.  The 
young  Sun  might  also  have  been  more  active  than  it  is  today,  producing  strong, 
explosive  gusts  in  the  solar  wind  and  helping  to  diminish  the  Sun’s  angular 
momentum.  Fiegelson  and  Montmerle  (1999)  have  reviewed  high-energy  pro¬ 
cesses  in  young  stellar  objects. 

Young  stars  rotate  rapidly,  generate  powerful  winds,  and  have  strong  magnetic 
fields.  Older  stars  rotate  slowly  like  the  Sun,  and  exhibit  calmer  winds  and  reduced 
activity. 

Whatever  the  exact  mechanism  of  sweeping  angular  momentum  away  from  the 
Sun,  it  therefore  appears  to  apply  to  other  stars. 

If  the  nebular  theory  is  correct,  it  should  apply  to  the  formation  of  other  stars,  not 
only  the  Sun.  Massive  molecular  clouds,  in  fact,  are  even  now  in  the  process  of 
creating  young  stars  that  are  currently  embedded  in  the  interstellar  gas  and  dust  that 
spawned  them.  However,  not  every  interstellar  cloud  is  in  the  process  of  star  for¬ 
mation.  For  the  most  part,  the  interstellar  gas  and  dust  is  too  hot  and  too  tenuous  to 
spontaneously  collapse  into  stars,  which  is  why  there  is  still  sufficient  interstellar 
material  around  to  create  new  stars  after  billions  of  years  of  continued  star  formation. 
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12.2  Star  Formation 

Woodward  (1978),  Shu  et  al.  (1987),  Evans  (1999)  and  McKee  and  Ostricker 
(2007)  have  reviewed  star  formation.  Bastian  et  al.  (2010)  discussed  a  universal 
initial  mass  function  for  stars.  Zinnecker  and  Yorke  (2007)  reviewed  massive  star 
formation,  whereas  Churchwell  (2002)  has  discussed  ultra-compact  H  II  regions 
and  massive  star  formation.  Luhman  (2012)  has  reviewed  the  formation  and  early 
evolution  of  low  mass  stars  and  brown  dwarfs;  and  Zuckerman  and  Song  (2004) 
summarized  our  knowledge  of  young  stars  near  the  Sun. 


12.2.1  Giant  Molecular  Clouds 

Interstellar  clouds  of  gas  and  dust  provide  the  raw  material  for  new  stars,  but  they 
do  not  form  everywhere  within  interstellar  space.  Stars  form  within  particularly 
cold,  dense,  and  massive  regions  known  as  giant  molecular  clouds.  These  con¬ 
temporary  incubators  of  newborn  stars  have  temperatures  as  low  as  10  K,  span 
tens  of  light-years,  and  each  one  has  a  mass  of  up  to  1  million  solar  masses,  mainly 
in  the  form  of  hydrogen  molecules.  These  giant  molecular  clouds  are  now  the 
dominant  star-forming  component  of  the  interstellar  medium. 

As  many  as  1  million  million,  or  10  ,  hydrogen  molecules  can  be  packed  into 
every  cubic  meter  of  such  a  giant  molecular  cloud  (Table  12.1).  In  contrast,  there 
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are  no  more  than  100  million,  or  10  ,  hydrogen  atoms  in  1  m  of  the  interstellar 
material  outside  the  giant  molecular  clouds.  These  atoms  are  about  10  times  hotter, 
at  about  100  K,  than  molecular  clouds.  The  dust  in  the  dark  molecular  clouds 
blocks  the  harsh  ultraviolet  radiation  in  space  and  enables  chemical  reactions  to 
form  complex,  delicate  molecules  from  the  atomic  constituents  of  the  interstellar 
gas. 

If  a  giant  molecular  cloud  becomes  sufficiently  massive  and  dense,  the  mutual 
gravitation  of  its  parts  will  overcome  the  outward  gas  pressure  from  inside,  and  the 
cloud  starts  falling  in  on  itself.  Once  this  gravitational  collapse  is  underway,  the 
giant  cloud  fragments  into  smaller  components;  and  the  pieces  collapse  until  their 
cores  become  hot  enough  to  ignite  nuclear  fusion,  burning  hydrogen  to  become 
stars  like  the  Sun.  Some  interstellar  clouds  are  even  now  in  the  process  of  creating 
stars  (Fig.  12.3).  Thus,  stars  are  continually  reformed,  as  new  stars  arise  in  the  dark 
spaces  between  the  old  ones. 


Table  12.1  Physical  properties  of  giant  molecular  clouds 


N HI 

Tk 

R 

M 


‘12  m“3 


Density  of  hydrogen  molecules  =  10lo-10 
Kinetic  temperature  =  10-30  K 

Radius  =  0.6-32  light-years  =  0.2-10  pc  w  (0.6-30)  x  10 
Mass  =  104-106  Mq  %  2  x  1034  kg  -  2  x 


16 


m 


1036  kg 
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Fig.  12.3  Mountains  of  creation  The  infrared  heat  radiation  of  hundreds  of  embryonic  stars 
(white / yellow)  and  windblown,  star-forming  clouds  (red),  detected  from  the  Spitzer  Space 
Telescope.  The  intense  radiation  and  winds  of  a  nearby  massive  star,  located  just  above  the  image 
frame,  probably  triggered  the  star  formation  and  sculpted  the  cool  gas  and  dust  into  towering 
pillars.  (Courtesy  of  NASA/JPL-Caltech/Harvard-Smithsonian  CfA/ESA/STScI.) 

Bergin  and  Tafalla  (2007)  have  reviewed  cold  dark  clouds  as  the  initial  con¬ 
ditions  of  star  formation.  Lada  and  Lada  (2003)  have  reviewed  embedded  clusters 
in  molecular  clouds;  Fukui  and  Kawamura  (2010)  have  reviewed  molecular  clouds 
in  nearby  galaxies;  and  Solomon  and  Vanden  Bout  (2005)  have  discussed 
molecular  gas  at  high  redshift. 


12.2.2  Gravitational  Collapse 

Near  the  beginning  of  the  20th  century,  the  English  physicist  and  mathematician 
James  Jeans  (1877-1946)  considered  the  stability  conditions  of  a  gas  subject  to 
perturbations  in  mass  density,  showing  that  a  fluctuation  greater  than  a  critical 
size  -  now  called  the  Jeans  length  -  or  a  mass  greater  than  a  critical  mass  -  known 
as  the  Jeans  mass  -  will  become  unstable  to  gravitational  collapse  (Jeans  1902). 
This  collapse  occurs  when  there  is  insufficient  gas  pressure  to  support  a  large, 
massive  interstellar  cloud  against  the  combined  gravitational  attraction  of  its 
component  parts. 
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A  spherical  gas  cloud  of  a  given  radius,  R ,  and  temperature,  T,  will  undergo 
gravitational  collapse  if  the  mass  is  greater  than  the  Jeans  mass,  Mj ,  given  by: 


(12.4) 


where  m  is  the  particle  mass,  the  Boltzmann  constant  k  =  1.38065  x 
10-23  J  K  \  and  the  gravitational  constant  G  =  6.674  x  10“ 11  m3  kg-1  s-2. 
The  lower  the  temperature  and  size,  the  greater  is  the  likelihood  of  gravitational 


instability.  A  cloud  is  stable  for  masses  less  than  the  Jeans  mass,  and  only  becomes 


unstable  for  a  mass  greater  than  this  critical  value. 

Example:  Derivation  of  the  Jeans  mass 

Suppose  the  radius,  R ,  of  a  mass,  M,  contracted  by  an  amount  AR.  This 
would  compress  the  gas,  increasing  the  internal  thermal  energy,  Ethermah  by 
an  amount  AEthermal  given  by: 


3MkTAR 


A  Ethennal  =  P(4nR2AR)  =  NkT(4nR2AR ) 


(12.5) 


niR 


where  the  gas  pressure,  P  =  NkT ,  the  ideal  gas  law,  N  is  the  number  density, 
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the  Boltzmann  constant  k  =  1.38065  x  10  ^  J  K  ,  the  gas  temperature 
is  T,  the  gas  particle  mass  is  m,  and  the  decrease  in  volume  is  4nR  AR. 


The  gravitational  potential  energy,  Egrav ,  of  the  cloud  (Sect.  3.2)  will 
increase  by  an  amount  A Egrav  given  by: 


GM2AR 


(12.6) 


Collapse  will  ensue  if  the  increase  in  gravitational  potential  energy 
exceeds  the  pressure  increase  that  opposes  the  collapse,  or  for 
AEgrav  >  A Ethermai,  where  the  >  symbol  denotes  greater  than  or  equal.  This 
means  that  when  the  mass  M  exceeds  the  Jeans  mass  Mj ,  or  for 


3  kT 


(12.7) 


M>  Mj  = R 

Gm 


there  isn’t  any  equilibrium  between  the  gravitational  and  thermal  forces.  The 
situation  is  unstable  and  collapse  ensues. 

By  way  of  comparison,  the  mass,  M,  of  a  cloud  of  particles  of  mass,  m,  number 
density,  N,  and  mass  density,  p  =  Nm ,  is: 


(12.8) 
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where  n  =  3.14159.  The  cloud’s  mass  density  is  given  by: 


3  M 
^  47 zR3 


Nm. 


(12.9) 


An  equivalent  criterion  for  cloud  collapse  is  that  the  cloud  radius,  R ,  is  less  than 
the  Jeans  radius,  Rj ,  given  by: 


Gm 
3 kT 


M. 


(12.10) 


Collapse  will  also  occur  if  the  cloud  mass  density,  p,  is  greater  than  the  Jeans 
density,  pj ,  given  by: 


M  _  3  pkT\3 

9  3  ~  ±nR)  ~  4nM2  \Gm ) 


(12.11) 


Giant  molecular  clouds  can  have  a  mass  greater  than  the  Jeans  mass,  a  mass 
density  greater  than  the  Jeans  density,  and  a  radius  less  than  the  Jeans  radius. 


Example:  Gravitational  collapse  of  a  giant  molecular  cloud 

A  giant  molecular  cloud  with  a  number  density  of  hydrogen  molecules  of 
Nh 2  =  1012  m~3,  a  radius  of  R  =  1  parsec  =  3.0856  x  1016  m,  and  a 
temperature  of  T  =  10  K,  has  a  Jeans  mass,  Mj ,  of: 

3CT 

Mj  =  —R ,  (12.12) 

Gm 

where  m  =  2 mH  =  2  x  1.67  x  10  “  kg  is  the  mass  of  the  hydrogen 
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molecule,  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  ,  and  the 
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gravitational  constant  G  =  6.674xl0  mkg  s  .  Substituting  these 

O  1 

numerical  values  into  the  equation  we  obtain  Mj  =  5.73  x  10  kg  for  the 
giant  molecular  cloud.  This  is  28.8  M0,  where  the  solar  mass 
Mq  =  1.989  x  1030  kg. 

The  mass,  Mc,  for  this  cloud  is: 


Mc 


4nR3p 

3 


8nmHNH2R 3 
3 


(12.13) 


where  the  mass  density  p  =  NH2  x  lmH.  For  the  giant  molecular  cloud 
under  consideration,  Mc  =  4.110  x  1035  kg  =  2.07  x  105  M0.  So  this 
giant  molecular  cloud  contains  a  mass  equivalent  to  200,000  stars  like  the 
Sun,  and  exceeds  the  Jeans  mass  by  a  factor  of  about  7,000,  definitely 
fulfilling  the  condition  for  gravitational  collapse. 


It  is  more  difficult  for  clouds  with  a  smaller  mass  than  the  Jeans  mass  to  form 
stars.  They  require  external  compression  to  begin  the  collapse. 
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12.2.3  Triggering  Gravitational  Collapse 

Why  haven’t  all  the  interstellar  gas  and  dust  drawn  together  to  make  stars?  In  most 
regions  of  interstellar  space,  the  temperatures  are  high  enough  and  the  mass 
densities  low  enough  for  a  long-lived  stable  equilibrium  between  outward  gas 
pressure  and  inward  gravitational  pull.  Take  a  typical  cloud  of  interstellar 
hydrogen,  an  H  I  region,  for  example.  At  a  temperature  of  about  100  K,  the 
hydrogen  atoms  have  a  typical  kinetic  energy  that  is  greater  than  the  gravitational 
potential  energy  of  the  region,  and  the  kinetic  energy  of  H  II  regions,  with  a 
temperature  of  10,000  K,  is  even  greater. 


Example:  The  equilibrium  of  H  I  and  H  II  regions 

The  number  density  of  un-ionized,  or  electrically  neutral,  hydrogen  atoms  in 
an  interstellar  H  I  region  is  NH  =  10  m  ,  the  temperature  is  T  =  100  K, 
and  the  radius  is  R  =  1  pc  =  3.0857  x  10  m,  the  typical  spacing  between 
adjacent  stars.  The  mass  density,  p,  of  the  region  is 
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p  =  NHmH  =  1.6739  x  10  ^  kg  m  ,  where  the  mass  of  the  hydrogen 
atom  is  mH  =  1.6739  x  10“27  kg.  The  mass  of  the  region  is 
M  =  4tiR3  p/3  ^  2.1  x  1030  kg  «  M0,  the  mass  of  the  Sun.  This  shows  that 
the  mass  of  neutral,  unionized  hydrogen  in  the  space  between  the  stars  is 
about  equal  to  the  mass  of  hydrogen  in  stars.  The  Jeans  mass,  My,  of  the 
region  is: 

3k T 

Mj  =  —R ,  12.14 

Gm 

where  m  =  mH  is  the  mass  of  the  hydrogen  atom,  the  Boltzmann  constant 
k  =  1.38065  x  10-23  J  K  \  and  the  gravitational  constant  G  =  6.674  x 
10  m  kg  s  .  Substituting  numerical  values  we  obtain  Mj  =  1.14x 
1033  kg  575  M0.  This  is  nearly  600  times  greater  than  the  mass  of  the 
region,  which  means  that  an  H  I  region  is  stable  against  gravitational 
collapse. 

A  typical  emission  nebulae,  a  region  of  ionized  hydrogen,  might  have  a 
proton  density  NP  =  109  m-3,  a  temperature  of  T  =  104  K,  and  radius  of 

1  7 

R  =  30  pc  =  9.257  x  101'  m.  The  mass  density  of  the  H  II  region  is 
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p  —  Npnip  =  1.6726  x  10  kg  m  ,  for  a  proton  mass  mP  =  1.6726  x 
10-27  kg,  and  the  mass  is  M  =  AnR? p/3  «  5.56  x  1036  kg  =  2.8  x 

/r 

10  M0.  This  indicates  that  this  H  II  region  contains  about  as  much  mass  as  a 
giant  molecular  cloud.  Moreover,  the  Jeans  mass  for  the  H  II  region  is 
Mj  =  3.43  x  1036  kg  «  1.7  x  106  M0,  which  is  so  close  to  the  H  II 
region’s  mass  that  it  seems  on  the  verge  of  collapse.  Many  of  the  central 
stars  of  H  II  regions,  or  emission  nebulae,  must  still  be  immersed  in  the 
material  from  which  they  formed. 
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The  mixture  of  gas  and  dust  also  is  stirred  into  motion  here  and  there  by 
gravitational  tugs,  the  radiation  pressure  of  hot  stars,  or  waves  expanding  from 
stellar  explosions.  These  movements  also  oppose  gravitational  collapse. 

Magnetic  fields  do  not  like  being  pushed  together  any  more  than  hot  particles 
do,  so  magnetic  pressure  can  also  help  an  interstellar  cloud  resist  gravity.  The 
interstellar  magnetic  field  can  generate  a  magnetic  pressure  that  is  comparable  to 
the  interstellar  gas  pressure  in  its  vicinity  and  magnetic  energy  can  support  a  gas 
cloud  against  gravity  (Focus  12.2), 


Focus  12.2  Magnetic  energy 

A  magnetic  field  of  strength  B  produces  a  magnetic  pressure,  PB ,  transverse 
to  the  direction  of  the  magnetic  field.  This  pressure  is  given  by: 

p‘= <i2i5> 

_ 'y  _ /:  _ o 

where  the  magnetic  constant  =  4n  x  KT7  =  1.2566  x  1(TD  N  A  . 
The  interstellar  magnetic  field  has  a  strength  of  B  «  10“ 10  tesla,  to  give 
PB  «  0.4  x  1(T14  Pa.  This  is  the  pressure  carried  by  the  magnetic  field 
(Focus  9.1,  Sect.  9.1). 

The  magnetic  pressure  is  a  magnetic  energy  density,  so  the  magnetic 
energy  of  a  spherical  volume  of  radius  R  will  by  4nR  PB/3.  If  a  gas  cloud  of 
mass,  M,  and  radius,  R ,  is  in  equilibrium  with  the  magnetic  pressure  alone, 

r\ 

the  magnetic  energy  is  equal  to  the  gravitational  potential  energy  GM^/R, 
and  the  magnetic  field  strength,  B ,  is  given  by: 


3h0GM2\  1/2 
2nR4  ) 


6.33  x  1CT9 


M 

R2 


tesla, 


(12.16) 
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where  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  .  If  the 
gas  pressure,  Pg,  supports  the  cloud  alone,  then  multiplying  the  gas  pressure 
(Sect.  5.4)  by  the  cloud  volume  and  setting  it  equal  to  the  gravitational 
potential  energy  gives: 


MkT  _  GM 2 
TTLh  R 


(12.17) 


or  the  temperature  required  for  the  support  is: 


Gnii  i  M 


(12.18) 


_ 0^7 

where  the  mass  of  the  hydrogen  atom  mH  =  1.67  x  10  ^  kg,  and  the 

_ no  _ 1 

Boltzmann  constant  k  =  1.38065  x  10  J  K  .  This  is,  within  a  factor  of 
2/3,  the  expression  used  to  calculate  the  temperature  at  the  center  of  the  Sun. 
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Example:  Supporting  interstellar  clouds  by  gas  pressure  and  magnetic 
pressure 

The  ratio  of  magnetic  pressure,  PB ,  to  gas  pressure,  Pg  =  NkT,  is 
PB/Pg  =  B2/(2fioNkT)  =  2.88  x  1028  B2 /(NT)  for  a  magnetic  field  strength 
B ,  a  gas  number  density  A,  and  a  gas  temperature  T,  where  the  permeability 
of  free  space  po  =  471  x  10-7  =  1.2566  x  10-6  N  A-2  and  the  Boltzmann 
constant  k  =  1.38065  x  1CT23  J  K_1.  For  interstellar  space  B  =  10“ 10 
tesla,  the  density  of  hydrogen  atoms  is  about  N  =  10  m  and  the 
temperature  of  interstellar  hydrogen  atoms  is  about  T  =  100  K.  With  these 
numbers  PB/Pg  ~  0.3,  or  the  interstellar  magnetic  pressure  is  roughly 
comparable  to  the  interstellar  gas  pressure. 


For  the  most  part,  interstellar  clouds  merely  swirl  through  space  -  too  hot, 
agitated,  and  magnetic  to  collapse  into  stars.  Compression  by  an  external  agent 
nevertheless  can  force  an  isolated  cloud  into  gravitational  collapse.  Occasionally, 
gas  clouds  collide  with  one  another,  generating  shock  waves  that  can  compress  the 
colliding  clouds,  initiating  their  gravitational  collapse. 

A  spectacular  type  of  external  compression  is  provided  by  a  nearby  exploding 
star,  or  supernova.  When  a  massive  star  exhausts  its  thermonuclear  fuel,  it  can 
explode  and  eject  a  spherical  shock  wave  that  expands  at  a  speed  of 
10,000  km  s"1.  The  wave  produced  by  the  detonation  of  a  nuclear  bomb  is 
analogous  to  the  shock  wave  of  a  supernova. 

As  proposed  by  the  Estonian  astronomer  Ernst  Opik  (1893-1985),  who  spent 
the  second  half  of  his  career  in  Ireland,  the  shocks  and  expanding  remnants  of  the 
explosion  can  trigger  the  collapse  of  a  normally  stable  interstellar  cloud  (Opik 
1953).  The  shock  wave  pushes  nearby  interstellar  gas  and  dust  together,  com¬ 
pressing  clumps  of  matter  to  sufficiently  high  density  for  gravitational  collapse  to 
ensue. 

The  solar  nebula  once  may  have  been  so  spread  out  that  its  weak  gravity  was 
not  sufficient  for  it  to  collapse  to  form  the  Sun  and  the  planets.  Instead,  the 
explosion  of  a  nearby  star  may  have  triggered  the  collapse.  Some  elements  found 
in  meteorites  recovered  on  the  Earth  are  apparently  the  decay  products  of  radio¬ 
active  elements  that  must  have  been  produced  in  such  a  stellar  explosion  no  more 
than  a  few  tens  of  millions  of  years  before  our  solar  system  formed.  Because  of  the 
rate  of  radioactive  decay,  if  these  elements  were  created  before  that  time,  they 
would  not  be  around  now.  Adams  (2010)  has  provided  a  review  of  the  birth 
environment  of  the  solar  system. 

Emission  from  hot,  massive,  young  stars  also  can  compress  nearby  gas  and  dust 
into  gravitational  collapse  and  the  formation  of  new  stars.  Associations  of  bright  O 
and  B  stars  that  were  formed  about  1  million  years  ago,  for  example,  now  are 
expanding  and  dispersing  into  space  (Ambartsumian  1949,  1955).  The  intense 
radiation  and  powerful  winds  associated  with  a  previous  and  nearby  generation  of 
O  and  B  stars  could  have  triggered  the  collapse  of  neighboring  material,  giving  rise 
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to  the  expanding  stellar  associations.  These  newer,  younger  stars  may  trigger  the 
formation  of  other  stars  in  the  future,  in  an  ongoing  process  of  sequential  star 
formation  (Elmegreen  and  Lada  1977). 

Thus,  stars  do  not  form  just  anywhere.  They  are  bom  either  in  cold,  dense 
molecular  clouds  or  in  proximity  to  exceptionally  massive  and  short-lived  stars  by 
various  mechanisms,  including  the  pressure  of  stellar  radiation,  winds,  and 
explosions. 


12.2.4  Proto  stars 

A  star  in  the  process  of  formation  is  commonly  called  a  protostar.  Such  an 
embryonic  star  shines  by  the  release  of  gravitational  energy  during  the  collapse  of 
interstellar  material,  but  it  has  not  yet  begun  to  shine  by  nuclear  fusion  in  its  core. 
Protostars  are  exceptionally  bright  at  infrared  wavelengths,  which  can  be  used  to 
detect  them  within  dark  clouds. 

Once  an  interstellar  cloud  becomes  sufficiently  dense,  by  either  external  com¬ 
pression  or  within  a  giant  molecular  cloud,  the  mutual  gravitational  attraction  of  its 
parts  will  overcome  the  gas  pressure  and  cause  this  cloud  to  start  collapsing. 
As  this  protostar  falls  inward,  it  gains  gravitational  potential  energy,  much  in  the 
way  a  waterfall  gains  energy  when  its  water  moves  toward  the  ground.  Some  of  the 
energy  of  the  protostar  is  converted  into  heat  as  the  gas  particles  fall  inward  and 
collide  with  one  another  (Fig.  12.4). 

Before  it  becomes  a  Sun-like  star,  the  entire  collapsing  protostar  is  a  great, 
churning  caldron  of  heated  gas,  with  energy  transported  and  released  by  convec¬ 
tion.  Eventually,  the  convection  zone  retreats  toward  the  outer  parts  of  the  pro¬ 
tostar,  and  radiation  plays  a  role  in  transporting  and  carrying  off  energy. 

As  the  gas  particles  fall  inward,  their  gravitational  potential  energy  is  converted 
into  the  kinetic  energy  of  motion,  and  particle  collisions  transform  the  kinetic 
energy  into  thermal  energy,  which  heats  the  gas.  As  long  as  the  thermal  energy  and 
heat  can  be  radiated  away,  the  protostar  stays  cool  and  the  pressure  remains  too 
weak  to  slow  the  gravitational  collapse.  However,  as  the  shrinking  cloud  becomes 
smaller  and  denser,  some  of  the  thermal  energy  cannot  escape  and  is  trapped  inside 
the  collapsing  cloud.  The  internal  temperature  and  pressure  will  then  rise  and  slow 
the  pace  of  contraction. 

Eventually,  the  core  temperature  becomes  high  enough  to  ignite  thermonuclear 
reactions,  which  heat  the  surrounding  protostar  material  and  completely  halt  the 
collapse.  The  protostar  then  arrives  on  the  main  sequence,  and  a  star  is  born. 

More  massive  protostars  possess  more  gravitational  potential  energy  and  can 
thus  collapse  faster.  They  also  dissipate  the  gravitational  energy  at  a  relatively 
faster  rate  during  contraction  because  of  their  larger  luminosity  and  greater 
radiation. 

Studies  of  the  youngest  star  clusters  show  that  their  hottest,  most  massive  O  and 
B  stars  have  arrived  on  the  main  sequence  of  the  Hertzsprung-Russell  diagram, 
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Gravitational  Collapse 


cloud  of  gas  and  dust  (left)  compresses  the  cloud  and  heats  it  (right).  When  the  cloud  shrinks, 
gravitational  potential  energy  is  converted  into  heat  as  the  gas  particles  fall  inward  and  collide 
with  one  another.  This  also  produces  radiation  that  can  carry  off  some  of  the  energy.  The 
velocities  of  the  gas  atoms  are  denoted  by  arrows  that  point  in  the  direction  of  atomic  motion  and 
have  lengths  that  increase  with  the  speed  of  motion.  Higher  speeds  occur  in  the  compressed 
cloud,  where  the  gas  atoms  move  faster  and  in  all  directions 


while  their  cooler,  less  massive  stars  have  not  reached  it.  The  latter  stars,  of  late 
spectral  type,  must  still  be  undergoing  gravitational  contraction  from  the  sur¬ 
rounding  pre- stellar  medium  and  have  not  yet  had  sufficient  time  to  become  hot 
enough  to  ignite  thermonuclear  reactions. 

When  the  observations  of  young  stellar  clusters  are  combined  with  the  theo¬ 
retical  studies  of  the  Japanese  astrophysicist  Chushiro  Hayashi  (1920-2010),  the 
pre-main  sequence  evolution  of  protostars  of  different  masses  can  be  deciphered 
(Hayashi  1961,  1966).  As  illustrated  in  Fig.  12.5,  a  protostar’s  track  in  the 
Hertzsprung-Russell  (H-R)  diagram  initially  moves  straight  down  and  subse¬ 
quently  turns  to  the  left  and  continue  that  way  until  the  protostar  arrives  on  the 
main  sequence.  These  paths  in  the  H-R  diagram  have  been  successfully  compared 
with  observations  of  young  star  clusters  (Walker  1956). 

Upon  arriving  at  the  main  sequence,  the  outward  pressure  of  star’s  hot  gas, 
which  is  now  heated  by  nuclear  fusion  reactions,  prevents  the  star  from  collapsing 
further.  It  has  settled  down  for  a  long  rather  uneventful  life  as  a  main  sequence 
star,  the  longest  stop  in  its  life  history. 

How  long  does  it  take  for  a  collapsing  protostar  to  become  a  star?  One  estimate 
for  the  time-scale  on  which  clouds  collapse  is  the  free  fall  time.  This  is  the  time  it 
would  take  a  cloud  to  undergo  gravitational  collapse  from  its  original  shape  to  a 
single  point,  neglecting  gas  pressure  that  counteracts  this  force.  The  free  fall 
time,  Tff,  for  unopposed  gravitational  collapse  is  given  by: 
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32  Gp 
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Fig.  12.5  Protostars  on  the  Hertzsprung-Russell  diagram  Evolutionary  tracks  of  protostars  of 
various  masses  in  the  H-R  diagram,  ending  with  their  arrival  on  the  main  sequence  when  stars 
have  begun  burning  hydrogen  in  their  cores.  The  absolute  luminosity,  L,  is  given  in  units  of  the 
Sun’s  absolute  luminosity,  denoted  L0.  The  star  mass  is  given  in  units  of  the  Sun’s  mass, 
designated  M0.  The  mass  values  are  specified  along  the  main  sequence,  from  upper  left  to  lower 
right.  High  mass  stars,  which  have  greater  luminosity  than  low  mass  stars,  are  found  at  higher 
points  on  the  main  sequence  and  take  a  shorter  time  to  arrive  there.  The  protostar  lifetimes  are 
given  above  the  relevant  track.  During  star  formation,  transport  of  a  protostar’ s  internal  energy  is 
dominated  by  either  radiation  ( horizontal  lines )  or  by  convection  ( vertical  lines).  Stars  with  lower 
mass  ultimately  have  larger  interior  convective  zones.  (From  “The  Life  and  Death  of  Stars”  by 
Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with  permission.) 
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where  n  =  3.14159,  the  gravitational  constant  G  =  6.674  x  10  m  kg  s 
and  p  is  the  mass  density  within  the  cloud  at  the  time  collapse  began. 


Example:  Free  fall  time  for  a  giant  molecular  cloud  and  for  the  Sun 

A  giant  molecular  cloud  can  have  a  hydrogen  molecule  number  density  of 

1  O  _ Q 

N  =  10  ^  m  ,  and  the  mass,  m,  of  each  hydrogen  molecule  is  twice  the 
mass  of  a  hydrogen  atom,  with  m  =  2 mH  k,  3.348  x  10  kg.  The  mass 

_ i  _ o 

density  of  the  giant  molecular  cloud  is  p  =  Nm  k,  3.348  x  10  kg  m 
and  the  free  fall  time  is 


1.15  x  1012s  «  36,000years, 


Tff  = 


3n 


(12.20) 
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where  the  Newtonian  constant  of  gravitation  G  =  6.674  x  10  m  kg  s  , 
and  1  year  =  3.15576  x  107  s. 

A  single  cloud  does  not  collapse  to  a  point,  with  a  radius  of  zero.  As  the 
cloud  collapses,  the  density  will  increase  and  the  Jeans  mass  of  each  piece  of 
the  cloud  will  decrease.  These  pieces  will  start  to  collapse  on  their  own.  So 
the  cloud  fragments  into  small,  dense  parts,  which  collapse  faster  than  the 
overall  cloud. 

If  an  external  force  triggered  the  collapse  of  an  interstellar  cloud  of 
temperature  T  =  100  K,  to  form  the  Sun  of  mass  M0  =  1.989  x  1030  kg, 
collapse  would  start  when  the  density  exceeded  the  Jeans  mass  density. 

«  3.1  x  10“12kg  nT3,  (12.21) 

_ OQ  _  1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  Jr1.  This  mass 
density  can  be  used  to  obtain  a  free  fall  time  «  3.8  x 
1010  s  «  1,200  years. 


3  /  3  kT 

AtzMq  \Gnih 


For  the  Sun,  the  unopposed  free  fall  collapse  time  may  have  been  about 
1,200  years,  where  1  year  =  3. 157  x  10  s,  but  there  are  forces  that  opposed  the 
collapse  as  it  took  place,  making  it  take  about  1  million  years  for  the  Sun  to 
undergo  gravitational  collapse  to  the  main  sequence.  This  is  much  shorter  than  the 
thermonuclear  lifetime  of  about  10  billion  years  once  the  Sun  began  to  shine  by 
hydrogen  burning  in  its  core.  When  compared  to  the  Sun,  more  massive  stars  have 
shorter  free  fall  times  to  the  main  sequence  and  shorter  thermonuclear  life  times  on 
the  main  sequence  as  well. 


12.2.5  Losing  Mass  and  Spin 

The  early  stages  of  star  life  can  be  the  most  active.  For  example,  young  stars  can 
have  strong  stellar  winds  that  drive  away  protostellar  material  that  may  still 
envelop  them.  Even  now,  billions  of  years  after  its  birth,  the  Sun  generates  a  solar 
wind  that  removes  about  10“ 14  solar  masses  every  year;  we  suspect  that  its  winds 
were  more  violent  in  its  youth,  clearing  out  the  solar  nebula  when  the  planets  were 
formed. 

Hot,  exceptionally  luminous  stars  can  generate  immensely  powerful  winds  with 
greater  mass  loss  than  the  current  solar  wind.  Mass  loss  rates  of  up  to  1/ 10,000th 
or  10-4,  solar  masses  per  year  have  been  observed;  at  this  rate,  a  star  of  say  10 
times  the  mass  of  the  Sun  would  blow  itself  away  in  just  100,000  years. 

Stellar  winds  are  not  confined  to  youth  alone,  for  stars  of  advancing  age  also 
can  produce  strong  winds.  When  a  star  of  relatively  low  mass  evolves  into  a  red 
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giant,  the  gravitational  attraction  at  its  inflated  outer  layers  becomes  much  smaller 
than  that  during  the  star’s  former  life  on  the  main  sequence.  This  reduces  the  star’s 
ability  to  hold  onto  its  outer  atmosphere  and  increases  the  likelihood  that  some  of 
it  will  escape.  As  a  result,  red  giant  stars  also  have  strong  winds,  sometimes  losing 
a  significant  fraction  of  their  mass  during  this  stage  of  stellar  evolution. 

Younger  stars  also  rotate  faster  than  older  stars  (Sect.  4.3).  The  rotation  speed 
can  be  measured  from  the  Doppler  broadening  of  a  star’s  spectral  lines.  Rapidly 
rotating  stars  have  broad  lines;  slowly  rotating  stars  have  sharp  and  narrow  lines. 
Observations  of  this  line-broadening  indicate  that  the  rotation  speed  of  main- 
sequence  stars  decreases  from  left  to  right  on  the  H-R  diagram,  from  luminous, 
young  stars  to  fainter,  older  ones.  Bright  stars  of  spectral  class  O  and  B  rotate  at 
speeds  of  more  than  100  km  s-1  (Shajn  and  Struve  1929).  The  B3  type  main 
sequence  star  Acherner  rotates  so  rapidly,  at  more  than  225  km  s-1,  that  its 
equatorial  diameter  is  56  %  greater  than  its  polar  diameter.  The  rapid  spin  of  the 
A7  type  star  Altair,  at  about  240  km  s_1,  has  similarly  produced  an  oblate,  non- 
spherical  stellar  shape. 


Example:  How  fast  do  stars  spin? 

The  Sun  has  an  equatorial  rotation  period  of  PQ  =  25.67  days  = 
2.218  x  106s,  where  1  day  =  86,400  s.  At  the  Sun’s  equator  the  rotation 
velocity  is  V0  =  2tiRq/Pq  ~  1.971  x  103ms_1  =  1.97kms_1,  where  the 
Sun’s  radius  RQ  =  6.955  x  108  m.  This  is  hundreds  of  times  slower  than  the 
fastest  rotating  main- sequence  stars  and  the  expected  initial  rotation  period 
of  the  newly  formed  Sun.  The  Sun  will  eventually  use  up  its  nuclear  fuel  and 
collapse  to  a  white  dwarf  star  with  a  radius  comparable  to  that  of  the  Earth 

/r 

with  Re  =  6.378  x  10  m.  Since  angular  momentum  is  conserved  in  grav¬ 
itational  collapse,  the  white  dwarf  rotation  period  will  be 

Pwd  =  Pq(Re/Rq)  ~  186  s.  Its  rotation  velocity  will  be  VWD  =  2tzRe/ 
PWD  «  2.15  x  105  m  s”1  «  215  km  s-1,  which  is  comparable  to  the 
rotation  velocity  the  Sun  might  have  had  when  it  initially  formed  about  4.6 
billion  years  ago. 


Main-sequence  stars  of  later  spectral  class  rotate  at  significantly  slower  speeds. 
Stars  of  spectral  class  F5  have  slowed  to  rotation  speeds  of  about  30  km  s-1,  while 
the  Sun,  at  spectral  class  G2,  spins  at  a  leisurely  speed  of  about  2  km  s-1.  The 
older,  late-type  stars  may  rotate  more  slowly  than  early  type  stars  of  relatively 
young  age  because  of  the  magnetic  braking,  which  astronomers  believe  accounts 
for  the  Sun’s  unexpectedly  slow  rotation.  The  magnetic  field  that  is  embedded  in  a 
newly  formed  star  and  the  surrounding  nebula  will  act  like  a  network  of  elastic 
cords  that  tie  the  star  and  distant  regions  together.  The  inner  regions  will  be 
moving  faster  than  the  outer  ones,  and  the  magnetic  field  will  transport  some  of  the 
rotation  from  the  star  outward.  Strong  stellar  winds  of  the  young  stars  may  also 
play  a  role  in  removing  angular  momentum  and  spin  from  the  stars. 
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12.3  Planet-Forming  Disks  and  Planets 
Around  Nearby  Stars 

12.3.1  The  Plurality  of  Worlds 

Just  as  the  ancient  Greeks  imagined  that  all  matter  consists  of  atoms ,  so  they  also 
believed  that  there  were  many  planets  like  ours  in  the  universe,  created  by  the 
coalescence  of  atoms.  In  the  second  century  BC,  the  Greek  philosopher  Epicurus 
of  Samos  (276-194  BC)  proposed  that  the  chance  conglomerations  of  innumerable 
atoms,  in  an  infinite  universe,  should  result  in  the  formation  of  a  multitude  of 
unseen  Earth-like  worlds. 

Then,  the  Roman  poet  Lucretius  (99-55  BC)  wrote  about  the  plurality  of 
worlds,  declaring  that  innumerable  particle  seeds  are  rushing  on  countless  courses 
through  an  unfathomable  universe,  making  it  highly  unlikely  that  our  Earth  is  the 
only  planet  to  have  been  created  and  that  all  of  those  other  particles  are  not 
accomplishing  anything  (Lucretius  55  BC). 

The  belief  in  unseen  worlds  -  some  possibly  inhabited  -  that  are  in  orbit  around 
stars  other  than  the  Sun,  dates  at  least  as  far  back  as  the  late  sixteenth  century,  to 
the  Italian  philosopher  and  priest  Giordano  Bruno  (1548-1600).  He  reasoned  that 
other  planets  would  remain  invisible  to  us  because  they  are  small  and  dim  and 
would  be  hidden  in  the  glare  of  their  host  star  (Bruno  1584). 

During  the  nineteenth  and  twentieth  centuries  astronomers  used  telescopes  to 
explore  the  distant  reaches  of  the  Milky  Way,  showing  that  it  contains  about  100 
billion  stars.  More  recently,  hundreds  of  planetary  worlds,  which  were  once  only 
imagined,  have  been  observed  orbiting  nearby  stars. 


12.3.2  Proto-Planetary  Disks 

Planetary  systems  probably  formed  around  many  stars  as  a  result  of  the  gravita¬ 
tional  collapse  of  an  interstellar  cloud  of  gas  and  dust  that  created  the  stars,  all  in 
accordance  with  the  nebular  hypothesis  of  the  origin  of  the  solar  system.  The 
collapsing  cloud  would  rotate  faster  and  faster,  giving  spin  to  the  material  that  then 
flattened  into  a  planet- forming  disk  with  a  star  at  the  center.  Because  rotation 
imparts  motion  to  the  colliding  material  in  the  direction  of  spin,  the  random  gas 
motions  of  the  original  cloud  are  changed  into  a  rotating  disk.  The  centrifugal 
force  of  the  rotation  prevents  gas  and  dust  from  raining  directly  onto  the  central 
star,  instead  making  it  settle  into  a  rotating  disk  from  which  planets  can  form.  The 
direction  in  which  the  disk  is  spinning  coincides  with  the  direction  of  the  new 
star’s  rotation  as  well  as  the  direction  of  the  orbits  of  any  planets  that  may  be 
formed  in  the  disk. 

Astronomers  have  discovered  flattened,  rotating  disks  of  gas  and  dust  around 
nearby  stars.  The  first  evidence  for  these  planet-forming  disks  was  obtained  in  the 
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Table  12.2  Stars  with  an  excess  of  infrared  radiation  detected  from  the  IRAS  satellitea 


Star 

Luminosity  (L0) 

Spectral  type  (V) 

Mass  (Mq) 

Distance  (light-years) 

Vega 

37 

A0 

2.1 

25.3 

Fomalhaut 

18 

A3 

2.1 

25.13 

Epsilon  Eridani 

0.34 

K2 

0.82 

10.49 

Beta  Pictoris 

8.7 

A6 

1.75 

63.4 

a  The  luminosity  is  in  solar  units  of  L0  =  3.828  x  1026  J  s  1  and  the  mass  is  in  units  of  the  Sun’s 
mass  Mq  =  1.989  x  1030  kg 


early  1980s  with  instruments  aboard  the  InfraRed  Astronomical  Satellite  (IRAS), 
using  technology  pioneered  by  the  military  to  detect  the  infrared  heat  of  the 
enemy. 

The  IRAS  instruments  detected  excess  infrared  radiation  from  four  nearby  stars, 
beyond  what  would  be  expected  from  the  star  alone  (Aumann  1985,  Table  12.2). 
This  implied  the  presence  of  a  circumstellar  disk  of  cool  dust  in  orbit  around  the 
star,  which  would  radiate  at  infrared  wavelengths  and  produce  the  excess.  The 
hotter  stars  would  shine  brightly  in  optically  visible  light  and  emit  relatively  little 
infrared. 

The  Spitzer  Space  Telescope  recently  used  its  powerful  infrared  vision  to  detect 
hundreds  of  stars  with  excess  infrared  radiation,  suggesting  that  they  harbor  planet¬ 
forming  disks.  In  fact,  the  youngest  nearby  stars  usually  are  found  embedded  in  the 
dense  clouds  of  the  interstellar  gas  and  dust  that  spawned  them.  Zuckerman  (2001) 
has  reviewed  the  available  knowledge  of  dusty  circumstellar  disks. 

The  closest  disk  system  to  our  own,  surrounding  the  star  Epsilon  Eridani, 
contains  two  infrared- emitting  belts:  the  first,  at  approximately  the  same  position 
as  the  asteroid  belt  in  our  solar  system;  and  the  second,  denser  belt  between  the 
first  one  and  a  more  remote  ring  similar  to  our  own  Kuiper  belt. 

Instruments  aboard  the  Hubble  Space  Telescope  (HST)  discovered  flattened 
disks  of  dust  swirling  around  at  least  half  of  the  young  stars  in  the  Orion  nebula, 
that  are  shining  in  reflected  visible  light.  The  high-resolution  and  sensitivity  of  the 
HST  also  have  been  used  to  obtain  detailed  images  of  dusty,  planet-forming  disks 
surrounding  Sun-like  stars,  providing  insights  to  the  beginnings  of  our  solar  system 
(Fig.  12.6).  The  flattened,  rotating  disks  suggest  that  the  nebular  hypothesis 
applies  to  them,  and  material  in  the  disks  is  expected  to  coalesce  into  full-blown 
planets  if  it  has  not  done  so  already. 

In  the  meantime,  the  circumstellar  disk  around  one  of  the  IRAS  stars,  Beta 
Pictoris,  became  the  first  to  be  imaged  at  visible  wavelengths  by  using  an  occulting 
disk  to  block  the  star’s  bright  light  (Smith  and  Terrile  1984).  Detailed  observations 
of  the  disk  were  obtained  more  than  a  decade  later  with  the  HST  and  the  Keck 
telescope  (Golimowski  et  al.  2006).  Eventually,  the  ground-based  Very  Large 
Telescope  (VLT),  located  in  Chile,  was  used  with  adaptive  optics  to  show  that  a 
Jupiter-sized  world  is  moving  around  the  star  (Fig.  12.7,  Bonnefoy  et  al.  2011). 
This  giant  planet,  which  has  been  called  Beta  Pictoris  b,  is  located  from  its  host 
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Fig.  12.6  Dusty  disks  around  Sun-like  stars  Instruments  aboard  the  Hubble  Space  Telescope 
have  obtained  these  images  of  the  visible  starlight  reflected  from  thick  disks  of  dust  around  two 
young  stars  that  still  may  be  in  the  process  of  forming  planets.  Viewed  nearly  face-on,  the  debris 
disk  surrounding  the  Sun-like  star  known  as  HD  107146  {right)  has  an  empty  center  large  enough 
to  contain  the  orbits  of  the  planets  in  our  solar  system.  Seen  edge-on,  the  dust  disk  around  the 
reddish  dwarf  star  known  as  AU  Microscopii  (left)  has  a  similar  cleared-out  space  in  the  middle. 
HD  107146  is  88  light-years  away  and  is  thought  to  be  between  50  and  250  million  years  old, 
whereas  AU  Microscopii  is  located  32  light-years  away  and  is  estimated  to  be  just  12  million 
years  old.  [Courtesy  of  NASA/ESA/STScI/JPL/David  Ardila  -  JHU  {right),  and  John  Krist  - 
STScI/JPL  {left).] 

star  Beta  Pictoris  at  a  distance  between  9  and  15  times  the  Earth-Sun  distance  of 
1  AU  or  at  about  the  same  distance  as  Saturn  from  the  Sun  at  9.539  AU. 

Circumstellar  dust  around  another  IRAS  star,  Fomalhaut,  also  has  been  imaged 
with  the  HST.  The  sharp  inner  edge  of  the  dust  ring  suggests  that  a  planet  was 
clearing  out  the  material  inside  the  ring.  The  HST  detected  the  light  of  a  Jupiter- 
size  world  orbiting  Fomalhaut  in  the  expected  place,  which  is  an  enormous  115 
AU  from  the  star  (Kalas  et  al.  2005,  2008).  The  fantastic  images  of  Beta  Pictoris 
and  Fomalhaut  confirmed  that  infrared-emitting  circumstellar  disks  are  indeed 
signposts  of  planet  formation,  but  they  were  obtained  more  than  a  decade  after  the 
even  more  astounding  detection  of  the  first  planets  orbiting  a  Sun-like  star.  These 
were  also  Jupiter-sized  worlds  but  they  were  orbiting  unexpectedly  close  to  their 
host  star. 

Williams  and  Cieza  (2011)  discussed  protoplanetary  disks  and  their  evolution; 
Kley  and  Nelson  (2012)  reviewed  planet-disk  interactions  and  orbital  migration; 
and  Armitage  (2011)  described  the  dynamics  of  protoplanetary  disks.  Dullemand 
and  Monnier  (2010)  reviewed  the  inner  regions  of  protoplanetary  disks,  and  Blum 
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Fig.  12.7  Exoplanet  on  the  move  An  exoplanet’s  orbital  motion,  denoted  by  the  central  white 
elliptical  line,  was  imaged  from  an  adaptive  optics  instrument  attached  to  the  Very  Large 
Telescope  ( VLT)  in  Chile.  The  small  white  spot  at  the  center  shows  the  location  of  the  host  star, 
Beta  Pictoris.  Observations  in  2003  are  located  at  the  left  side  of  the  planet’s  orbital  ellipse  and 
those  in  2009  are  on  the  right  side.  The  larger  dust  disk  surrounding  the  host  star  also  is  shown  by 
the  large  flattened  blue  image  at  the  left  and  the  right.  (Courtesy  of  ESO/A.M.  Lagrange.) 


and  Wurm  (2008)  have  discussed  the  growth  mechanisms  of  macroscopic  bodies 
in  protoplanetary  disks. 


12.3.3  The  First  Discoveries  of  Exoplanets 

Individual  planets  shine  by  reflecting  light  that  is  much  fainter  than  the  light  of  the 
star  that  illuminates  them.  The  visible  light  reflected  by  Jupiter,  for  example,  is 
about  1  billion  or  109  times  dimmer  than  the  light  emitted  by  the  Sun,  and  that 
which  is  reflected  by  the  Earth  is  10  billion  times  fainter.  As  a  result,  planets  are 
almost  always  too  small  and  too  faint  to  be  seen  directly  in  the  luminous  radiation 
of  their  nearby  star.  Their  presence  only  recently  has  been  inferred  from  their 
miniscule  gravitational  effects  on  the  motions  of  the  star  around  which  they 
revolve  or  when  they  chance  to  pass  in  front  of  a  star,  momentarily  blocking  the 
star’s  light  when  viewed  from  the  Earth.  Such  extrasolar  planets  that  orbit  around 
stars  other  than  the  Sun  are  called  exoplanets. 

The  presence  of  an  unseen  planet  orbiting  a  normal  star  like  the  Sun  was  first 
deduced  by  recording  the  way  its  gravity  pulls  at  the  star  it  orbits.  The  planet  and 
star  orbit  a  common  center  of  mass  where  their  gravitational  forces  are  equal.  This 
fulcrum  is  closest  to  the  massive  star  in  the  stellar  case.  So  the  star  moves  in  a 
much  smaller  circle,  a  miniature  version  of  the  planet’s  larger  path.  The  more 
massive  the  planet  and  the  closer  it  is  to  the  star,  the  stronger  the  planet’s  grav¬ 
itational  pull  on  the  star  and  the  more  the  planet  perturbs  it. 
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Fig.  12.8  Starlight  shift  reveals  invisible  planet  An  unseen  planet  exerts  a  gravitational  force 
on  its  visible  host  star.  This  force  tugs  the  star  in  a  circular  or  oval  path,  which  mirrors  in 
miniature  the  planet’s  orbit.  As  the  star  moves  along  this  path,  it  approaches  and  recedes  from 
Earth,  changing  the  wavelength  of  the  starlight  seen  from  the  Earth  through  the  Doppler  effect. 
When  the  planet  pulls  the  star  toward  us,  its  light  waves  pile  up  slightly  in  front  of  it,  shortening 
or  “blueshifting”  the  wavelength  that  we  detect.  When  the  planet  pulls  the  star  away  from  us,  we 
detect  light  waves  that  are  stretched,  or  “redshifted.”  During  successive  planet  orbits,  the  star’s 
spectral  lines  are  periodically  shortened  and  lengthened,  revealing  the  presence  of  the  planet 
orbiting  the  star,  even  though  we  cannot  see  the  planet  directly 

To  detect  this  tumbling  motion,  astronomers  had  to  look  for  the  subtle  com¬ 
pressing  and  stretching  of  starlight  as  an  unseen  planet  tugged  on  a  star,  pulling  it 
first  toward  and  then  away  from  the  Earth,  causing  a  periodic  shift  of  the  stellar 
radiation  to  shorter  and  then  longer  wavelengths  (Fig.  12.8).  To  measure  the  effect, 
astronomers  must  observe  the  wavelength  of  a  well-known  spectral  feature,  called 
a  line ,  and  measure  the  Doppler  shift  of  its  wavelength. 

However,  an  orbiting  planet  produces  an  exceedingly  small  variation  in  the 
wavelength  of  spectral  lines  emitted  from  its  star.  Massive  Jupiter,  for  example, 
makes  the  Sun  wobble  at  a  speed  of  only  about  12  m  s-1.  To  detect  the  Doppler 
effect  of  a  star  moving  periodically  with  this  speed,  astronomers  would  have  to 
measure  the  wavelengths  with  an  unheard  accuracy  of  at  least  1  part  in  30  million 
and  use  a  computer  to  search  for  a  periodic  back-and-forth  wavelength  change. 

Therefore,  the  effect  could  not  be  detected  until  sensitive  spectrographs  were 
constructed  to  precisely  spread  out  the  light  rays.  The  enhanced  light-collecting 
powers  of  electronic  CCDs  were  then  used  to  record  the  dispersed  starlight. 
Because  no  single  line  shift  is  significant  enough  to  be  seen,  computer  software 
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Time  (number  of  4.23-day  periods) 


Fig.  12.9  Unseen  planet  orbits  the  star  51  Pegasi  Discovery  data  for  the  first  planet  found 
orbiting  a  normal  star  other  than  the  Sun.  The  giant,  unseen  planet  is  revolving  around  the  solar- 
type  star  51  Pegasi,  located  50  light-years  away  from  the  Earth.  The  radial  velocity  of  the  star,  in 
units  of  meters  per  second,  designated  m  s-1,  was  measured  from  the  Doppler  shift  of  the  star’s 
spectral  lines.  The  velocity  exhibits  a  sinusoidal  variation  with  a  4.23  day  period,  caused  by  the 
invisible  planetary  companion  that  orbits  51  Pegasi  in  this  period.  The  observational  data  {solid 
dots )  are  fit  with  the  solid  line,  whose  amplitude  implies  that  the  mass  of  the  companion  is 
roughly  0.46  times  the  mass  of  Jupiter.  The  4.23  day  period  indicates  that  the  unseen  planet  is 
orbiting  51  Pegasi  at  a  distance  of  0.05  AU,  where  1  AU  is  the  mean  distance  between  the  Earth 
and  the  Sun.  [Adapted  from  Mayor  and  Queloz  A  Jupiter-mass  companion  to  a  solar-type  star. 
Nature  378:355-359  1995).] 


had  to  be  written  to  add  up  all  of  the  star’s  spectral  lines,  which  shift  together,  and 
to  combine  them  repeatedly  at  all  possible  regularities,  or  orbital  periods,  and  with 
continued  comparison  to  nonmoving  laboratory  spectral  lines. 

It  took  decades  for  astronomers  to  develop  these  complex  and  precise  instru¬ 
ments.  Then,  in  the  1990s,  two  Swiss  astronomers  from  the  Geneva  Observatory  in 
Switzerland,  Michel  Mayor  (1947-  )  and  Didier  Queloz  (1966-  ),  discovered  the 
first  planet  that  orbits  an  ordinary  star:  the  faintly  visible,  Sun-like  star  5 1  Pegasi, 
only  48  light-years  away  from  the  Earth  (Mayor  and  Queloz  1995). 

They  had  detected  the  back- and- forth  Doppler  shift  of  the  star’s  light  with  a 
regular  4.23  day  period,  measured  by  a  periodic  change  of  the  star’s  radial  velocity 
of  up  to  50  m  s_1  (Fig.  12.9).  To  produce  such  a  quick  and  relatively  pronounced 
wobble,  the  newfound  planet  must  be  large,  with  a  mass  comparable  to  that  of 
Jupiter  -  which  is  3 1 8  times  heftier  than  the  Earth  -  and  it  was  moving  in  a  tight 
close  orbit  around  51  Pegasi,  at  a  distance  of  only  0.05  AU  (Focus  12.3). 
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Focus  12.3  Determining  the  mass  and  orbital  distance  of  an  exoplanet 

Planet  hunters  record  the  spectral  lines  of  a  nearby  star,  and  look  for  periodic 
variations  in  the  line-of-sight  velocities,  Vobs,  detected  from  the  measured 
Doppler  shifts  of  the  lines.  Because  the  orbital  plane  is  normally  inclined  to 
the  line  of  sight,  the  true  orbital  velocity,  V,  is  related  to  the  observed 
velocity  Vobs  by: 


yobs  =  V  sin  i,  (12.22) 

where  i  is  the  inclination  angle  between  the  perpendicular  to  the  orbital  plane 
and  the  line  of  sight. 

The  period,  P,  of  the  velocity  variations  is  given  by  Kepler’s  third  law: 


47T2<33 

“  G(Mi  +  M2)’ 


(12.23) 


where  Mx  and  M2  respectively  denote  the  mass  of  the  star  and  its  planet,  their 
separation  is  a ,  and  the  Newtonian  gravitational  constant 

_ ii  a  _ i  _ o 

G  =  6.674  x  10  m  kg  s  .  If  rx  and  r2  denote  their  respective  dis¬ 
tances  from  a  common  center  of  mass,  and  we  assume  circular  orbits,  then 
r{Mi  =  r2M2  with  a  =  rx  +  r2  =  rx  (M{  +  M2)/M2.  Since  the  orbital 
velocity  V  =  2nri/P  =  Vobs/sin  i,  we  obtain: 


PV Obs 
2n  sin  i 


Mi  +M2 

m2 


(12.24) 


Substituting  this  expression  into  Kepler’s  third  law  gives: 

py3 

sin3  /  = (M!  +  M2)  ,  (12.25) 

2nG 

and  since  the  mass  of  the  star  will  greatly  exceed  the  mass  of  the  planet,  or 
M\  M2, 


M2  sin  i 


2nG 


\  1/3 

)  VobsMf/3. 


(12.26) 


For  the  first  exoplanet  to  be  discovered,  we  have  P  =  4.23  days  = 
3.655  x  105  s,  for  1  day  =  86,400  s,  and  Vobs  =  50  m  s”1.  Under  the 
assumption  that  sin  i  =  1  and  the  star’ s  mass  is  comparable  to  the  Sun,  with 
M\  «  Mq  =  1.989  x  1030  kg,  we  obtain  a  planet  mass  of 
M2  =  7.55  X  10“  kg,  which  is  comparable  to  the  mass  of  Jupiter 
Mj  =  1.90  x  10"  kg.  But  the  exoplanet  is  nowhere  near  as  far  away  from 
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its  star  as  Jupiter  is  from  the  Sun.  The  separation,  a ,  of  the  newfound 
exoplanet  from  its  star  is  given  by: 


a  =  n  +  r2  =  n 

«  0.0512  AU, 


r\M\ 


Mi  PV obs 
M2  27isin  i 


7.66  x  109  m 


(12.27) 


for  1  AU  =  1.496  x  1011  m,  the  mean  distance  from  the  Earth  and  the  Sun. 
So  the  new-planet  is  about  one  hundredth  of  Jupiter’s  distance  from  the  Sun, 
which  is  7.78  x  1011  m  =  5.2  AU.  The  exoplanet  is  even  closer  to  the  star 
than  Mercury  is  from  the  Sun,  at  5.79  x  1010  m  or  0.387  AU.  Jupiter  orbits 
the  Sun  once  every  11.86  years,  Mercury  has  an  orbital  period  of  about  88 
Earth  days;  the  exoplanet  orbits  its  star  once  every  4.23  Earth  days. 


Planets  that  are  closer  to  a  star  move  around  it  with  greater  speed  and  take  less 
time  to  complete  an  orbit,  in  accordance  with  Kepler’s  third  law.  Thus,  the  Earth 
takes  a  year,  or  365  days,  to  travel  once  around  the  Sun  at  a  mean  distance  of 
1  AU,  whereas  Mercury,  the  closest  planet  to  the  Sun,  orbits  our  star  in  a  period  of 
88  days  at  0.387  AU.  A  short  orbital  period  of  only  4.23  days  meant  that  the 
newfound  planet  was  located  at  a  distance  of  only  0.05  AU  from  its  parent  star,  or 
about  one-eighth  the  distance  between  Mercury  and  the  Sun.  Thus,  a  completely 
unanticipated  planet  had  been  found,  rivaling  Jupiter  in  size  and  revolving  around 
51  Pegasi  in  an  orbit  smaller  than  Mercury’s. 

No  one  anticipated  that  a  giant  planet  would  orbit  so  close  to  its  star.  The 
intense  radiation  and  powerful  winds  of  the  newly  formed  star  were  expected  to 
keep  any  hydrogen  from  gathering  together  into  a  planet,  explaining  why  Jupiter 
and  the  other  giant  planets  were  formed  far  from  the  Sun  in  the  cold,  outer 
precincts  of  our  solar  system.  However,  this  was  good  for  planet  hunters,  for  the 
large  mass  of  a  giant  world  would  produce  a  more  pronounced  velocity  change 
than  the  smaller  mass  of  an  Earth-sized  world,  and  the  close  orbit  meant  a  short 
orbital  period  that  might  be  detected  in  weeks  instead  of  years. 

Fewer  than  2  weeks  after  the  announcement  of  a  giant  planet  circling  5 1  Pegasi, 
two  American  astronomers,  Geoffrey  W.  Marcy  (1955-  )  and  R.  Paul  Butler 
(1962-  ),  used  their  own  past  observations  to  confirm  the  result.  Once  they  knew 
that  giant  planets  could  revolve  unexpectedly  near  a  star,  with  short  orbital  peri¬ 
ods,  they  used  powerful  computers  to  reexamine  their  observations  of  other  nearby 
stars  accumulated  during  previous  years.  They  subsequently  announced  the 
discovery  of  two  more  Jupiter-sized  companions  of  Sun-like  stars  (Marcy  and 
Butler  1996). 
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12.3.4  Hundreds  of  New  Worlds  Circling  Nearby  Stars 

After  scientists  realized  that  a  large  planet  could  be  so  near  to  its  star,  they  knew 
where  and  how  to  look.  By  monitoring  thousands  of  nearby  Sun-like  stars  for 
years,  American  and  European  teams  found  hundreds  of  planets  revolving  about 
other  nearby  stars,  most  of  them  massive  Jupiter- sized  planets.  The  accelerating 
pace  of  discovery  is  documented  at  the  extrasolar  planets  encyclopedia  at  http:// 
exoplanet.eu/  and  at  http://planetquest.jpl.nasa.gov/. 

Udry  and  Santos  (2007)  reviewed  the  statistical  properties  of  exoplanets;  Seager 
and  Deming  (2010)  have  reviewed  exoplanet  atmospheres,  and  Marcy  and  Butler 
(1998)  have  reviewed  the  detection  of  extrasolar  giant  planets. 

Some  of  the  newfound  worlds  travel  in  nearly  circular  orbits,  like  those  in  the 
solar  system,  but  they  are  much  closer  to  their  host  star  than  Mercury  is  to  the  Sun. 
Dubbed  “hot  Jupiters”  because  of  their  size  and  proximity  to  the  intense  stellar 
heat,  they  are  much  too  hot  for  human  life  to  survive  or  water  to  exist.  Their 
temperatures  can  soar  to  more  than  1,000  K,  far  hotter  than  the  surface  of  any 
planet  in  our  solar  system.  Other  newfound  planets  follow  eccentric,  oval-shaped 
orbits  that  deviate  from  a  circular  path,  so  they  venture  both  near  and  far  from  their 
star.  Many  flat  multi-planet  systems  also  have  been  found  as  a  result  of  longer  and 
improved  observations,  as  expected  from  the  nebular  hypothesis. 

Most  of  these  worlds  were  discovered  by  the  wobble  they  create  in  the  motion 
of  their  host  star,  but  some  of  them  were  discovered  when  they  passed  in  front  of 
the  star,  causing  it  to  dim,  or  blink.  If  a  planet  happens  to  have  a  near  edge-on 
orbit,  as  seen  from  the  Earth,  it  periodically  will  cross  directly  in  front  of,  or 
transit ,  its  host  star.  Such  a  transit  can  be  seen  only  if  the  orbit  of  the  distant  planet 
crosses  the  line  of  sight  from  the  Earth,  blocking  a  tiny  fraction  of  the  star’s 
observed  light  and  causing  it  to  periodically  dim,  repeatedly  during  the  planet’s 
endless  journey  around  its  star. 

The  size  of  a  planet  can  be  derived  from  the  size  of  the  dip.  The  fractional 
change  in  brightness,  or  transit  depth ,  is  equal  to  the  ratio  of  the  area  of  the  planet 
to  the  area  of  the  star.  For  the  Earth  and  the  Sun,  as  an  example,  the  transit  depth  is 
0.000084.  The  planet’s  temperature  can  be  estimated  from  the  characteristics  of 
the  star  that  it  orbits  and  the  planet’s  orbital  period. 

These  have  all  been  indirect  detections  of  exoplanets.  As  previously  mentioned, 
the  important  direct  confirmation  of  a  planet  circling  another  star  was  obtained 
from  the  ground-based  Very  Large  Telescope  in  Chile,  obtaining  images  of  a 
Jupiter-sized  planet  moving  around  the  star  Beta  Pictoris.  Astronomers  have  also 
used  the  Keck  I  telescope  in  Hawaii  to  directly  image  the  orbital  motion  of  three 
planets  around  the  star  HR  8799,  using  adaptive  optics  at  infrared  wavelengths. 
The  host  star  is  roughly  1.5  times  as  massive  as  the  Sun,  about  5  times  as  lumi¬ 
nous,  and  located  129  light-years  away  from  the  Earth.  The  planets,  designated  HR 
8799  b,  c  and  d,  orbit  inside  a  massive  dusty  disk  at  distances  of  roughly  twice 
those  of  Neptune,  Uranus  and  Saturn  from  the  Sun.  Their  masses  lie  between  8  and 
10  times  the  mass  of  Jupiter. 
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12.3.5  Searching  for  Habitable  Planets 

From  a  human  perspective,  the  most  interesting  planets  will  be  those  as  small  as 
the  Earth,  in  circular  orbits  at  the  precise  distance  from  the  heat  of  a  Sun-like  star 
to  provide  a  haven  for  life.  Scientists  call  this  location  a  habitable  zone ,  meaning 
that  it  could  be  inhabited  -  but  not  necessarily  that  it  is.  Such  a  planet  might  be 
detected  by  the  transit  method. 

The  orbital  size  can  be  calculated  from  the  period  of  the  repeated  transit  and  the 
mass  of  the  star.  From  the  orbital  size  and  the  luminosity  of  the  star,  the  planet’s 
temperature  can  be  calculated.  This  information  would  indicate  whether  the  planet 
resides  within  the  warm  habitable  zone  -  that  is,  the  range  of  distances  from  a  star 
where  liquid  water  can  exist  on  the  planet’s  surface  and  life  might  exist.  At  closer 
distances,  the  water  would  boil  away,  and  at  more  remote  distances  it  would  freeze 
solid. 

The  Kepler  mission  is  specifically  designed  to  detect  hundreds  of  planets 
comparable  in  size  to  the  Earth  or  smaller  and  located  at  or  near  the  habitable  zone. 
By  measuring  the  brightness  of  100,000  stars,  it  detects  the  periodic  dimming  of 
starlight  produced  when  the  planets  pass  in  front  of  the  stars.  A  transit  by  an  Earth¬ 
sized  planet  produces  a  small  change  in  the  star’s  brightness  of  about  1/10,000, 
lasting  for  2-16  h. 

The  Kepler  mission  discovered  several  hundred  new-planet  candidates  orbiting 
nearby  stars.  A  few  of  the  potential  planets  are  nearly  Earth- sized  and  orbit  in  the 
habitable  zone  of  smaller,  cooler  stars  than  our  Sun.  Because  these  stars  are  less 
luminous  than  the  Sun,  the  habitable  zone  is  closer  and  planets  within  it  have 
orbital  periods  that  are  shorter  than  our  year,  so  they  can  be  recognized  in  an 
observation  time  of  a  few  years.  The  Kepler  planet  candidates  require  follow-up 
observations  with  the  world’s  best  ground-based  telescopes  to  verify  that  they  are 
actual  planets. 

In  the  meantime,  the  world’s  best  telescopes  are  being  employed  to  find  new 
exoplanets  using  the  velocity  method.  The  European  Southern  Observatory’s 
3.6  m  telescope  in  La  Silla,  Chile,  has  discovered  many  new  ones,  including 
several  super-Earths,  and  the  10  m  Keck  I  telescope  atop  Mauna  Kea  in  Hawaii 
has  been  used  to  discover  many  more,  including  a  super-Earth  with  about  four 
times  the  mass  of  the  Earth. 


Example:  An  exoplanet  in  the  habitable  zone 

The  nearby  star  Gliese  581,  the  581th  star  in  the  nearby  star  catalogue  of 
Wilhelm  Gliese  (1915-1963),  has  at  least  six  planets.  The  star  is  located  at  a 
distance  of  6.2  pc  or  20.3  light-years  from  the  Earth.  It  has  an  apparent 
visual  magnitude  of  mv  =  10.55,  an  absolute  visual  magnitude  of 
Mv  =  11.56  and  an  absolute  luminosity  of  L  =  0.013  L0;  from  the  mass- 
luminosity  relation  for  such  stars  a  mass  of  M  =  0.31  M0  is  determined,  and 
from  the  mass-radius  relation  a  radius  of  R  =  0.29  Rq  is  found. 
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The  subscript  ©  for  the  luminosity,  mass  and  radius  denotes  the  Sun’s  value. 
An  effective  temperature,  Teff  =  3,480  K  is  inferred  from  the  star’s  lumi¬ 
nosity  and  radius.  The  sixth  exoplanet  detected  for  this  star,  named  GJ 
581  g,  has  an  orbital  period  of  36.6  days,  an  estimated  distance  from  its  star 
of  0.146  AU,  and  a  minimum  estimated  mass  of  MP  =  3.1  ME,  where  ME, 
denotes  the  mass  of  the  Earth  (Vogt  et  al.  2010).  [The  designation  GJ  comes 
from  the  nearby  catalogue  by  Gliese  and  Jahreiss  (1979)].  For  an  albedo  of 
A  =  0.3  and  the  star’s  luminosity,  the  estimated  temperature  of  the  exo¬ 
planet  at  its  orbital  distance  is  TP  =  228  K  (Sect.  2.5).  That  is  below  the 
freezing  temperature  of  water,  at  273  K,  but  still  thought  to  be  in  the  hab¬ 
itable  zone  of  the  star;  an  atmospheric  greenhouse  effect  might  raise  the 
temperature  above  freezing  as  it  does  on  the  Earth. 


The  atmospheres  of  transiting  exoplanets  also  are  being  investigated  using  the 
Hubble  Space  Telescope ,  the  Spitzer  Space  Telescope ,  and  ground-based  infrared 
telescopes.  As  a  planet  passes  in  front  and  behind  its  star,  astronomers  can  subtract 
the  light  of  the  star  alone  -  when  the  planet  is  blocked  -  from  the  light  of  the  star 
and  planet  together  prior  to  eclipse.  This  isolates  the  emission  of  the  planet  and 
enables  the  detection  of  the  infrared  spectral  signatures  of  gases  in  the  planet’s 
atmosphere.  Water  vapor  and  methane,  for  example,  have  been  found  in  the 
atmosphere  of  at  least  one  exoplanet,  HD  189733  b  (Tinetti  et  al.  2007;  Swain 
et  al.  2008);  it  is  a  hot  Jupiter- size  planet  that  orbits  its  star  in  just  2.2  days  and  is 
nearly  63  light-years  away  from  the  Earth. 


Chapter  13 

Stellar  End  States 


13.1  A  Range  of  Destinies 

No  material  object  can  exist  forever,  and  stars  are  no  exception.  Although  their 
lives  may  be  measured  in  millions  or  billions  of  years,  stars  do  stop  shining  when 
all  of  the  available  sources  of  subatomic  energy  have  been  exhausted.  Their  central 
thermonuclear  reactions,  which  keep  the  star  hot  inside,  are  then  turned  off.  There 
is  no  heat  and  pressure  being  generated  inside  such  a  star,  so  the  internal  support 
has  been  removed  and  it  begins  its  ultimate  contraction.  The  demise  of  such  a  star 
results  in  the  simultaneous  creation  of  a  new  star  from  its  collapsing  core,  with  a 
final  resting  state  that  depends  on  the  star’s  mass. 

As  in  the  beginning  of  their  lives,  the  central  regions  of  all  dying  stars  are 
subject  to  the  unsupported,  inward  pull  of  gravity  from  all  sides,  and  the  entire 
stellar  mass  is  compressed  into  an  increasingly  smaller  radius.  It  only  stops  when 
some  outward  pressure  grows  sufficiently  large  to  halt  the  contraction,  which 
means  that  there  is  an  enormous  range  in  stellar  size  and  mass  density 
(Table  13.1).  The  mean  mass  density  of  the  Sun,  for  example,  is  comparable  to 
that  of  water,  whereas  the  density  of  a  neutron  star  is  similar  to  that  of  the  nucleus 
of  an  atom. 

In  its  earliest  stages,  a  protostar’s  gravitational  contraction  is  stopped  when  the 
star  begins  to  fuse  hydrogen  nuclei  into  helium  nuclei,  generating  the  internal  heat 
and  pressure  that  halts  the  formative  collapse.  A  long  time  later,  when  the 
hydrogen  runs  out,  gravity  takes  over  again  and  compresses  the  core,  heating  it  up 
until  helium  can  be  consumed  in  synthesizing  carbon.  Enough  heat  is  then  gen¬ 
erated  to  balance  the  relentless  force  of  gravity,  and  the  star’s  outer  atmosphere 
expands  to  giant  or  supergiant  size. 

What  happens  next  depends  on  the  mass  of  the  star.  Stars  that  have  a  mass 
comparable  to  the  Sun’s  mass  begin  their  ultimate  collapse  when  the  helium  is 
gone,  ending  up  as  burned-out,  Earth-sized  white  dwarf  stars.  The  ultimate  des¬ 
tinies  of  the  rare,  more  massive,  and  luminous  supergiants  are  explosive.  They  can 
leave  a  city- sized  neutron  star  behind  or  be  crushed  into  a  stellar  black  hole.  Thus, 


K.  R.  Lang,  Essential  Astrophysics,  Undergraduate  Lecture  Notes  in  Physics, 
DOI:  10.1007/978-3-642-35963-7_13,  ©  Springer- Verlag  Berlin  Heidelberg  2013 


411 


412 


13  Stellar  End  States 


Table  13.1  Representative  mass,  radius,  and  mean  mass  density  of  the  starsa 


Star 

Mass 

(M0) 

Radius 

(Re) 

Mean  mass  density5 
(kg  m-3) 

Red  giant  star 

1.2 

100 

0.0014 

Sun 

1.0 

1.0 

1,400 

White  dwarf  star 

0.6 

0.01 

0.84  x  109 

Neutron  star 

1.5 

0.00001 

2.1  x  1018 

Black  holec 

10.0 

0.000004 

0.2  x  1018 

a  The  mass  is  in  units  of  the  Sun’s  mass  M0  =  1.989  x  103()  kg,  and  the  radius  is  in  units  of  the 

o 

Sun’s  radius  RQ  =  6.955  x  10°  m 

b  The  mean  mass  density  =  3  M/(4nR3)  for  a  star  of  mass  M  and  radius  R 
c  A  representative  radius  for  the  black  hole  formed  by  the  collapse  of  a  star  of  mass  M  is  taken  as 
the  Schwarzschild,  or  gravitational,  radius  Rg  =  2  GM/c  for  gravitational  constant 
G  =  6.673  x  10-11  N  m2  kg-2  and  the  speed  of  light  c  =  2.9979  x  108  m  s-1 


there  is  a  range  of  destinies:  Giants  turn  into  tiny  white  dwarfs;  supergiants  turn 
into  even  smaller  neutron  stars;  and  the  bigger,  heavier  super  giants  turn  into  black 
holes  -  all  compressed  into  their  end  states  by  the  never-ending  force  of  gravity. 
The  more  massive  the  star,  the  smaller  it  eventually  becomes. 


13.2  Planetary  Nebulae 

After  discovering  Uranus,  in  1781,  the  English  astronomer  William  Herschel 
(1738-1822)  discovered  a  small  glowing  object  that  he  designated  a  planetary 
nebula  because  of  its  round  shape,  which  resembled  the  disks  of  planets  as  seen 
through  a  small  telescope  (Herschel  1786).  However,  planetary  nebulae  are  not 
made  of  planets,  and  planets  are  not  visible  in  them.  The  designation  nebula  is 
from  the  Latin  word  for  “cloud”,  and  it  was  used  to  distinguish  the  diffuse 
planetary  nebulae,  which  have  resolved  disks,  from  unresolved,  point-like  stars. 

Herschel  and  other  astronomers  soon  discovered  more  of  these  objects,  and  one 
of  them,  named  the  Cat’s  Eye  Nebula,  had  a  “condensation”  in  its  center,  which 
turned  out  to  be  a  star  (Herschel  1786).  It  was  eventually  realized  that  every 
planetary  nebula  has  a  star  at  its  center,  the  exposed  core  of  a  dying  red  giant  that 
illuminates  the  nebula  (Fig.  13.1). 

Any  star  with  a  moderate  mass,  comparable  to  that  of  the  Sun,  eventually 
balloons  into  a  red  giant  star.  As  the  core  nuclear  reactions  cease,  the  giant  sheds 
its  outer  layers,  which  are  blown  away.  All  that  remains  of  that  part  of  the  star  is 
gas  and  dust,  the  planetary  nebula.  The  central  regions  of  a  red  giant  star  will 
collapse  into  a  smaller  white  dwarf  star. 

Another  English  astronomer,  William  Huggins  (1824-1910),  used  his  spec¬ 
troscope  to  find  a  trio  of  emission  lines  in  the  Cat’s  Eye  Nebula  (Huggins  1864, 
1868).  When  heated,  a  low-density  gas  radiates  these  emission  lines;  therefore, 
their  presence  indicated  that  the  planetary  nebulae  contain  hot,  rarefied  gas. 


13.2  Planetary  Nebulae 
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Fig.  13.1  Planetary  nebula  When  a  Sun-like  star  uses  up  its  nuclear  fuel,  the  star’s  center 
collapses  into  an  Earth-sized  white  dwarf  star  and  its  outer  gas  layers  are  ejected  into  space.  Such 
a  planetary  nebula  is  named  after  its  round  shape,  which  resembles  a  planet  as  seen  visually  in 
small  telescopes,  and  is  not  related  to  planets.  The  shells  of  gas  in  the  planetary  nebula  NGC 
6751,  shown  here,  were  ejected  several  thousand  years  ago.  The  hot  stellar  core,  exposed  by  the 
expulsion  of  the  material  surrounding  it,  has  a  disk  temperature  of  about  140,000  K.  Its  intense 
ultraviolet  radiation  causes  the  ejected  gas  to  fluoresce  as  a  planetary  nebula.  (A  Hubble  Space 
Telescope  image  courtesy  of  the  NASA/STScI/AURA/Hubble  Heritage  Team.) 


However,  at  that  time,  no  one  knew  how  hot  the  gas  was,  or  much  about  its 
elemental  constitution. 

The  wavelength  of  one  of  the  emission  lines  detected  by  Huggins  coincided 
with  hydrogen — the  Balmer  emission  line  at  486.1  nm,  but  the  chemical  identi¬ 
fication  of  the  other  two  emission  lines  remained  a  mystery  for  more  than  half  a 
century.  Although  these  green  nebular  lines,  at  wavelengths  of  495.9  and 
500.7  nm,  were  even  stronger  than  hydrogen,  they  defeated  attempts  to  identify 
them  with  elements  known  on  the  Earth.  These  spectral  features  initially  were 
attributed  to  a  previously  unknown  element  called  “nebulium,”  but  the  emission 
lines  eventually  were  shown  to  be  due  to  known  elements  that  had  become  ionized 
by  the  ultraviolet  light  of  the  bright  central  stars. 

When  it  was  realized  that  the  central  stars  of  planetary  nebulae  are  very  hot,  the 
English  astronomer  Arthur  Stanley  Eddington  (1882-1944)  proposed  that  the 
ultraviolet  starlight  ionizes  surrounding  material,  and  that  the  electrons  liberated 
by  this  photoionization  will  heat  the  gas  to  about  10,000  K  (Eddington  1926).  The 
American  astronomer  Donald  Menzel  (1901-1976)  and  the  Dutch  astronomer 
Herman  Zanstra  (1894-1972)  then  showed  that  the  hydrogen  emission  lines  of 
planetary  nebulae  are  produced  when  the  free  electrons  liberated  by  the  ultraviolet 
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Table  13.2  Physical  properties  of  planetary  nebulae 

Q  _ O 

N  =  number  density  =  Ne  =  electron  density  =  (0.5-20)  x  10  m 

Te  =  electron  temperature  &  (0.6-1. 8)  x  104  K 

R  =  radius  =  0.2-0. 8  light-years  =  0.07-0.25  pc  w  (2-7)  x  1015  m 

M  =  mass  =  4nR3NmP/3  =  1021  kg  =  5  x  10-10  MQ 

Vexp  =  expansion  velocity  =  (1-9)  x  104  m  s”* 1 

rexp  =  expansion  age  «  16,000  years 


photoionization  recombine  with  protons  to  make  hydrogen  atoms,  cascading 
through  the  atoms’  various  allowed  electron  orbits  or  energy  levels  and  radiating 
the  Balmer  emission  line  (Menzel  1926;  Zanstra  1927,  1928). 

In  a  brilliant  piece  of  detective  work,  the  American  astronomer  Ira  S.  Bowen 
(1898-1973)  interpreted  the  two  strong  green  emission  lines  as  forbidden  transi¬ 
tions  of  doubly  ionized  oxygen  (Bowen  1928).  His  solution  depended  on  the  rarity 
of  atomic  collisions  in  the  extremely  tenuous  planetary  nebulae,  which  allows  the 
occurrence  of  “forbidden”  transitions.  They  are  not  actually  forbidden  but  rather 
so  improbable  that  they  seldom  take  place  in  a  higher-density  laboratory  situation, 
where  an  atom  almost  always  is  jostled  by  collisions  into  a  different  state  before 
the  forbidden  radiation  can  be  emitted. 

The  observed  emission  lines  indicate  gas  temperatures  of  about  10,000  K  and 
electron  or  ion  number  densities  of  about  10  billion  per  cubic  meter,  denoted  as 

i  n  _ o 

10  m  (Table  13.2).  Although  this  is  a  big  number,  such  densities  are  lower 
than  the  best  vacuum  used  in  a  terrestrial  laboratory.  Kaler  (1985)  has  provided  a 
review  of  planetary  nebulae  and  their  central  stars. 

The  mass  density  and  temperature  of  planetary  nebulae  resemble  those  of  the 
emission  nebulae  (Sect.  11.1);  however,  the  planetary  nebulae  are  about  10  times 
smaller  and,  unlike  the  emission  nebulae,  they  also  are  expanding.  Both  types  of 
nebulae  are  illuminated  by  a  bright  central  star,  and  they  both  emit  similar  spectral 
lines:  those  of  ionized  hydrogen  and  the  forbidden  emission  lines  of  oxygen  and 
nitrogen  ions,  designated  [O  III],  [O  II],  and  [N  II]  (see  previous  Table  11.2). 

As  Zanstra  realized,  the  intensity  of  the  hydrogen  emission  line  can  be  related 
to  the  temperature  of  the  exciting  star  through  the  theory  of  the  hydrogen  atom  and 
the  Planck  spectrum  of  thermal  radiation.  He  found  that  these  stars  are  enormously 
hot,  and  modem  investigations  show  that  they  are  the  hottest  stars  known.  The 
luminous  central  star  radiates  thousands  of  times  more  energy  than  the  Sun  and  has 
a  temperature  of  100,000  K  and  even  over  200,000  K,  much  higher  than  any  main- 
sequence  star.  This  places  the  central  star  of  a  planetary  nebula  right  off  the  scales 
of  the  Hertzsprung  -  Russell  diagram,  on  the  far  left  side  (Fig.  13.2).  Powerful 
winds  have  removed  the  star’s  relatively  cool,  outer  layers  to  reveal  its  hot  interior. 

Most  of  the  radiation  of  such  a  hot  star  is  at  ultraviolet  wavelengths,  which 
brighten  the  surrounding  nebula,  but  the  star  is  relatively  dim  at  the  longer  visible 
wavelengths  and  may  even  become  invisible.  However,  as  a  young  planetary 
nebula  is  blown  outward  by  powerful  winds,  it  slowly  grows  in  size,  thins  out,  and 
becomes  transparent,  revealing  its  source  -  the  exposed  core  of  a  dying  red  giant. 


13.2  Planetary  Nebulae 
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Fig.  13.2  Formation  of  a  planetary  nebula  and  white  dwarf  star  The  evolutionary  track  of  a 
dying  Sun-like  star  in  the  Hertzsprung-Russell  diagram.  When  the  star  has  exhausted  its  nuclear 
hydrogen  fuel,  which  makes  the  star  shine,  it  expands  into  a  red  giant  star;  after  a  relatively  short 
time,  the  giant  star  ejects  its  outer  layers  to  form  a  planetary  nebula.  The  ejected  gas  exposes  a  hot 
stellar  core,  which  collapses  to  form  an  Earth-sized  white  dwarf  star  that  gradually  cools  into  dark 
invisibility.  The  luminosity  is  in  units  of  the  Sun’s  luminosity,  denoted  L0  =  3.828  x  10  Js  , 
and  the  effective  temperature  of  the  stellar  disk  is  in  units  of  degrees  kelvin,  denoted  K.  (From 
“The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press, 
2013.  Reprinted  with  permission.) 

When  modern  telescopes  are  used  to  zoom  in  and  resolve  the  expanding  gas  and 
dust,  they  show  that  it  has  not  been  expelled  in  a  single  puff  of  stellar  wind,  but 
instead  in  multiple  gusts  that  can  slam  into  each  other  (Fig.  13.3).  Fast  and  slow 
winds  may  also  play  a  role  in  producing  the  various  shapes  and  forms  of  planetary 
nebulae.  Balick  and  Frank  (2002)  have  reviewed  the  shapes  and  shaping  of 
planetary  nebulae. 

The  observed  expansion  speeds  of  about  10  km  s-1  and  nebular  dimensions  of 
about  a  light-year  across  indicate  the  expanding  shells  of  gas  were  ejected  about 
16  thousand  years  before  the  expansion  and  size  were  measured.  Their  luminescent 
gas  will  expand  and  disperse  into  interstellar  space,  cooling  into  invisibility  and 
becoming  indistinguishable  from  their  surroundings  in  about  20,000  years.  This  is  a 
relatively  brief  existence,  only  about  1  millionth  of  the  stellar  lifetime  of  many 
billions  of  years.  As  a  result,  planetary  nebulae  are  much  less  numerous  than  the  stars. 


Example:  Expansion  age  of  a  planetary  nebula 

The  named  planetary  nebulae  have  expansion  velocities  of  at  least 
Vexp  =  10  km  s-1,  and  a  radius,  R ,  of  about  5  x  1015  m  or  half  a  light-year. 
The  expansion  time  Texp  =  R/Vexp  «  5  x  1011  s  ^  16,000  years,  where 
1  year  =  3.1557  x  107  s. 
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Fig.  13.3  The  Eskimo  Nebula  About  10,000  years  ago,  a  dying  Sun-like  star  began  flinging 
material  into  nearby  space,  producing  this  planetary  nebula  that  is  formally  designated  as  NGC 
2392.  When  first  observed  more  than  two  centuries  ago,  it  was  dubbed  the  “Eskimo”  Nebula 
because  it  resembled  a  face  surrounded  by  a  fur  parka  like  those  worn  by  Eskimos.  It  is  located 
about  5,000  light-years  from  the  Earth.  This  detailed  image,  obtained  by  instruments  aboard  the 
Hubble  Space  Telescope ,  reveals  several  episodes  of  ejection  from  the  central  star,  including  an 
outer  ring  of  objects  that  are  shaped  like  teardrops  pointing  outward  and  elongated,  filamentary 
bubbles,  each  about  1  light-year  in  diameter.  Dense  material  enveloping  the  star’s  equator  has 
blocked  ejected  material,  and  intense  winds  moving  at  about  420  km  s-1  have  swept  material 
above  and  below  the  equatorial  regions.  The  bright  central  region  contains  another  wind-blown 
bubble.  (Courtesy  of  NASA/Andrew  Frucher/ERO  Team,  Slyvia  Baggett/STScI/Richard  Hook, 
ST-ECF,  and  Zolan  Levay/STScI.) 


Despite  their  infrequent  appearance  on  cosmic  time-scales,  thousands  of 
planetary  nebulae  are  known.  Some  of  them  are  listed  in  Table  13.3  with  the 
names  associated  with  them,  and  their  number  in  the  New  General  Catalogue 
(NGC).  Table  13.3  also  provides  the  celestial  position,  distance,  radius,  expansion 


Table  13.3  Bright  named  planetary  nebulaea 

Catalog  designation  Popular  namec  (Nebula)  RA  (2000)  Dec.  (2000)  Db  (ly)  R  (ly)  Vexp  (km  s_1)  mvd  7^  (K) 


13.2  Planetary  Nebulae 
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velocity,  and  apparent  visual  magnitude  and  temperature  of  the  central  star  for  the 
planetary  nebulae.  The  distances  are  often  uncertain,  and  the  radii  are  between  0.1 
and  3  light-years,  but  it  is  the  popular  names  that  are  so  fascinating.  They  describe 
the  resemblance  of  planetary  nebulae  to  everyday  objects,  such  as  the  twin  weights 
of  a  dumbbell,  a  fur  parka  enveloping  an  Eskimo’s  face,  the  head  of  an  owl,  a  cat’s 
eye  or  simply  a  ring. 

Planetary  nebulae  are  produced  by  the  winds  of  dying  stars.  Cassinelli  (1979) 
has  provided  a  review  of  stellar  winds;  Kudritzki  and  Puls  (2000)  have  reviewed 
winds  from  hot  stars;  Willson  (2000)  has  discussed  mass  loss  from  cool  stars  with 
their  impact  on  the  evolution  of  stars  and  stellar  populations;  and  Dupree  (1986) 
has  discussed  mass  loss  from  cool  stars. 

Stars  that  produce  the  planetary  nebulae  end  up  as  small,  dense  white  dwarf 
stars.  In  the  very  distant  future,  our  Sun  will  become  one,  as  will  the  majority  of 
other  stars.  Their  discovery  was  entirely  unexpected. 


13.3  Stars  the  Size  of  the  Earth 

13.3.1  The  Discovery  of  White  Dwarf  Stars 

The  first  white  dwarf  to  be  known  is  a  companion  of  a  much  brighter  star,  40  Eridani, 
also  known  as  Omicron  Eridani  from  its  Greek  letter  designation.  The  fainter  star  is 
designated  40  Eridani  B  to  distinguish  it  from  the  brighter  member,  A,  of  the  pair. 
The  American  astronomer  Walter  S.  Adams  (1876-1956)  first  drew  attention  to  the 
A0  spectral  type  of  40  Eridani  B,  which  suggested  a  disk  temperature  of  about 
10,000  K,  and  noticed  that  it  was  surprising  that  such  a  hot  star  should  exhibit  such  a 
very  low  luminosity  (Adams  1914). 

Unfortunately,  40  Eridani  B  is  so  far  away  from  its  bright  companion,  with  an 
orbital  period  of  at  least  7,300  years,  that  its  mass  could  not  be  inferred  from  its 
orbital  motion.  However,  this  was  not  the  case  for  Sirius  B,  the  second  white  dwarf 
to  be  discovered.  It  is  also  a  member  of  a  binary  star  system,  with  a  luminous 
companion  designated  Sirius  A,  the  brightest  star  in  the  night  sky. 

The  irregular  motion  of  Sirius  A  first  suggested  the  presence  of  its  dim  com¬ 
panion.  As  discovered  by  Friedrich  Wilhelm  Bessel  (1784-1846),  the  bright  star 
swerves  from  side  to  side  of  a  straight-line  trajectory,  and  this  swerving  motion 
was  attributed  to  the  gravitational  attraction  of  a  nearby,  unseen  companion 
(Bessel  1844).  The  American  astronomer  and  telescope  maker  Alvan  Clark 
(1804-1887)  first  detected  the  diminutive  star  in  1862. 

The  masses  of  Sirius  A  and  B  have  been  estimated  from  the  orbital  motion  of 
the  pair,  weighing  in  at  roughly  the  mass  of  the  Sun;  because  the  companion  was 
about  twice  as  far  as  Sirius  from  their  common  center  of  mass,  the  companion  had 
to  have  about  half  the  mass  of  the  bright  star.  Modern  determinations  indicate  that 
the  mass  of  Sirius  A  is  2.02  solar  masses,  and  that  the  mass  of  Sirius  B  is  0.978 
times  that  of  the  Sun. 
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Adams  showed  that  this  low-luminosity  star  has  a  spectral  class  AO,  like  40 
Eridani  B  (Adams  1915).  What  Adams  did  not  point  out  explicitly  was  that  the 
high  surface  temperature  in  combination  with  the  low  total  luminosity  meant  that 
both  40  Eridani  B  and  Sirius  B  must  be  very  small  -  only  about  the  size  of  the 
Earth.  Furthermore,  the  rather  ordinary  mass  of  Sirius  B  meant  that  the  average 
mass  density  of  the  star  must  be  enormous  -  about  10  kg  m  and  about  1  million 
times  the  average  mass  density  of  the  Sun.  Later  on,  it  was  found  that  the  white 
dwarf  40  Eridani  B  has  it  own  companion,  designated  40  Eridani  C,  whose  orbital 
motion  allowed  a  mass  determination  of  about  half  a  solar  mass  for  this  white 
dwarf  star,  with  a  mass  density  comparable  to  that  of  Sirius  B. 

These  stars  now  are  known  to  be  the  inner,  collapsed  leftovers  of  dying  red 
giant  stars,  exposed  by  the  planetary  nebulae  that  carried  off  the  outer  stellar 
atmospheres. 


13.3.2  Unveiling  White  Dwarf  Stars 

Having  depleted  the  hydrogen  in  their  cores,  the  central  regions  of  solar-mass  stars 
contract  to  become  hot  enough  to  fuse  helium  into  carbon  and  oxygen;  however, 
there  is  not  enough  mass  to  generate  a  temperature  hot  enough  to  fuse  carbon  into 
neon,  at  about  1  billion  K,  and  an  inert  carbon-oxygen  core  is  surrounded  by  an 
inner  helium-burning  shell  and  an  outer  hydrogen-burning  shell. 

After  shedding  most  of  its  outer  material  to  form  a  planetary  nebula,  these  giant 
stars  leave  behind  a  hot  core  of  carbon  and  oxygen.  Because  it  cannot  generate 
additional  heat  by  nuclear  reactions,  the  core  collapses  to  form  a  white  dwarf  with 
about  the  Sun’s  mass  compressed  to  1/  100th  of  its  former  size  and  about  the  same 
radius  as  the  Earth.  Such  a  collapsed  star  is  called  a  white  dwarf  star  because 
initially  it  is  white  in  color  and  it  is  relatively  small  for  a  star. 

The  concept  of  a  white  dwarf  star  as  the  exposed  carbon-oxygen  core  of  a 
former  giant  star  initially  was  difficult  to  reconcile  with  the  fact  that  the  emitted 
light  of  the  first  white  dwarfs  contained  strong  spectral  lines  of  hydrogen  and  it  did 
not  include  the  oxygen  lines  found  in  the  surrounding  planetary  nebula.  The 
French  astronomer  Evry  Schatzman  (1920—2010)  explained  this  paradox  when  he 
noted  that  the  hydrogen  resides  in  a  thin  outer  atmosphere  and  that  the  heavy 
elements  such  as  carbon  and  oxygen  had  to  sink  out  of  sight  into  the  dense  stellar 
interior  (Schatzman  1945).  These  elements  are  drawn  by  gravity  into  the  unseen 
interior  of  the  white  dwarf  and  remain  hidden  by  a  hydrogen-rich  layer  that  is  only 
about  1/ 10,000th  of  the  star’s  mass. 

An  inert,  nonbuming  star,  with  no  nuclear  reactions  in  its  core,  which  is 
composed  mainly  of  carbon  and  oxygen  nuclei,  represents  the  final  destiny,  the  end 
state,  of  main-sequence  stars  with  a  mass  of  between  0.5  and  8  solar  masses,  and  it 
accounts  for  most  of  the  observed  white  dwarf  stars.  Main- sequence  stars  of  lower 
mass,  below  0.5  solar  masses,  bypass  the  giant  stage  -  never  becoming  hot  enough 
inside  to  fuse  helium  -  and  they  collapse  directly  into  white  dwarfs  composed  of 
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helium.  However,  these  stars  have  such  a  low  luminosity  and  temperature  that  they 
burn  hydrogen  slowly  and  have  not  yet  exhausted  their  internal  supply,  even 
during  the  14-billion-year  age  of  the  observable  universe. 

When  first  exposed,  the  white  dwarf  can  be  very  hot  because  it  was  previously 
in  the  hot  interior  of  a  giant  star.  It  may  have  an  initial  temperature  of  200,000  K 
due  to  its  hot  origin  and  the  collapse  that  created  it,  but  because  there  is  no 
thermonuclear  fuel,  there  is  nothing  left  to  heat  a  white  dwarf  star. 

A  white  dwarf’s  temperature  will  drop  from  an  initial  high  of  up  to  200,000  K 
to  an  observed  low  that  is  colder  than  the  Sun,  steadily  becoming  fainter  and 
dimmer.  Liebert  (1980)  has  reviewed  white  dwarf  stars,  Hansen  and  Liebert  (2003) 
have  provided  a  review  of  cool  white  dwarf  stars,  and  D’Antona  and  Mazzitelli 
(1990)  have  reviewed  the  cooling  of  white  dwarfs.  This  cooling  and  dimming  is  so 
slow  that  the  oldest  white  dwarf  has  not  yet  chilled  to  the  point  of  invisibility.  That 
is,  observations  indicate  that  the  oldest  white  dwarf  stars,  which  are  remnants  of 
the  earliest  stars,  have  not  yet  had  time  to  cool  to  the  lowest  possible  luminosity 
that  might  be  observed.  The  amount  of  time  that  the  oldest  white  dwarf  has  cooled 
is  estimated  to  be  about  9  billion  years,  which  when  combined  with  its  former 
lifetime  as  a  main— sequence  and  giant  star  gives  a  rough  estimate  to  the  age  of  the 
observable  universe  of  about  14  billion  years  (Winget  et  al.  1987;  Wood  1992). 


13.3.3  The  High  Mass  Density  of  White  Dwarf  Stars 

The  rather  ordinary  stellar  mass  of  a  white  dwarf  has  been  compressed  within  a 
star  that  is  comparable  to  the  size  of  the  Earth,  which  means  that  it  has  an  enor¬ 
mous  mean  mass  density  up  to  1  million  times  that  of  the  Sun  or  about  a  billion,  or 
109,  kg  m-3  (Focus  13.1). 


Focus  13.1  Radius  and  mass  density  of  a  white  dwarf  star 

The  effective  disk  temperature  of  Sirius  B  is  Teff  =  25,200  K,  while  its 
absolute  luminosity,  L,  is  L  =  0.026  L©,  where  the  Sun’s  absolute  lumi- 
nosity  is  L©  =  3.828  x  10^  J  s  .  From  the  Stefan— Boltzmann  law, 

L  =  AnaR2T4efp  (13.1) 

where  n  «  3.1416  and  the  Stefan— Boltzmann  constant  a  =  5.6704  x 

O  1  O  A 

10_o  J  s_i  m  K  ,  we  obtain  an  expression  for  the  radius,  R : 


1/2 


AllaTeff 


(13.2) 
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/r 

which  gives  a  radius  R  %  5.9  x  10  m  ^  0.01  Rq,  for  the  radius  of  Sirius  B, 

o 

where  the  solar  radius  RQ  =  6.955  x  10  m.  By  way  of  comparison,  the 
radius  of  the  Earth  is  re  =  6.378  x  106  m. 

The  mass,  M,  of  Sirius  B  is  M  =  0.98  M0,  where  the  Sun  mass 

on 

M0  =  1.989  x  10  kg.  The  mass  density,  p,  of  Sirius  B  is  therefore 
p  =  3  M/(4nR3)  «  2.26  x  109  kg  m-3. 


Arthur  Stanley  Eddington  (1882-1944)  pointed  out  that  there  is  nothing 
inherently  absurd  about  the  high  mass  densities  of  white  dwarf  stars  (Eddington 
1924).  Because  all  of  the  electrons  are  stripped  away  from  their  atomic  nuclei  in 
the  hot  stellar  interiors,  the  free  electrons  can  be  packed  closely  with  the  bare 
nuclei,  within  the  former  space  of  the  empty  atoms. 

Eddington  also  predicted  that  the  gravitational  redshift  of  Sirius  B  might  be 
observed  once  allowance  was  made  for  the  orbital  motion  of  the  double- star 
system  (Eddington  1924).  The  gravitational  redshift  is  the  Doppler  shift  of  a 
spectral  line  caused  by  the  loss  of  energy  in  overcoming  the  gravity  of  the  emitting 
object  (Sect.  6.5).  Adams  apparently  confirmed  the  effect,  providing  an  indepen¬ 
dent  confirmation  of  the  small  size  of  white  dwarf  stars  that  had  been  inferred  from 
their  high  temperature  and  low  luminosity  (Adams  1915).  This  result  was  sub¬ 
stantiated  with  greater  clarity  for  Sirius  B,  40  Eridani  B,  and  other  white  dwarf 
stars  using  ground-based  telescopes  (Greenstein  et  al.  1971;  Greenstein  and 
Trimble  1972;  Shipman  1972;  Wegner  1980;  Shipman  et  al.  1997;  Huber  et  al. 
1998),  as  well  as  the  Hubble  Space  Telescope  (Barstow  et  al.  2005). 

The  expression  for  the  change  in  wavelength,  A2,  for  a  line  emitted  at  wave¬ 
length  2^  can  be  derived  from  Newton’s  theory  of  gravity  by  assuming  that  the 
radiation  photons  have  an  energy  hv  at  frequency  v,  and  that  this  energy  can  be 
expressed  in  terms  of  an  imaginary  radiation  mass,  m,  times  the  square  of  the 
speed  of  light,  c,  or  that  hv  =  me  ,  where  the  Planck  constant 
h  =  6.626  x  10-34  J  s  and  c  =  2.9979  x  108  ms-1.  When  the  photon  leaves  the 
surface  of  a  mass,  M,  with  radius,  R ,  the  photon  loses  the  energy,  AE,  given  by: 


A  E  =  hAv  = 


GMm  GMhv 


R 


Rc 2 


(13.3) 


where,  Av  is  the  change  in  frequency  v.  The  gravitational  redshift,  zg,  caused  by 
this  loss  of  photon  energy  is  given  by 


Vr  Av  Vl  V observed  ^ L  ^ 


observed 


C 


V 


=  2.12  x  10“6 


V  observed 

M\  {Rq 


A 


GM 

Rc 2 


(13.4) 


Mr. 


0. 


R 


where  Vr  is  the  radial  velocity  corresponding  to  the  gravitational  redshift,  vL  and 
v observed  respectively  denote  the  emitted  line  frequency  and  observed  line 
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frequency,  Xl  and  Xobserved  respectively  denote  the  emitted  line  wavelength  and 
observed  line  wavelength,  A2  is  the  change  in  wavelength  caused  by  overcoming 

_  11  o  _ o 

the  gravity,  and  the  gravitational  constant  G  =  6.674  x  10_ii  N  m  kg  . 


Example:  Gravitational  redshift  and  thermal  velocity  of  Sirius  B 

Precise  measurements  indicate  that  the  radius  of  Sirius  B  is  R  =  0.0084 
Rq  =  5.84  x  10  m,  for  a  solar  radius  RQ  =  6.955  x  10  m,  and  that  its 
mass  is  M  =  0.978  M©  (Barstow  et  al.  2005).  This  indicates  a  gravitational 

O  A 

redshift  of  =  GM/(Rc )  «  2.5  x  10  ,  where  the  Newtonian  gravita¬ 
tional  constant  G  =  6.674  x  10“ 11  N  m2  kg-2  and  the  speed  of  light 

o  1 

c  =  2.9979  x  10  ms  .  This  redshift  corresponds  to  a  radial  velocity 
Vr  =  zgc  «  7.5  x  104  ms-1  «  75  km  s_1,  where  the  speed  of  light 

o  1 

c  =  2.9979  x  10  m  s  .  The  measured  gravitational  redshift  of  Sirius  B  is 
80.42  ±  4.83  km  s-1,  which  is  consistent  with  the  estimate  given  the 
uncertainty  of  the  measurement. 

The  bolometric  luminosity  of  Sirius  B  is  L  =  0.0026  L©  =  9.95  x 
1023  J  s-1,  where  the  Sun’s  luminosity  L©  =  3.828  x  1026  J  s_1.  Measure¬ 
ments  indicate  an  effective  temperature  of  Teff  =  25,200  K  for  Sirius  B.  Using 
the  star’s  luminosity  and  radius  with  the  Stefan— Boltzmann  law 
L  =  AugR2  Tip  where  n  =  3.1416  and  the  Stefan— Boltzmann  constant 
( j  =  5.670  x  10-8  J  m-2  K-4  s_1,  we  obtain  Teff  «  14,000  K.  Using 
Teff  =  25,200  K  in  the  expression  for  the  thermal  velocity  Vthermai  = 
(3 kT/m)1^2,  where  the  Boltzmann  constant  k  =  1.381  x  10-23  J  K-1  and  a 
hydrogen  atom  of  mass  m  =  1.67  x  10  ~  kg,  we  have  Vthermai  =  2.5  x 
104  ms-1  =  25  km  s-1,  about  three  times  smaller  than  the  gravitational 
redshift. 


For  a  white  dwarf  star,  R  «  0.01  R©  and  =  Vr/c  «  0.02  with 
Vr  ^60  km  s-1.  The  observed  gravitational  redshifts  for  Sirius  B  and  40  Eridani 
B,  in  radial  velocity  units,  are  80.42  ±  4.83  km  s_1  and  23.9  ±  1.3  km  s_1, 
respectively.  Observations  of  these  gravitational  redshifts  confirm  the  small  size  of 
white  dwarf  stars  that  had  been  inferred  from  their  high  temperature  and  low 
luminosity. 

The  small  size  and  high  mass  density  of  the  white  dwarf  stars  have  also  been 
substantiated  by  measurements  of  their  magnetic-field  strength.  During  gravita¬ 
tional  collapse,  magnetic  flux  is  conserved,  and  the  surface  magnetic-field  strength 
increases  as  the  surface  area  decreases.  For  a  sphere  of  surface  magnetic  field 
strength,  B,  and  radius  R,  the  product  BR~  is  a  constant,  and  in  solar  units: 


B 


2 


R 


(13.5) 
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Table  13.4  Physical  properties  of  white  dwarf  stars 

Mwd  =  mass  of  white  dwarf  star  %  0.6  M0  %  1.2  x  103°  kg 

on 

Mcwd  =  critical  upper  mass  limit  for  white  dwarf  star  =1.4  M0  «  2.3  x  10"u  kg 
Rwd  =  mean  radius  of  white  dwarf  stars  =  0.01  RQ  «  6  x  106  m  Re  =  radius  of  Earth 

Q  _ Q 

pWD  =  mass  density  of  white  dwarf  star  ^ j  10  kg  m 

Vesc  =  escape  velocity  of  white  dwarf  star  =  ( 2GMWD/RWD )1//2  =  9,000  km  s-1  =  0.03  c. 

_ o  23  _ i 

Ewd  —  absolute  luminosity  of  white  dwarf  star  1(T"  Lq  «  3.8  x  10"  J  s'1 

Q  A 

T\vd  =  effective  temperature  of  white  dwarf  star’s  visible  disk  =  4  x  10"  K  to  7  x  10  K 

9  zt 

Bwd  =  surface  magnetic  held  strength  of  white  dwarf  star  =10  tesla  to  10  tesla 

ry 

z„  =  gravitational  redshift  of  white  dwarf  star  =  GMWD/(RWDc  )  w  0.02  =  Vr/c  (or  radial 

_ t 

velocity  Vr  =  zgc  w  60  km  s  ,  where  c  is  the  speed  of  light) 


The  dipolar  magnetic  field  of  the  solar  disk  is  0.01  tesla.  For  a  white 

dwarf  of  radius  R  of  1  /  100th  the  radius  of  the  Sun,  or  R  =  Rwd  «  0.01  R& ,  the 
surface  magnetic  field  strength  will  be  amplified  to  Bwd  ^  100  tesla.  Surface 
magnetic  field  strengths  of  100  to  10,000  tesla  were  inferred  from  the  circularly 
polarized  light  of  white  dwarf  stars,  and  roughly  10  %  of  them  have  magnetic 
fields  in  excess  of  100  tesla  (Kemp  et  al.  1970).  Angel  (1978)  has  reviewed 
magnetic  white  dwarfs.  Individual  papers  on  magnetism  in  white  dwarfs  are  ref¬ 
erenced  in  Lang  (1999). 

The  physical  properties  of  white  dwarf  stars  are  given  in  Table  13.4. 


13.4  The  Degenerate  Electron  Gas 

13.4.1  Nuclei  Pull  a  White  Dwarf  Together  as  Electrons 
Support  It 


The  matter  deep  inside  a  white  dwarf  star  is  completely  ionized  and  composed  of 
equal  numbers  of  atomic  nuclei  and  electrons.  Because  most  white  dwarfs  are  the 
crushed  remnants  of  red  giant  stars,  which  previously  fused  helium  into  carbon, 
their  collapsed  cores  consist  mainly  of  carbon  nuclei  and  electrons.  Stars  that  are 
somewhat  more  massive  than  the  Sun  leave  behind  white  dwarfs  containing 
oxygen  nuclei.  Therefore,  white  dwarf  stars  contain  various  amounts  of  carbon 
nuclei  or  oxygen  nuclei,  depending  on  the  star,  and  it  is  these  nuclei  that  supply  the 
mass  and  gravity  of  a  white  dwarf  star. 

The  nuclei  supply  the  mass  and  gravity  of  a  white  dwarf  star,  with  a  mass 
density,  p,  given  by: 


P 


ynpNe , 


(13.6) 
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for  a  white  dwarf  composed  of  elements  of  atomic  number  Z,  the  number  of 
protons,  and  atomic  mass  number  A.  The  mass  number  of  helium  is  A  =  4,  that  of 
carbon  is  A  =  12  and  the  mass  number  of  oxygen  is  A  =  16,  the  number  of 
protons  and  neutrons.  Equivalently,  A/Z  =  Ne/Np  =  /ie,  the  ratio  of  electron 
number  to  proton  number,  since  the  number  of  electrons  in  an  atom  is  equal  to  the 
number  of  nucleons  or  protons  plus  neutrons.  For  helium,  carbon  and  oxygen 
A/Z  =  2  or  Z/A  =  0.5.  We  also  have  A/Z  =  p/(mpNe).  In  this  expression,  mp  is 
the  proton  mass,  or  mp  =  1.67262  x  10  ~  kg,  and  Ne  is  the  electron  number 
density,  which  is  related  to  the  mass  density  by: 


(13.7) 


where  Nt  is  the  ion  number  density. 

Why  does  the  core  collapse  stop  at  the  white  dwarf  stage?  In  other  words,  what 
is  holding  up  the  star?  There  are  no  internal  nuclear  reactions  to  provide  energy, 
generate  heat,  and  create  pressure  to  oppose  gravity.  As  the  white  dwarf  radiates 
away  the  heat  left  over  from  its  former  life  in  a  red  giant  star,  it  eventually  might 
cool  down  to  a  temperature  of  absolute  zero,  and  there  would  be  no  motion  or 
thermal  energy  left  to  support  the  white  dwarf  star. 

When  the  material  cooled  enough,  it  could  be  expected  that  the  electrons  would 
return  to  their  former  orbits  around  the  nuclei,  making  larger  atoms,  which  would 
force  the  nuclei  apart  and  make  the  white  dwarf  expand  in  size.  However,  the  stars 
have  no  internal  energy  to  push  against  gravitation  and  accomplish  this  feat; 
therefore,  it  seemed  that  a  white  dwarf  star  could  not  become  that  cold. 

This  paradox  was  not  resolved  until  the  development  of  quantum  mechanics 
and  the  realization  that  it  is  the  electronic  properties  of  the  crushed  matter  that  hold 
up  a  white  dwarf.  The  high-speed  motions  of  the  densely  packed  electrons,  rather 
than  the  nuclei,  produce  an  outward  pressure  that  holds  the  gravitational  forces  at 
bay.  These  motions  are  not  due  to  the  star’s  internal  temperature  or  heat;  in  fact, 
the  internal  pressure  of  a  white  dwarf  star  is  unaffected  by  temperature. 

The  quantum-mechanical  description  of  a  very  dense,  crushed  state  of  matter  is 
statistical,  and  it  is  related  to  the  quantum  numbers  that  specify  the  state  of 
subatomic  particles.  That  is,  the  properties  of  the  particles,  such  as  energy  or 
location,  can  take  on  only  specific  quantized  values  and  no  others.  This  situation  is 
related  to  two  principles  that  govern  the  quantum  state  of  the  very  small:  (1)  The 
uncertainty  principle  that  states  that  at  any  given  time  we  cannot  know  exactly 
both  where  a  particle  is  and  where  it  is  going;  and  (2)  the  exclusion  principle  that 
forbids  the  existence  of  two  or  more  particles  in  exactly  the  same  quantum  state. 

The  German  physicist  Werner  Heisenberg  (1901-1976)  first  stated  the  uncer¬ 
tainty  principle  (Heisenberg  1927).  It  states  that  the  more  we  know  about  the 
location  of  a  subatomic  particle,  the  less  we  know  about  its  momentum  and 
velocity,  and  vice  versa.  In  an  alternative  interpretation,  the  more  we  know  about 
the  energy  of  a  subatomic  particle,  the  less  we  know  about  when  it  had  that  energy, 
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and  vice  versa.  Mathematically,  the  product  of  the  uncertainties,  in  location  and 
momentum  or  energy  and  time,  equals  the  Planck  constant  h. 

The  Austrian  physicist  Wolfgang  Pauli  (1900-1958)  proposed  the  exclusion 
principle  (Pauli  1927),  which  states  that  two  identical  subatomic  particles  cannot 
occupy  the  same  quantum  state  at  the  same  time.  It  applies  to  electrons,  protons, 
and  neutrons,  and  it  dictates  how  electrons  behave  in  an  atom,  occupying  their 
various  energy  levels.  Each  electron  in  an  atom  has  its  own  space,  which  prevents 
the  electrons  from  either  joining  together  in  the  same  location  or  falling  into  their 
atomic  nucleus.  The  exclusion  principle  ensures  the  very  existence  of  atoms. 

It  also  means  that  the  free  electrons  in  a  white  dwarf  star,  which  are  not  attached 
to  atoms,  cannot  be  in  precisely  the  same  place  at  the  same  time.  They  instead 
resist  being  squeezed  into  one  another’s  territory,  darting  away  at  high-speeds  just 
to  keep  their  own  space.  This  provides  the  pressure  of  the  crushed  state  of  matter, 
which  is  caused  by  the  electrons’  resistance  to  crowding. 

The  Italian  physicist  Enrico  Fermi  (1901-1954)  first  worked  out  the  statistical 
description  of  a  large  number  of  identical  subatomic  particles  based  on  the 
exclusion  principle  (Fermi  1926,  1928).  He  specified  the  conditions  in  which  all  of 
the  particles  have  the  least  possible  energy  without  violating  Pauli’s  exclusion 
principle,  which  specifies  that  these  particles  cannot  all  occupy  the  very  lowest 
energy,  called  the  ground  state ,  at  the  same  time.  In  Fermi’s  solution,  all  of  the 
states  up  to  a  certain  limiting  energy  can  be  occupied,  and  all  of  those  above  that 
energy  are  not  occupied. 

Under  such  conditions  the  collection  of  subatomic  particles,  or  gas,  is  said  to  be 
degenerate.  In  mathematics,  a  degenerate  case  is  a  limiting  one  in  which  a  class  of 
objects  changes  its  nature  so  as  to  belong  to  another,  usually  simpler,  class;  for 
example,  a  point  is  a  degenerate  circle. 

As  shown  by  Ralph  H.  Fowler  (1889-1944),  who  was  Eddington’s  colleague  at 
Cambridge  University,  it  is  the  degenerate  pressure  of  the  electron  gas  that  sup¬ 
ports  a  white  dwarf  star.  Fowler  applied  Fermi’s  statistical  description  to  such  a 
dense  star,  showing  that  its  electrons  are  completely  degenerate.  They  produce  an 
outward  push,  known  as  degeneracy  pressure ,  to  keep  their  own  space  and  support 
the  star  (Fowler  1926).  This  kind  of  pressure  is  proportional  to  the  5/3  power  of 
mass  density,  and  accordingly  increases  rapidly  with  it. 

Not  only  is  the  electron  degeneracy  pressure  strong  enough  to  withstand  the 
crushing  gravity  of  a  white  dwarf  star;  it  is  also  independent  of  the  temperature  of 
the  electrons  and  involves  no  nuclear  reactions.  Because  the  pressure  does  not 
depend  on  temperature,  it  will  persist  even  if  the  star  cools  to  absolute  zero, 
without  the  electrons  ever  rejoining  the  nuclei.  As  Fowler  showed,  the  individual 
electrons  in  the  crushed  matter,  even  at  a  temperature  of  absolute  zero  still  would 
have  a  kinetic  energy  comparable  to  the  thermal  energy  of  particles  in  an  expanded 
gas  with  a  temperature  as  large  as  10  million  K. 

Because  the  equation  of  state  of  the  degenerate  electron  gas  is  unaffected  by 
temperature,  any  heating  by  hypothetical  nuclear  reactions  will  increase  the 
temperature  and  rates  of  those  reactions.  The  temperature  would  continue 
increasing  until  the  star  exploded,  so  we  conclude  that  white  dwarf  stars  do  not 
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shine  by  nuclear  reactions.  Their  light  must  come  from  the  slow  leakage  of  the 
heat  contained  in  the  nondegenerate  nuclei.  Eventually,  the  white  dwarf  star  will 
fade  into  a  gigantic  black  molecule,  a  frozen  star. 

When  the  electrons  in  a  white  dwarf  star  are  moving  at  non-relativistic  speeds, 
considerably  less  than  the  speed  of  light,  the  degenerate  electron  pressure,  Pe,  is 
given  by: 


(13.8) 


_ o  a 

where  the  Planck  constant  h  =  6.626  x  10  J  s,  the  electron  mass 

_ O  1 

me  =  9.109  x  10  kg,  and  Ne  is  the  electron  density.  Notice  that  the  degenerate 
electron  pressure  does  not  depend  on  the  temperature.  An  equivalent  expression 
for  the  equation  of  state  of  non-relativistic  degenerate  electron  pressure  is  (Fowler 

1926). 
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for  a  fully  ionized  gas  composed  of  elements  of  atomic  number  Z  and  atomic  mass 
number  A.  The  proton  mass  mP  =  1.67262  x  10  ~  kg.  For  a  white  dwarf  star, 
which  is  composed  of  helium,  carbon  and  oxygen  with  Z/A  =  0.5,  and: 


Pe  «  3.074  x  106p5/3  Pa  for  p  <  1010  kg  m“3  (13.10) 


Example:  Gas  pressure,  degenerate  electron  pressure,  and  magnetic 
pressure  in  a  white  dwarf 

A  white  dwarf  star  has  a  mass  density  of  p  =  10  kg  m  and  an  initial 

,~j 

temperature  of  T  =  10  K.  The  gas  pressure,  PG,  is  given  by: 

pG  =  NikT  =  —kT ,  (13.11) 

Amp 

where  Nt  is  the  ion  number  density,  the  mass  number  A  =  4  for  helium, 
A  =  12  for  carbon  and  A  =  16  for  oxygen,  the  Boltzmann  constant 
k  =  1.38065  x  10-23  J  K-1,  and  the  proton  mass  is  mp  =  1.67262  x 
10  ~  kg.  Using  these  numbers  for  A  =  4,  the  gas  pressure  is 
PG  «  2  x  10 19  Pa.  The  non-degenerate  electron  gas  pressure  is 
Pe  =  3.074  x  106  p 5^3  «  3  x  1021  Pa.  So  even  on  formation,  before  a 
white  dwarf  star  has  cooled,  the  degenerate  electron  pressure  is  about  one 
hundred  times  greater  than  the  gas  pressure. 

Since  the  white  dwarf  has  no  nuclear  fusion  reactions  to  supply  heat,  it 
gradually  cools.  The  effective  temperature  of  Sirius  B,  for  example,  is  now 

A  #  1  Z 

2.5  x  10"  K,  so  its  gas  pressure  is  about  5  x  10iO  Pa. 
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Since  magnetic  flux  is  conserved  in  gravitational  collapse,  the  magnetic 
field  strength  B  is  amplified  by  the  inverse  square  of  the  radius,  to  as  much  as 
B  «  104  tesla  for  a  white  dwarf  star  which  is  100  times  smaller  than  the 

O  IQ 

Sun.  The  magnetic  pressure,  PB  =  B  /  (2 p0)  ^  4  x  10iJ  Pa,  is  much 
smaller  than  the  degenerate  electron  gas  pressure,  where  the  permeability  of 
free  space  is  p0  =  1.2566  x  10-6  N  A-2. 


By  1929,  the  Estonian  astrophysicist  Wilhelm  Anderson  (1880-1940)  demon¬ 
strated  that  the  electrons  in  highly  compressed,  degenerate  matter  begin  to  attain 
velocities  on  the  order  of  the  speed  of  light,  and  that  in  this  case  the  variation  of  the 
electron  mass  with  velocity  must  be  taken  into  account  by  using  the  equations  of 
Special  Relativity.  For  a  relativistic  degenerate  electron  gas  the  equation  of  state  is 
(Anderson  1929;  Stoner  1930): 


(13.12) 


which  differs  from  the  non-relativistic  case  in  that  the  electron  mass  does  not  appear 
and  the  pressure  varies  as  the  4/3  power  of  the  mass  density,  p,  rather  than  the  5/3 
power  for  a  non-relativistic  degenerate  electron  gas.  Using  Z/A  =  0.5  and  evalu¬ 
ating  h  =  6.626  x  10-34  J  s"1,  c  =  2.9979  x  108  m  s_1,  and  mp  =  1.67262  x 

_ O  rJ 

10  ~  kg,  we  obtain 


Pe  «  0.49  x  10V/3  Pa  for  p  >  1010  kg  m“3,  (13.13) 

for  the  relativistic,  degenerate  electron  gas.  The  non-relativistic  and  relativistic 
degenerate  electron  pressures  become  equal  at  a  mass  density  of  p  «  4  x  109 
kg  m-3. 


13.4.2  Radius  and  Mass  of  a  White  Dwarf 


A  higher-mass  white  dwarf  will  be  squeezed  into  a  smaller  space  by  its  gravity,  so 
the  star’s  radius  decreases  with  increasing  mass.  For  a  large  enough  mass,  we 
might  imagine  that  the  star’s  radius  would  become  very  small,  perhaps  even 
shrinking  to  almost  zero;  however,  this  is  preposterous  and  there  must  be  a  limit  to 
the  mass. 

An  approximate  expression  for  the  radius,  RWD ,  of  a  non-relativistic  white 
dwarf  star  of  mass,  M,  is: 


R 
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20mem5J3  G  \A 


AT1/3  «  4.82  x  1016M"1/3  m 


(13.14) 
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where  h  is  the  Planck  constant,  me  is  the  electron  mass,  mp  is  the  proton  mass, 
Z/A  =  0.5  and  the  gravitational  constant  G  =  6.674  x  10-11  m3  kg-1  s-2.  This 
means  that 


Rwd  x  3.8  x  106(M;,/M)1/3m,  (13.15) 

AO 

where  the  solar  mass  MQ  =  1.989  x  10  kg;  see  Provencal  (1998)  for  observa¬ 
tional  tests  of  the  mass-radius  relation  of  white  dwarf  stars. 

As  the  white  dwarf  shrinks  in  size,  its  mass  and  mass  density  become  higher  as 
does  the  degeneracy  pressure  of  its  electron  gas.  Under  extreme  compression, 
however,  the  average  speed  of  the  electrons  increases  and  eventually  approaches 
the  speed  of  light.  This  means  that  there  is  an  upper  limit  to  the  mass  that  can  be 
supported  by  their  pressure.  This  makes  common  sense,  since  the  electrons  will 
move  at  greater  speeds  with  increasing  stellar  mass  and  density,  although  they 
cannot  move  faster  than  the  speed  of  light. 

The  limiting  mass  for  a  white  dwarf  star  is  determined  under  high-speed,  rel¬ 
ativistic  conditions,  when  the  electrons  approach  the  speed  of  light.  As  both  the 
German-Estonian  astrophysicist  Wilhelm  Anderson  (1880-1940)  and  the  English 
physicist  Edmund  C.  Stoner  (1899-1968)  demonstrated,  a  star  can  contract  only 
until  the  gravitational  potential  energy  becomes  insufficient  to  balance  the  increase 
in  the  kinetic  energy  of  the  electrons,  which  occurs  for  stellar  masses  of  about 
1  solar  mass  (Anderson  1929;  Stoner  1929,  1930). 

The  two  forms  of  energy  are  roughly  equal  at: 


GM; 

Rwd  ’ 


or  at  a  critical  mass  Mc  given  by 
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where  Pe  is  the  relativistic  electron  pressure,  RWD  is  the  radius  of  the  white  dwarf 
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star,  and  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  .  Assuming  a 

Q  G 

mass  density  of  p  =  10  kg  and  RWD  =  6  x  10°  m,  we  obtain 
Mc  x  1030  kg  x  M&  =  1.989  x  1030  kg. 

Thus,  for  stellar  masses  larger  than  about  1  solar  mass,  there  can  be  no  equi¬ 
librium  white  dwarf  configurations.  More  massive  stars  collapse  under  their  own 
weight  to  form  a  neutron  star  or  a  black  hole  at  the  endpoints  of  stellar  evolution. 

During  his  voyage  from  India  to  Cambridge  University  for  his  graduate  studies, 
the  Indian  astrophysicist  Subrahmanyan  Chandrasekhar  (1910-1995)  derived  the 
detailed  equation  of  state  of  a  degenerate  electron  gas  in  the  extreme  relativistic 
limit  (Chandrasekhar  1931).  The  exact  solution  for  the  critical  mass  is  given  by 
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(13.18) 


or  for  a  white  dwarf  star  with  Z/A  =  0.5, 

Me  ~  1.46  M0, 


(13.19) 


where  the  Sun’s  mass  M0  =  1.989  x  1030  kg. 

Although  both  Anderson  and  Stoner  previously  called  attention  to  the  existence 
of  this  upper  mass  limit,  it  is  known  now  as  the  Chandrasekhar  limit ,  because  he 
was  the  first  to  derive  the  detailed  equilibrium  conditions  in  which  degenerate 
electron  gases  support  a  dense  star’s  gravity. 

In  1983,  the  Nobel  Prize  in  Physics  was  awarded  equally  to  Chandrasekhar,  for 
his  theoretical  studies  of  the  physical  processes  of  importance  to  the  structure  and 
evolution  of  the  stars,  and  to  William  A.  “Willy”  Fowler  (1911-1995)  for  his 
theoretical  and  experimental  studies  of  the  nuclear  reactions  of  importance  in  the 
formation  of  the  chemical  elements  in  the  universe. 


13.5  Exploding  Stars 
13.5.1  Guest  Stars,  the  Novae 

For  at  least  2,000  years,  astronomers,  hunters,  mariners,  and  others  familiar  with 
the  brightest  stars  must  have  been  amazed  by  a  nova ,  or  “new  star”,  that  would 
appear  suddenly  at  a  place  in  the  sky  where  no  star  previously  had  been  seen.  For  a 
few  days,  the  nova  might  be  among  the  brightest  stars  in  the  dark  night  sky.  But 
then  the  star  would  begin  to  fade  away,  and  in  about  a  month  it  would  disappear 
back  into  invisibility,  without  a  trace.  The  Chinese  called  them  “guest  stars”  or 
“visiting  stars”  because  they  were  not  permanent  members  of  the  celestial  sphere, 
instead  appearing  suddenly  and  then  departing  abruptly,  like  uninvited  guests. 

Every  20  years  or  so,  a  nova  is  luminous  enough  and  close  enough  to  be 
conspicuous  without  the  aid  of  a  telescope,  attracting  the  attention  of  both 
astronomers  and  the  superstitious.  Like  good  wine,  they  are  specified  by  the  year 
of  their  occurrence,  and  their  location  is  specified  by  the  constellation  in  which 
they  appear.  Nova  Aquilae  1918  was  the  brightest  of  the  twentieth  century,  at 
apparent  visual  magnitude  —1.1;  Nova  Herculis  1934  has  historical  importance; 
and  Nova  Cygni  1992  was  the  brightest  nova  in  recent  history. 

Something  of  substance  had  to  be  at  a  nova’s  location  before  it  appeared,  to 
supply  the  energy  of  its  outburst.  By  the  mid-twentieth  century,  it  was  realized  that 
this  “mysterious  something”  was  an  inconspicuous  star  that  previously  had  been 
recorded  during  systematic  surveys  of  the  dark  night  sky,  using  large  telescopes 
and  photographic  exposures  to  record  the  dim  light  of  faint  stars. 
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Table  13.5  Physical  properties  of  some  novaea 


Star  Name 

Year 

t^max 

Pfnax  (Tq) 

D  fly) 

Porb  (hours)  Mi  (Mq) 

M2  (M0) 

v  exp 

(km  s'1) 

Classical  novae 

GK  Persei 

1901 

+0.2 

105'3 

1,500 

47.92 

0.9 

0.25 

1,200 

V603  Aquilae 

1918 

-1.4 

1056 

800 

3.31 

0.66 

0.2 

265 

DQ  Herculis 

1934 

+1.4 

103-9 

316 

4.65 

0.62 

0.44 

315 

VI 974  Cygni 

1992 

+4.4 

104-9 

10,430 

19.53 

0.83 

— 

— 

Dwarf  nova 

SS  Cygni 

b 

8.3 

%10 

^541 

6.603 

0.60 

0.40 

Recurrent  nova 

RS  Ophiuchi 

c 

4.5 

0.1 

>2,000 

455.7  d 

1.4 

— 

— 

a  Maximum  visual  magnitude,  mmax,  and  maximum  luminosity  Lmax  in  units  of  the  Sun’s  lumi- 

o/:  _ i 

nosity  L0  =  3.828  x  10  J  s  ,  distance  in  light-years,  abbreviated  ly,  orbital  period,  Porb, 
white  dwarf  mass,  and  companion  mass,  M2  in  units  of  the  Sun’s  mass 

Mq  =  1.989  x  1030  kg,  and  expansion  velocity  Wexp 

b  The  dwarf  nova  SS  Cygni  undergoes  frequent  and  regular  outbursts  every  7-8  weeks,  with  an 
apparent  visual  magnitude  of  mmin  =  12.2  at  minimum  and  mmax  =  8.3  at  maximum.  More  than 
800  outbursts  have  been  observed  since  its  discovery  in  1896 

c  The  recurrent  nova  RS  Ophiuchi  erupted  in  1898,  1907,  1933,  1945,  1958,  1967,  1985  and 
2006.  It  is  a  binary  system  with  a  red  giant  star  in  a  455.7-day  orbit  around  a  white  dwarf  star  of 
mass  near  the  Chandrasekhar  limit 


So,  the  bright,  short-lived  novae  are  neither  new  nor  temporary  but  instead 
existing  stars  that  suddenly  increase  in  brightness  by  as  much  as  100,000  times, 
returning  to  their  original  states  after  several  months  or  a  few  years. 

Telescopic  observations  also  have  enabled  the  detection  of  faint  dwarf  novae. 
They  brighten  repeatedly,  on  a  time-scale  from  days  to  decades,  although  by  a 
smaller  amount  and  with  lower  luminosity  than  the  classical  novae  whose 
explosive  outbursts  are  visible  without  the  aid  of  a  telescope.  More  than 
900  outbursts  of  the  dwarf  nova  SS  Cygni,  for  example,  have  been  observed  since 
its  discovery  in  1896.  It  varies  from  apparent  visual  magnitude  of  12.2  at  minimum 
to  8.3  at  maximum,  every  7-8  weeks. 

The  properties  of  some  of  the  classical  novae,  visible  by  the  unaided  eye,  are 
listed  in  Table  13.5  with  the  dwarf  nova  SS  Cygni. 


13.5.2  What  Makes  a  Nova  Happen? 

A  major  new  understanding  of  novae  occurred  in  the  1950s  and  1960s  when  a  few 
American  astronomers  began  to  examine  the  total  light  and  spectra  of  ex-novae, 
long  after  the  intense  light  of  the  nova  outburst  had  faded  to  a  relatively  weak 
level.  It  then  was  discovered  that  a  nova  is  not  one  star  but  rather  two  stars  very 
close  together.  Twenty  years  after  the  1934  eruption  of  Nova  Herculis,  for 
example,  Merle  F.  Walker  (1926-  )  found  that  this  nova  is  an  eclipsing  binary 
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system  with  a  remarkably  brief  orbital  period  of  only  4.6  h  (Walker  1954).  The 
shortness  of  the  period  indicated  that  the  two  stars  are  very  close  together,  prac¬ 
tically  touching  one  another.  Nearly  a  decade  later,  Walker  was  able  to  show  that 
Nova  T  Aurigae  1891  also  is  an  eclipsing  binary  system  with  a  short  period,  of 
4.8  h  (Walker  1963). 

Alfred  H.  Joy  (1882-1973)  had  examined  the  absorption  and  emission  lines  of 
the  dwarf  nova  SS  Cygni,  identifying  it  as  a  binary-star  system  with  a  short  orbital 
period  of  6.6  h  (Joy  1956).  The  emission  lines  originated  in  a  blue-white  dwarf 
star,  whereas  the  absorption  lines  came  from  a  red  main-sequence  star,  the  size  of 
which  was  estimated  to  be  roughly  half  the  distance  between  the  two  stars.  They 
were  so  close  to  one  another  that  mass  could  spill  from  the  red  star  into  the  blue 
star,  and  the  nova  process  might  be  related  to  this  mass  flow. 

In  the  meantime,  Robert  P.  Kraft  (1927-  )  demonstrated  that  membership  in  a 
short-period  binary  system  is  a  necessary  condition  for  a  star  to  become  a  nova  of 
either  the  classical  or  dwarf  type  (Kraft  1964).  One  of  the  stellar  pair  was  usually  a 
blue-white  dwarf  star;  the  other  red  component  was  usually  a  cool  main-sequence 
star  of  spectral  type  G,  K,  or  M  or  one  of  the  same  spectral  type  that  is  aging  and 
expanding  into  a  red  giant.  The  short  orbital  period  indicated  that  the  two  stars  are 
so  close  that  hydrogen  flows  from  the  red  companion  onto  the  white  dwarf, 
reviving  the  “dead”  star  and  giving  it  a  brief  new  life  in  a  cataclysmic  nuclear 
explosion.  The  term  cataclysmic  variable  star  is  used  now  to  designate  such  close 
binary  star  systems,  in  which  one  of  the  components  -  conventionally  called  the 
primary  star  -  is  a  white  dwarf  that  accretes  matter  from  its  secondary  companion. 
The  category  includes  classical,  dwarf,  and  recurrent  novae. 


Example:  Dwarf  nova  SS  Cygni 

SS  Cygni  is  a  double  star  system  with  an  orbital  period  of 
P  =  0.275  days  =  6.603  h  =  23,770  s.  It  consists  of  a  white  dwarf  of  mass 
Mi  =  0.60  M&  and  a  main  sequence  star  of  mass  M2  =  0.40  M0,  where  the 

on 

Sun’s  mass  M0  =  1.989  x  10  kg.  The  linear  separation,  a,  of  the  two 
stars  can  be  inferred  from  Kepler’s  third  law: 


+M2 


47 i2a3 

GP2 


(13.20) 


_  11  ^  _ O 

where  the  gravitational  constant  G  =  6.61  A  x  10  N  m  kg  .  Using 
Mi  +  M2  =  Mq  and  solving  for  the  linear  separation  we  obtain 
a  «  1.24  x  109  m  «  1.78  Rq,  where  the  Sun’s  radius  R&  =  6.955  x 

o 

10  m.  The  white  dwarf  star  has  a  radius  of  about  0.01  RQ  and  a  low-mass 
main-sequence  star  of  spectral  class  K5  might  have  a  radius  of  about  0.8  R& , 
so  the  two  stars  are  practically  touching  each  other.  It  therefore  is  not  sur¬ 
prising  that  mass  flows  from  the  main-sequence  star  onto  the  white  dwarf 
star,  resulting  in  outbursts  from  SS  Cygni  every  7-8  weeks. 
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Fig.  13.4  Nova  A  classical  nova  is  a  thermonuclear  explosion  that  occurs  on  the  surface  of  a 
white  dwarf  star  that  is  in  a  close  orbit  with  a  main-sequence  star.  The  strong  gravitational 
attraction  of  the  white  dwarf  pulls  its  nearby  companion  into  an  elongated  shape,  the  outer  edge 
of  which  is  designated  the  Roche  lobe.  Some  of  the  hydrogen  in  the  outer  atmosphere  of  the  main- 
sequence  star  spills  over  at  the  inner  Lagrangian  point,  denoted  LI,  where  the  gravitational  pull  of 
the  two  stars  is  equal.  This  hydrogen  spirals  into  a  rotating  accretion  disk  and  down  to  the  white 
dwarf,  igniting  an  explosion,  like  a  colossal  hydrogen  bomb.  (From  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 


The  immense  gravity  of  the  white  dwarf  star  distorts  the  shape  of  its  nearby 
companion,  stretching  it  into  an  elongated  configuration  (Fig.  13.4).  This  is  a  tidal 
effect  in  which  the  side  of  the  companion  star  that  is  nearest  the  white  dwarf  is 
pulled  toward  it,  and  the  companion’s  center  is  pulled  away  from  the  side  that  is 
farthest  from  the  white  dwarf.  Material  that  is  pulled  close  enough  to  the  white 
dwarf  can  pass  outside  the  gravitational  control  of  the  red  companion  and  go  into 
orbit  around  the  white  dwarf,  eventually  spiraling  into  it. 

The  region  in  which  the  companion  star  retains  gravitational  control  of  its 
substance  is  known  as  its  Roche  lobe ,  named  after  the  French  astronomer  Edouard 
A.  Roche  (1820-1883)  who  described  it  more  than  a  century  ago  (Roche  1849/ 
1850/1851).  When  the  outer  hydrogen  atmosphere  of  the  close  companion  star 
overflows  its  Roche  lobe,  it  does  not  fall  directly  into  the  white  dwarf  star.  In  the 
absence  of  an  intense  magnetic  field,  the  gas  is  accreted  into  an  orbiting  disk  that 
resembles  the  proto-planetary  disks  that  circle  protostars  (Sect.  12.3);  the  gas 
streams  onto  the  magnetic  poles  when  there  is  a  strong  magnetic  field  directing  the 
flow. 

In  either  case,  the  hydrogen  is  pulled  slowly  into  the  white  dwarf,  which 
compresses  and  heats  the  gas  to  very  high  temperatures.  The  accumulating 
hydrogen  is  a  potential  bomb  that  remains  harmless  until  detonated. 

A  thin  layer  of  hydrogen  slowly  builds  up  on  the  white  dwarf  as  its  companion 
keeps  feeding  matter  into  it.  The  pressure  and  temperature  rise  until  hydrogen 
fusion  suddenly  is  ignited,  at  about  10  million  K,  or  10  K.  A  runaway  thermo¬ 
nuclear  explosion  then  occurs,  and  a  white  dwarf  that  normally  would  cool  and 
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fade  away  if  left  alone  suddenly  shines  as  brightly  as  100,000  Suns  (Starrfield 
et  al.  1974,  1985).  The  explosion  subsides  after  a  few  weeks,  but  the  overflow  of 
the  companion  continues.  Gallagher  and  Starrfield  (1978)  have  reviewed  the 
theory  and  observation  of  classical  novae. 

The  envelope  of  the  white  dwarf  star  is  thrown  off  during  the  nova  explosion,  at 
high  speeds  of  up  to  several  thousand  kilometers  per  second.  However,  despite  the 
violence,  the  amount  of  material  ejected  is  only  about  0.000005  of  a  solar  mass. 
The  white  dwarf  therefore  can  retain  its  stability,  and  potentially  generate  addi¬ 
tional  novae  as  its  companion  continues  to  feed  matter  into  it.  An  example  of  such 
a  recurrent  nova  is  RS  Ophiuchi,  which  has  exploded  into  a  bright  nova  state  at 
least  six  times  between  1898  and  2006. 

Eventually,  the  hydrogen  may  build  up  until  it  pushes  the  white  dwarf  above  its 
limiting  mass;  the  entire  star  then  explodes,  not  just  the  thin  outer  atmosphere. 
This  is  a  supernova  that  suddenly  and  unpredictably  brightens  with  the  light  of 
1  billion  Suns. 


13.5.3  A  Rare  and  Violent  End,  the  Supernovae 

On  rare  occasions,  an  entire  star  is  annihilated  and  suddenly  becomes  so  bright  that 
it  can  be  seen  easily  in  daylight  rather  than  just  at  night  like  the  novae.  The 
Chinese  emperor’s  astronomers  in  the  Sung  dynasty  recorded  one  on  July  4,  1054, 
near  the  constellation  now  known  as  Taurus,  the  Bull.  The  Chinese  chronicles 
indicate  that  the  new  star  initially  was  brighter  than  everything  in  the  night  sky 
except  the  full  Moon;  could  be  seen  during  the  daytime  for  three  weeks  after  its 
first  appearance;  and  remained  visible  in  the  night  sky  for  22  months,  without  the 
aid  of  telescopes,  which  had  not  yet  been  invented.  The  “new”  star  of  1054  was 
definitely  far  brighter  and  longer  lasting  than  any  other  guest  star. 

More  than  four  centuries  passed  before  other  exceptionally  brilliant  guest  stars 
were  noticed,  and  this  time  they  shook  the  very  foundations  of  European  thought. 
As  Aristotle  taught,  heavenly  bodies  were  supposed  to  be  eternal,  pure,  changeless, 
incorruptible,  and  perfect,  unlike  anything  on  the  Earth.  Yet,  in  a  span  of  just 
32  years,  two  new  daytime  stars  could  be  seen  by  almost  anyone  in  the  Earth’s 
Northern  Hemisphere  who  happened  to  look  up.  Each  star  remained  fixed  in  the 
heavens  for  about  a  year,  and  then  disappeared  from  view. 

Both  events  were  also  discovered  at  a  time  before  telescopes  were  invented. 
The  Danish  astronomer  Tycho  Brahe  (1546-1601)  witnessed  the  first  visitor  in 
1572,  which  initially  was  brighter  than  the  planet  Venus.  Perhaps  because  of  the 
excitement  caused  by  his  discovery,  Brahe  built  an  observatory  in  which  detailed 
measurements  were  made  of  stars  and  the  planets.  Johannes  Kepler  (1571-1630), 
who  used  Brahe’s  observations  of  planets  to  determine  the  laws  of  their  motion, 
spied  another  bright  new  star  as  it  lit  up  the  heavens  in  1604. 

The  exceptionally  brilliant  guest  stars  of  1054,  1572,  and  1604  were  much 
brighter  and  longer  lasting  than  the  conventional  novae  known  at  the  time  or 
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Table  13.6  Historical  supemovae  visible  with  the  unaided  eyea 


Explosion 

Date 

r^max 

M max 

‘-‘max 

(L0) 

Visible 

(months) 

Type 

Remnant 

name 

D 

(ly) 

e 

(!) 

R 

(ly) 

SN  185 

-8.0 

-20.2 

io100 

8 

la 

RCW  86 

9,100 

45 

56 

SN  386 

+1.5 

— 

— 

3 

— 

— 

— 

— 

— 

SN  393 

-1.0 

-11.0 

106-3 

8 

II/Ib 

b 

3,000 

70 

30 

SN  1006 

-7.5 

-19.2 

109-6 

21 

la 

PKS 

1451-41 

7,200 

31 

32 

SN  1054 

-6.0 

-17.5 

o 

00 

VO 

22 

II 

Crab  nebula 

6,500 

7 

6.6 

SN  1181 

-1.0 

— 

— 

6 

c 

>8,000 

— 

— 

SN  1572d 

-4.0 

-16.4 

00 

O 

H 

16 

la 

Tycho 

11,500 

8.3 

14 

SN  1604e 

-2.5 

-16.4 

in 

00 

O 

H 

12 

la 

Kepler 

20,000 

3.2 

9 

a  Maximum  apparent  visual  magnitude,  mmax,  maximum  absolute  magnitude,  Mmax,  maximum 

r\  s'  1 

luminosity,  Lmax,  in  units  of  the  Sun’s  luminosity  L0  =  3.828  x  10  J  s”  ,  length  of  visibility 
to  the  unaided  eye,  supernova  type,  supernova  remnant  name,  distance  D  in  light-years,  abbre¬ 
viated  ly,  angular  diameter  9  of  supernova  remnant  in  minutes  of  arc,  denoted  ',  and  remnant 
radius,  R,  in  light-years 
b  Supernova  remnant  RX  J17 13.7-3946 

c  Radio  source  3C  38.  The  radio  and  x-ray  pulsar  J0205+6449  may  not  be  associated  with  SN 
1181 

d  Supernova  explosion  also  known  as  Tycho’s  star,  supernova  remnant  3C  10 
e  Supernova  explosion  also  known  as  Kepler’s  star,  supernova  remnant  3C  358 


subsequently.  They  therefore  have  been  dubbed  supernovae ,  a  term  coined  by  the 
Swiss  astronomer  Fritz  Zwicky  (1898-1974). 

Now  that  we  know  the  distances,  a  supernova  also  refers  to  a  stellar  outburst  the 
maximum  luminosity  of  which  exceeds  by  factors  of  several  billions  the  lumi¬ 
nosity  of  our  Sun  (Table  13.6).  This  is  millions  of  times  the  peak  luminosity  of  a 
classical  nova.  Moreover,  unlike  novae,  the  new  breed  of  exploding  stars  has 
nothing  conventional  to  return  to  after  the  explosion.  They  require  so  much  energy 
that  the  mass  of  an  entire  star  is  annihilated. 

In  1934,  Walter  Baade  (1893-1960)  and  Fritz  Zwicky  communicated  to  the 
United  States  National  Academy  of  Sciences  a  remarkable  pair  of  papers  on 
supernovae  (Baade  and  Zwicky  1934a,  b).  In  one  paper,  they  showed  that  the 
enormous  energy  emitted  in  the  supernova  process  corresponds  to  the  total  con¬ 
version  of  an  appreciable  fraction  of  a  star’s  mass  into  energy.  In  the  second  paper, 
they  predicted  that  a  supernova  explosion  will  accelerate  charged  particles  to  very 
high  energies  and  that  supernovae  that  occur  only  once  in  a  millennium  can 
account  for  the  energetic  cosmic-ray  particles  that  now  rain  down  on  the  Earth’s 
atmosphere  from  all  directions  in  outer  space.  In  this  more  speculative  paper,  the 
two  Caltech  astronomers  also  said  that  the  collapsing  core  of  the  explosion  might 
become  a  neutron  star,  of  very  small  radius  and  extremely  high  mass  density.  It 
took  a  half-century  for  astronomers  to  realize  that  Baade  and  Zwicky  were  correct 
on  all  counts. 
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The  initial  evidence  for  supernovae  was  extraordinarily  sparse.  In  the  Milky 
Way,  they  are  seen  at  intervals  of  roughly  100  years,  which  is  about  3,000  times 
less  common  than  dwarf  novae;  so  it  might  take  centuries  before  the  next  super¬ 
nova  could  be  observed  in  the  Milky  Way.  Fortunately,  it  was  found  that  super¬ 
novae  occur  more  frequently  in  the  many  spiral  nebulae  outside  our  Milky  Way. 

In  the  early  decades  of  the  twentieth  century,  astronomers  discovered  numerous 
faint  novae  in  spiral  nebulae,  which  suggested  that  the  spirals  were  very  distant  if 
these  “new”  stars  were  like  the  classical  novae  seen  in  the  Milky  Way.  At  that 
distance,  an  exceptionally  bright  nova,  observed  in  the  nearest  spiral  nebula 
Andromeda  in  1885,  would  have  the  luminosity  of  a  supernova. 

When  the  enormous  distances  to  the  spiral  nebulae  were  confirmed,  it  was 
realized  that  they  are  not  nebulae  at  all  but  instead  galaxies  that  each  contain  about 
100  billion  stars  (Sect.  14.2).  Moreover,  at  maximum,  a  supernova  briefly  will 
outshine  1  billion  stars  in  the  same  galaxy.  Light  from  hundreds  of  supernovae  in 
distant  galaxies  might  be  on  its  way  to  us  now.  Their  light  could  take  many 
thousands  or  even  billions  of  years  to  travel  to  the  Earth. 

Because  there  are  many  of  these  extragalactic  spirals,  now  called  galaxies, 
Zwicky  realized  that  a  systematic  photographic  survey  quickly  would  catch  at  least 
one  star  in  the  act  of  supernova  explosion  -  and  he  was  right.  He  detected  the  first 
one  in  1937,  with  a  camera  attached  to  a  modest  telescope  placed  on  the  roof  of  a 
building  at  Caltech  (Zwicky  1937).  This  supernova  occurred  in  the  spiral  galaxy 
NGC  4157,  now  known  to  be  about  55  million  light-years  away,  so  the  actual 
explosion  occurred  55  million  years  before  Zwicky  saw  it,  the  time  it  took  for  its 
light  to  travel  the  vast  distance  separating  the  galaxy  from  us. 

By  observing  the  spectra  and  fading  light  of  supernovae  in  distant  galaxies, 
astronomers  subsequently  found  that  there  are  two  methods  for  stars  to  come  to 
such  a  violent  end.  The  German-American  astronomer  Rudolph  Minkowski 
(1895-1976),  for  example,  divided  the  supernovae  into  two  categories,  denoted 
Type  I  and  Type  II,  distinguished  by  the  absence  or  presence  of  hydrogen  in  their 
spectra  (Minkowski  1941).  Type  I  was  subsequently  divided  into  three  categories. 
Type  la  exhibits  a  strong  absorption  feature  of  singly  ionized  silicon  at  a  wave¬ 
length  of  615  nm  in  their  spectra  near  peak  light;  Types  lb  and  Ic  do  not  display 
this  spectral  feature.  Modern  identifying  characteristics  of  supemovae  of  different 
types  are  given  in  Table  13.7,  including  the  important  Type  la  and  Type  II. 
Filippenko  (1997)  has  reviewed  the  optical  spectra  of  supernovae. 

The  peak  light  output  from  a  Type  I  supernova  typically  is  one  or  two  orders  of 
magnitude  more  luminous  than  that  of  the  fainter  Type  II  supernovae,  which  fade 
more  slowly.  We  now  know  that  the  decay  of  radioactive  elements  produced 
during  high-temperature  Type  II  explosions  heats  the  expanding  gas  and  produces 
the  optically  visible  light. 

Walter  Baade  used  Tycho’s  observations  of  the  decaying  light  from  the  brilliant 
1572  supernova  to  demonstrate  that  it  is  consistent  with  the  superluminous 
emission  of  a  supernova  of  Type  I  (Baade  1945;  Van  Den  Bergh  1993);  Kepler’s 
meticulous  observations  of  the  1604  event  indicated  a  Type  II  light  variation.  Both 
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Table  13.7  Characteristics  of  supernova  typesa 


Characteristic 

Type  la 

Type  lb 

Type  II 

Optical  spectrum 

No  hydrogen  Si  II  at 

No  hydrogen  He  I  at 

Hydrogen  present  at 

615.0  nm 

587.6  nm 

656.3  nm 

Maximum 

luminosity 

0 

OC 

On 

o 

H 

«  109'1  L0 

1091  Le 

Ejection  velocity 

>104  km  s-1 

>104  km  s-1 

<104  km  s-1 

Ejected  mass 

~  1  Mq 

~  1  M0 

«5  Mq 

Progenitor  star 

White  dwarf 

Wolf-Ray  et 

Supergiant 

Progenitor  star 
mass 

1MS 

4-7  Mq 

>8  Mq 

a  Maximum  luminosity  in  units  of  the  Sun’s  luminosity  L0  =  3.828  x  1026  J  s  1  ,  and  mass 

OA 

values  in  units  of  the  Sun’s  mass  M0  =  3.854  x  10'  kg 


of  these  supernovae  were  observed  before  telescopes  were  invented  and  spectra 
were  obtained  from  cosmic  objects. 

Although  the  two  types  of  supernovae  release  comparable  amounts  of  total 
energy  during  their  explosion,  there  is  a  radical  difference  in  the  mass  and  kinetic 
energy  of  their  ejected  material.  The  expanding  shells  of  Type  la  contain  roughly 
1  solar  mass  of  material,  whereas  those  of  Type  II  events  are  about  five  times  more 
massive.  The  large  mass  difference  suggests  that  the  progenitor  stars  of  Type  la 
supernovae  are  less  massive  than  those  of  Type  II.  There  is  comparatively  little 
discussion  of  the  reasons  for  Type  lb  and  Ic  supernovae  in  the  scientific  literature. 
Like  supernovae  of  Type  II,  they  are  probably  massive  stars  that  have  run  out  of 
nuclear  fuel  at  their  centers.  Type  lb  supernovae  may  be  related  to  the  core 
collapse  of  massive  Wolf-Ray et  stars  that  have  lost  hydrogen  by  strong  winds. 


13.5.4  Why  do  Supernova  Explosions  Occur? 

Both  types  of  supernovae  involve  the  explosive  conversion  of  a  star’s  entire  mass 
into  energy  but  by  different  physical  mechanisms.  A  Type  la  stellar  explosion  is 
due  to  external  causes.  It  involves  a  white  dwarf  star  pushed  into  nuclear  explosion 
by  too  much  mass  overflow  from  a  nearby  companion.  The  other,  Type  II, 
supernova  is  an  internal  event,  which  occurs  during  the  gravitational  collapse  of 
the  iron  core  of  a  massive  star  that  has  depleted  all  of  its  energy.  The  nuclear 
supernovae  of  Type  la  and  gravity-powered  ones  of  Type  II  are  believed  to  occur 
with  about  equal  likelihood  in  the  Milky  Way,  at  the  rate  of  1  every  50-100  years. 

Weiler  and  Sramek  (1988)  and  Trimble  (1982,  1983)  have  reviewed  our 
knowledge  of  supemovae  and  supernova  remnants.  Woosley  and  Weaver  (1986) 
have  reviewed  the  physics  of  supernova  explosions.  Bethe  (1990)  has  reviewed  our 
understanding  of  supernova  mechanisms. 

Hillebrandt  and  Niemeyer  (2000)  review  Type  la  supernova  explosion  models. 
Smartt  (2009)  has  reviewed  the  progenitor  stars  of  Type  II  supernovae.  Heger  et  al. 
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(2003)  describe  how  massive  single  stars  end  their  lives.  Langer  (2012)  has 
reviewed  the  pre-supernova  evolution  of  both  massive  single  and  massive  binary 
stars,  and  Paczynski  (1971)  has  reviewed  evolutionary  processes  in  close  binary 
systems.  The  next  two  subsections  of  the  text  provide  a  general  description  of  the 
two  types  of  supernova  explosions. 


13.5.5  When  a  Nearby  Star  Detonates  Its  Companion 

Like  the  novae,  there  is  one  type  of  supernova  that  gets  assistance  from  the 
outside,  being  pushed  over  the  edge  into  explosion,  shattering  an  entire  star.  Such  a 
supernova  -  now  known  as  Type  la  and  characterized  by  the  absence  of  emission 
from  hydrogen  -  occurs  in  a  close  binary-star  system,  with  a  white  dwarf  star  -  the 
shrunken  dense  remnant  of  a  former  low-mass  star  -  circling  a  main-sequence  star. 

The  English  astrophysicist  Fred  Hoyle  (1915-2001)  and  his  American 
colleague  William  A.  “Willy”  Fowler  (1911-1995)  introduced  the  detailed 
mechanisms  for  this  type  of  supernova  (Hoyle  and  Fowler  1960;  Fowler  and  Hoyle 
1964).  When  the  nearby  companion  star  expands  as  a  result  of  its  normal  evolu¬ 
tion,  hydrogen  from  its  outer  atmosphere  spills  onto  the  white  dwarf.  The  over¬ 
flow,  for  example,  might  happen  when  the  ordinary  visible  companion  runs  out  of 
core  hydrogen  fuel  and  swells  into  a  red  giant  star.  As  the  hydrogen  overflow 
continues,  a  steady  increase  in  the  mass  of  the  white  dwarf  will  compress  and  heat 
the  star.  As  the  increasing  mass  approaches  the  upper  mass  limit  for  a  white  dwarf 
star  at  1.46  solar  masses,  the  rise  in  internal  temperature  ignites  a  nuclear 
explosion. 

Because  the  white  dwarf  is  supported  against  gravity  by  temperature-inde¬ 
pendent,  degenerate  electron  pressure,  adding  heat  to  the  star’s  interior  increases 
the  temperature  but  not  its  pressure;  therefore,  the  white  dwarf  does  not  expand 
and  cool  in  response.  Instead,  the  increased  temperature  initiates  the  fusion  of 
carbon  nuclei  in  a  runaway  nuclear  explosion  that  obliterates  the  star  in  a  few 
seconds  and  releases  about  1049  J  in  energy. 

In  other  words,  the  added  mass  detonates  a  carbon  bomb,  triggering  explosive 
nuclear  reactions  that  quickly  spread  throughout  the  star  and  completely  shatter  it. 
The  entire  star  explodes  into  a  Type  la  supernova  that  shines  with  the  light  of 
billions  of  Suns,  and  there  is  nothing  left. 

Because  every  one  of  these  explosions  is  triggered  at  the  same  mass  limit,  under 
similar  conditions,  and  also  because  the  star  is  completely  destroyed,  Type  la 
supernovae  are  expected  to  produce  about  the  same  maximum  light  output  every 
time  they  occur.  The  typical  absolute  visual  magnitude  is  —19.3,  or  about  5  billion 
times  the  luminosity  of  the  Sun,  with  little  variation.  Astronomers  use  this  bright, 
uniform  luminosity  as  a  “standard  candle”  to  measure  the  distances  to  their  host 
galaxies  located  far  beyond  the  Milky  Way  in  the  remote  parts  of  the  observable 
universe,  thereby  determining  the  pace  of  its  expansion.  They  have  shown  that  a 
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repulsive  force,  called  dark  energy,  is  making  the  universe  expand  at  an  acceler¬ 
ating  rate  (Sect.  15.6). 


13.5.6  Stars  that  Blow  Themselves  Up 

There  is  more  than  one  way  to  explode  a  star,  and  some  of  the  supernovae  are 
gravity-powered,  catastrophic  outbursts  from  very  old  massive  stars.  This  method 
of  shattering  a  star  applies  to  an  isolated  star  with  the  right  mass  -  between  about  8 
and  20  times  the  Sun’s  mass  -  that  blows  itself  apart.  This  Type  II  supernova, 
which  exhibits  hydrogen  in  its  spectra,  follows  the  creation  of  an  iron  core  within 
an  evolving,  massive  supergiant  star.  Smartt  (2009)  has  reviewed  the  progenitors 
of  such  core-collapse  supernovae. 

The  material  in  the  core  of  such  a  massive  supergiant  star  is  not  degenerate,  in 
the  mathematical  sense  used  for  white  dwarf  stars;  nuclear  reactions  proceed  in  the 
advanced  burning  stages  at  ever-increasing  central  temperatures  until  an  iron  core 
is  produced  and  all  of  the  available  nuclear  fuel  has  been  exhausted.  Deprived  of 
these  resources,  the  iron  core  collapses  under  its  own  weight  into  a  neutron  star  or 
black  hole  in  less  than  1  s,  and  an  explosion  blows  away  the  rest  of  the  in-falling 
matter. 

When  the  iron  nuclei  in  the  core  of  such  a  star  are  pushed  together,  no  energy  is 
released.  The  iron  does  not  burn,  regardless  of  how  hot  the  star’s  core  becomes. 
So,  there  is  no  longer  any  energy  being  generated  to  sustain  the  star’s  structure. 
Then,  a  massive  star,  having  burned  brightly  for  perhaps  10  million  years,  can  no 
longer  support  its  own  crushing  weight  and  the  iron  core  collapses. 

The  central  iron  core  can  be  crushed  into  a  ball  no  bigger  than  New  York  City 
in  less  than  1  s,  accruing  energy  from  its  in-fall.  Electrons  are  squeezed  inside  the 
iron  nuclei,  combining  with  their  protons  to  make  neutrons.  The  material  is 
compacted  to  nuclear  density,  and  the  center  collapses  to  form  a  neutron  star.  If  the 
collapsing  core  is  more  massive  than  about  3  solar  masses,  however,  the  collapse 
proceeds  to  the  formation  of  a  black  hole. 

Having  lost  the  supporting  core,  the  surrounding  material  first  plunges  in 
toward  the  center.  When  reaching  mass  densities  approaching  that  of  an  atomic 
nucleus,  the  collapsing  core  bounces  back  and  a  powerful  shock  wave  pushes  out 
against  the  rest  of  the  star.  With  the  help  of  a  dense  shower  of  neutrinos  produced 
in  the  collapsing  core,  the  star’s  outer  layers  are  torn  apart  and  expelled  into  deep 
space  at  supersonic  speeds.  The  doomed  star  suddenly  increases  in  brightness  100 
million  fold,  becoming  a  Type  II  supernova  that  briefly  outshines  up  to  1  billion  of 
its  neighboring  stars  combined  (Fig.  13.5). 

This  type  of  supernova  hurls  into  surrounding  space  all  of  the  elements  syn¬ 
thesized  inside  the  star  and  residing  in  shells  surrounding  the  iron  core  before  its 
collapse.  During  the  high-temperature  explosions,  the  supernova  also  produces 
vast  amounts  of  other  heavy  elements,  including  radioactive  elements  such  as 
uranium.  Newly  formed  radioactive  nickel,  for  example,  eventually  decays  into 
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Fig.  13.5  Type  II  supernova  In  this  type  of  supernova  explosion,  an  isolated  star  blows  up  and 
its  shattered  remains  are  propelled  into  surrounding  space.  Radio  and  x-ray  radiation  from  the 
expanding  supernova  remnant  can  be  observed  for  thousands  of  years  after  the  explosion.  The 
core  of  the  star  is  compressed  by  gravitational  contraction  into  a  neutron  star  or  a  black  hole. 
Neutrinos  emitted  from  the  collapsing  core  remove  most  of  the  supernova  energy  and  assist  shock 
waves  in  pushing  the  stellar  remains  into  an  expanding  remnant.  (From  “The  Life  and  Death  of 
Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press,  2013.  Reprinted  with 
permission.) 

iron,  producing  most  of  the  iron  now  found  in  the  universe.  Such  a  supernova 
occurred  within  a  nearby  satellite  galaxy  of  the  Milky  Way  in  1987,  which  led  to 
new  insights  about  how  these  explosions  occur. 


13.5.7  Light  of  a  Billion  Suns,  SN  1987 A 

For  more  than  three  and  a  half  centuries,  nobody  was  fortunate  enough  to  see  a 
supernova  with  the  unaided  eye.  Then,  late  in  the  evening  of  February  24,  1987, 
astronomers  discovered  one  (Fig.  13.6),  which  was  designated  SN  1987 A  (SN  is 
for  supernova;  1987A  denotes  the  first  one  discovered  that  year).  The  star  exploded 
168,000  years  before  that  night  in  the  Large  Magellanic  Cloud,  one  of  the  two 
satellite  galaxies  of  the  Milky  Way  visible  from  the  Earth’s  Southern  Hemisphere. 
It  generated  an  intense  burst  of  light  that  peaked  in  visual  brightness  at  the  third 
magnitude. 

Astronomers  at  the  Las  Campanas  Observatory,  a  barren  mountaintop  near  La 
Serena,  Chile,  were  the  first  to  notice  the  new  star  when  Ian  Shelton  (1957-  ) 
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Fig.  13.6  Light  echoes  from  SN  1987 A  Two  complete  rings  of  light  surround  the  exploded  star 
SN  1987 A  in  this  negative  image  taken  with  the  3.9  m  (153.5  inch)  Anglo-Australian  Telescope 
on  15  July  1988.  The  initial  flash  of  light  from  the  supernova  explosion  has  been  reflected  off 
clouds  of  interstellar  dust  and  observed  14  months  after  the  explosion  was  brightest,  somewhat 
like  an  echo  of  sound.  These  light  echoes  arise  in  two  thin  sheets  of  microscopic  dust  grains 
located  about  470  light-years  ( inner  ring )  and  1,300  light-years  {outer  ring)  in  front  of  the 
supernova.  The  rings  have  been  made  more  prominent  by  photographically  subtracting  an  image 
taken  3  years  before  the  supernova  exploded,  canceling  much  that  existed  previously.  Stars, 
however,  are  still  visible  as  faint  haloes.  (Courtesy  of  David  Malin  and  the  Anglo-Australian 
Observatory.) 


photographed  it  at  3  o’clock  in  the  morning  using  a  small  0.25  m  (10  inch) 
telescope  placed  in  an  unheated  shed.  The  discovery  was  relayed  to  the  Interna¬ 
tional  Astronomical  Union’s  clearinghouse  for  such  events,  which  relayed  the 
news  to  astronomers  throughout  the  world.  By  the  next  evening,  nearly  all  major 
radio  and  optical  telescopes  south  of  the  Equator  were  observing  the  supernova. 
One  month  later,  it  was  featured  on  the  cover  of  Time  magazine  with  just  one 
word:  “BANG!”  Astronomers  were  still  watching  its  expanding  debris  years  after 
the  exploding  star  hurled  it  into  space. 

McCray  (1993),  Arnett  et  al.  (1989),  and  Trimble  (1988)  have  provided  us  with 
reviews  of  Supernova  1987 A. 

SN  1987 A  was  a  gravity-powered,  iron-catastrophe  Type  II  supernova,  gen¬ 
erated  during  the  core  collapse  of  a  former  blue  supergiant  star  that  began  life  with 
a  mass  of  about  20  times  that  of  the  Sun.  The  physical  properties  of  this  progenitor 
star  and  its  subsequent  supernova  explosion  are  listed  in  Table  13.8. 
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Table  13.8  Supernova  SN  1987Aa 

Progenitor  star 

Name:  Sanduleak  —  69°202 

Spectral  Type:  B3  la  (blue  supergiant) 

Location:  Large  Magellanic  cloud 
Distance:  168,000  light-years 
Radius:  3  x  101()  m  w  50  RQ 
Effective  temperature  =  16,000  K 
Luminosity  =  4.6  x  1031  J  s-1  «  105  L0 
Mass  16  Mq 
Neutrino  burst 

Number  of  neutrinos  detected:  19  anti-neutrinos 
Energy  of  each  neutrino:  &  20  MeV  =  3.2  x  10-12  J 
Neutrino  flux  at  Earth:  5  x  10 14  m- 2 

4TO 

Number  of  neutrinos  emitted:  «  10  neutrinos 

A  O 

Energy  released  in  neutrinos  10"°  J 
Duration  of  neutrino  burst:  «  10  s 
Neutrino  luminosity:  ^1048  J  s_1  w  1022  L0 
Visible  explosion 

Peak  visible  luminosity:  3.8  x  1033  J  s_1  w  10 7  L0 

"j  _ i 

Velocity  of  ejected  material:  «  10  m  s 

Mass  of  ejected  material  ^  8  x  1030  kg  «  4  M0 

Kinetic  energy  of  ejected  material  «4  x  1044  J 

a  The  symbols  RQ,  L0  and  M0  respectively  denote  the  radius,  luminosity  and  mass  of  the  Sun 


One  of  the  more  interesting  observations  of  SN  1987 A  was  made  from  beneath 
the  Earth’s  surface,  when  massive  subterranean  instruments  detected  a  few  neutrinos 
emitted  during  the  explosion.  Although  solar  neutrinos  had  been  observed  coming 
from  nuclear  reactions  that  power  the  Sun,  all  other  stars  are  so  far  away  and  the 
number  of  their  neutrinos  striking  the  Earth  is  so  low  that  they  had  never  been 
detected  coming  from  any  other  cosmic  object.  It  had  nevertheless  been  proposed 
that  a  supernova  might  generate  a  great  number  of  neutrinos  (Focus  13.2). 


Focus  13.2  Neutrinos  generated  during  a  supernova 

As  first  realized  by  George  Gamow  (1904-1968)  and  his  Brazilian  colleague 
Mario  Schonberg  (1914-1990),  the  temperature  of  a  collapsing  stellar  core 
can  become  high  enough  to  create  both  neutrinos  and  antineutrinos,  which 
can  easily  escape  and  carry  away  prodigious  amounts  of  energy  (Gamow  and 
Schonberg  1941).  This  would  lead  to  further  loss  of  supporting  pressure,  the 
implosion  of  the  core  with  a  rapid  rise  in  temperature,  and  an  explosion  as  a 
supernova.  They  named  the  nuclear  transformation  associated  with  neutrino 
energy  loss  the  Urea  process,  after  its  similarity  to  the  gambling  operations 
at  the  Casino  de  Urea  near  Rio  de  Janeiro.  There  also,  no  matter  how  you 
played  the  game,  you  always  seemed  to  lose. 
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Today  we  know  that  supernova  neutrinos  may  be  emitted  during  nuclear 
processes  other  than  the  Urea  process,  but  the  basic  idea  was  correct.  Vast 
numbers  of  neutrinos  are  released  as  the  crushed  iron  core  of  a  dying  star  is 
broken  into  its  subatomic  components.  A  neutrino  is  produced  each  time  a 
proton  and  an  electron  combine,  and  pairs  of  neutrinos  and  antineutrinos  are 
created  when  the  core  temperature  reaches  100  billion  K.  All  of  these  neutrinos 
are  without  electrical  charge,  have  almost  no  mass,  and  move  unimpeded  at 
nearly  the  speed  of  light  through  nearly  any  amount  of  matter,  even  the  entire 
Earth.  But  if  enough  cosmic  neutrinos  were  directed  at  the  Earth,  massive, 
subterranean  instruments  might  detect  a  small  number  of  them. 


For  several  seconds,  the  relatively  nearby  supernova  explosion  SN  1987 A 
generated  such  enormous  amounts  of  the  elusive  neutrinos  that  a  small  number 
were  detected.  They  were  recorded  in  two  underground  neutrino  detectors  3  hours 
before  the  first  visible  sighting  of  SN  1987 A. 

Only  19  neutrinos  were  detected  flashing  through  the  underground  darkness. 
Yet,  even  this  small  number  signaled  the  presence  of  an  awesome  energy,  vastly 
exceeding  the  amount  contained  in  the  radiation  and  expanding  debris  of  the 
supernova  (also  see  Table  13.8).  They  indicated  that  10  billion  trillion  trillion 

co 

trillion,  or  10  ,  neutrinos  were  produced  by  the  exploding  star.  As  the  iron  core 

A/r 

imploded,  the  neutrinos  carried  away  energy  of  about  5  x  10  J,  or  roughly  99  % 
of  the  explosion  energy,  and  emitted  a  total  neutrino  luminosity  comparable  to  that 
of  the  optically  visible  luminous  output  of  more  than  ten  thousand  billion  billion 
stars  like  the  Sun. 


Example:  Energy  of  a  supernova 

During  the  supernova  explosion  of  SN  1987 A,  about  a  solar  mass,  or 
1.0  MQ  =  1.989  x  1030  kg  was  ejected  with  a  velocity  V  of  about 

^ 7  _  1  o 

10  m  s  .  The  kinetic  energy  of  the  ejected  mass  is  equal  to  0.5  MQV  « 
1044  J.  The  binding  energy  released  in  forming  a  central  neutron  star  of 
radius  RNS  is  equal  to  EG  =  GMq/Rns  ^  1.9  x  10  J,  where  the  gravita¬ 
tional  constant  G  =  6.674  x  10“ 11  m3  kg-1  s-2  and  RNS  =  14  km.  The 
energy,  E ,  released  in  completely  destroying  a  solar-mass  star  is 
E  =  MqC 2  ^  1.8  x  1047  J,  where  the  speed  of  light  c  =  2.9979  x 

o  _ 1 

10  m  s  .  The  energy  of  each  detected  neutrino  was  20  MeV  =  3.2  x 
10  "  J,  and  the  neutrino  flux  at  Earth  was  5  x  10  m  .  The  number  of 
neutrinos  emitted  is  equal  to  the  product  of  this  flux  times  the  area  at  the 
Earth’s  distance,  or  \i iLt,  where  the  distance  D  =  168,000  light- 

91  IS 

years  =  1.59  x  10"  m  and  1  light-year  =  9.46  x  10iJ  m,  so  the  total 

14  9  S& 

number  of  neutrinos  emitted  is  5  x  101"  x  4tzD  =  1.58  x  10Jo.  The  total 

CO 

energy  emitted  by  the  neutrinos,  Eneutrino,  is  about  1.58  x  10-  x  energy  per 

A/r 

neutrino  ^  5  x  10  J. 
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The  neutrinos  detected  from  SN  1987A  (Bionta  et  al.  1987;  Hirata  et  al.  1987) 
solved  one  of  the  thornier  problems  in  understanding  such  a  supernova  explosion. 
It  was  known  that  a  stellar  collapse  generates  tremendous  amounts  of  energy,  but 
there  was  difficulty  explaining  how  that  energy  was  transferred  from  the  collapsing 
core  into  the  outer  layers  of  the  star  in  sufficient  amounts  to  produce  an  explosion. 
The  core  might  rebound,  sending  shock  waves  propagating  into  the  surrounding 
material,  but  computer  simulations  indicated  that  the  shock  waves  could  not  blow 
away  the  rest  of  the  star.  They  always  became  stalled  when  encountering  the  in¬ 
falling  matter  from  the  outer  layers. 

Unlike  the  stalled  shock  waves,  the  flood  of  escaping  neutrinos  carries  tre¬ 
mendous  amounts  of  energy  far  away  from  the  stellar  core,  a  very  small  fraction  of 
which  gets  caught  in  the  in-falling  outer  layers  of  the  collapsing  star,  heating  up 
the  gas  to  a  temperature  of  more  than  10  billion  K.  This  produces  a  buoyant, 
convecting  bubble  of  energy  that  reverses  the  in-fall  and  powers  the  explosion. 

Three  hours  after  the  initial  collapse  and  generation  of  neutrinos  in  SN  1987 A, 
its  heated  bubble  expanded,  driving  shock  waves  before  it,  and  burst  through  the 
surrounding  material,  breaking  the  star  apart  and  hurling  its  pieces  into  space, 
which  produced  the  dazzling  light  of  the  supernova.  This  explains  why  the  neu¬ 
trinos  were  detected  3  hours  before  any  light  was  seen. 


13.5.8  Will  the  Sun  Explode? 

There  is  no  explosion  forecast  for  the  Sun’s  future.  It  is  going  out  alone,  passing 
into  its  final  resting  state  unaccompanied  by  a  close  companion.  Although  it  will 
end  up  as  a  dense,  high-gravity  white  dwarf  star,  even  the  nearest  star  still  will 
remain  far  beyond  the  dead  Sun’s  gravitational  embrace,  never  orbiting  it. 
Moreover,  our  Sun  is  nowhere  near  massive  enough  to  ever  explode  by  itself.  So, 
no  nova  or  supernova  is  expected  when  the  Sun  runs  out  of  nuclear  fuel.  It  will  go 
quietly  into  the  oblivion  of  permanent  night. 


13.6  Expanding  Stellar  Remnants 

In  their  explosive  death,  stars  that  go  supernova  blast  their  outer  layers  into  sur¬ 
rounding  space,  expelling  much  or  all  of  the  stellar  material  at  supersonic  speeds 
of  up  to  30,000  km  s-1,  or  1/1 0th  of  the  speed  of  light.  A  strong  shock  wave 
forms  ahead  of  the  ejected  material,  colliding  with  the  surrounding  interstellar  gas 
and  heating  it  up  to  temperatures  of  tens  of  millions  of  K.  The  high-temperature 
material  emits  intense  x-rays  that  have  been  observed  with  instruments  aboard 
spacecraft  located  above  the  Earth’s  obscuring  atmosphere,  thereby  recording  the 
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debris  of  cataclysmic  stellar  explosions  that  occurred  even  thousands  of  years  ago, 
before  recorded  history. 

Like  the  explosions  that  cast  this  material  out,  there  are  two  types  of  supernova 
remnants.  They  can  be  the  remains  of  a  white  dwarf  star  sent  into  explosion  by  a 
nearby  companion  in  a  Type  la  supernova  or  the  explosive  debris  of  a  single 
massive  star  that  has  expired  in  a  Type  II  supernova.  Weiler  and  Sramek  (1988) 
and  Trimble  (1982,  1983)  have  reviewed  our  knowledge  of  both  supernovae  and 
supernova  remnants. 

The  Chandra  X-ray  Observatory ,  for  example,  has  imaged  the  Tycho  supernova 
remnant ,  named  for  Tycho  Brahe  (1546-1601)  who  reported  observing  the  ori¬ 
ginal  explosion  in  1572.  A  Type  la  supernova  formed  this  remnant,  when  a  white 
dwarf  was  sent  into  annihilation  by  overflow  from  a  nearby  companion  star.  An  arc 
of  x-ray  emission  in  the  supernova  remnant  was  attributed  to  material  blown  off 
the  companion  star,  which  otherwise  survived  the  destruction  of  its  neighbor  and 
now  is  moving  within  the  remnant  more  quickly  than  its  neighbors  as  the  result  of 
the  explosion.  The  properties  of  the  arc  and  remaining  star  indicate  that  the  former 
white  dwarf  star  and  its  companion  once  orbited  one  another  in  a  five-day  period  at 
a  separation  of  less  than  1  /  10th  of  the  mean  distance  between  the  Earth  and  the 
Sun,  or  0.1  AU  (Lu  et  al.  2011). 

Supernova  remnants  often  emit  intense  radio  radiation.  The  majority  of  these 
radio  supernova  remnants  appear  as  bright  rings,  or  shells,  in  projection  against  the 
sky.  The  intense  radio  radiation  cannot  be  produced  by  a  hot  gas,  like  x-rays,  but 
instead  is  emitted  by  electrons  accelerated  to  high  speeds  by  the  supernova 
explosion  and  spiraling  in  a  magnetic  field. 

A  beautiful  example  of  an  expanding  shell-like  supernova  remnant  is  the 
Cassiopeia  A  supernova  remnant ,  abbreviated  Cas  A.  It  is  located  roughly  10,000 
light-years  away  in  the  direction  of  the  constellation  Cassiopeia  and  is  the  brightest 
radio  source  in  the  sky.  The  expanding  shell  of  Cas  A  is  also  a  strong  source  of 
x-rays,  emitted  by  a  50-million-K  gas  (Fig.  13.7). 

Despite  its  radio  and  x-ray  brilliance,  the  remnant  is  faint  at  optically  visible 
wavelengths,  which  nevertheless  indicate  that  it  is  rich  in  oxygen  and  now 
expanding  at  a  speed  of  about  5,000  km  s-1.  The  Type  II  supernova  explosion 
would  have  been  observed  as  a  daytime  star  around  1680,  but  there  are  no  his¬ 
torical  records  of  the  event.  Thick  clouds  of  interstellar  dust,  or  material  ejected 
from  the  massive  star’s  outer  layers,  may  have  absorbed  the  light  and  rendered  the 
explosion  optically  invisible. 

The  most  spectacular  example  of  a  Type  II  supernova  remnant  is  the  Crab 
Nebula  supernova  remnant  (Fig.  13.8,  Baade  1957),  also  one  of  the  brightest  radio 
sources  in  the  sky.  The  Crab  Nebula,  also  designated  M  1,  NGC  1952,  or  Taurus 
A,  is  the  remnant  of  a  supernova  explosion  that  was  observed  in  1054.  It  is  about 
6,500  light-years  away,  has  a  diameter  of  1 1  light-years,  and  expands  at  a  speed  of 
about  1,500  km  s_1.  The  physical  properties  of  this  fascinating  object  are  given  in 
Table  13.9. 
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Fig.  13.7  X-ray  image  of  Cassiopeia  A  supernova  remnant  The  expanding  supernova 
remnant  Cassiopeia  A  has  a  temperature  of  about  50  million  K  and  therefore  is  a  luminous  x-ray 
source,  seen  in  this  image  from  the  Chandra  x-ray  Observatory.  Still  visible  in  x-rays,  the  tiny 
point-like  source  near  the  center  of  Cas  A  is  a  neutron  star,  the  collapsed  core  of  the  star 
that  exploded  about  330  years  ago,  as  observed  from  the  Earth.  (Courtesy  of 
NASA/CXC/MIT/University  of  Mass.  Amherst/M.  S.  Stage,  et  al.) 


The  English  amateur  astronomer  John  Bevis  (1695-1771)  discovered  the 
nebula  in  1731.  The  French  astronomer  Charles  Messier  (1730-1817)  indepen¬ 
dently  found  it  a  few  decades  later  (Messier  1781),  listing  it  as  the  first  entry  in  his 
famous  catalogue.  The  Irish  astronomer  William  Parsons  (1800-1867),  the  third 
Earl  of  Rosse,  gave  it  its  present  name,  the  Crab  Nebula,  after  sketching  it  to 
resemble  a  crab  (Rosse  1850). 

The  Crab  is  the  first  nebula  to  be  associated  with  expanding  material  -  by  the 
American  astronomer  John  C.  Duncan  (1882-1962);  and  the  first  to  be  recognized 
as  the  remnant  of  a  stellar  explosion  -  by  the  American  astronomer  Edwin  Hubble 
(1889-1953),  who  identified  the  Crab  Nebula  with  the  guest  star  recorded  by 
Chinese  astronomers  in  1054  A.D.  in  the  same  region  of  the  sky  (Duncan  1921; 
Hubble  1928).  The  Crab  supernova  remnant  is  also  the  first  cosmic  radio  source 
(Bolton  et  al.  1949)  and  the  first  cosmic  x-ray  source  (Bowyer  et  al.  1964)  to  be 
discovered  outside  the  solar  system;  it  was  identified  as  one  of  the  brightest, 
persistent  emitters  of  gamma  rays  (Haymes  et  al.  1968).  Hester  (2008)  has  pro¬ 
vided  a  review  of  the  Crab  Nebula,  as  an  astrophysical  chimera.  Reynolds  (2008) 
reviewed  supernova  remnants  at  high  energy. 

Photographs  taken  by  Walter  Baade  using  the  Mt.  Wilson  2.5  m  (100  inch) 
telescope  in  the  early  1940s  indicated  that  the  visible  nebula  consists  of  two 


446 


13  Stellar  End  States 


Fig.  13.8  The  Crab  Nebula  supernova  remnant  The  optically  visible  light  of  the  Crab  Nebula, 
designated  as  M  1  and  NGC  1952,  consists  of  two  distinct  parts:  (1)  A  system  of  expanding 
filaments  forms  an  outer  envelope  in  which  emission  lines  occur  at  well-defined  wavelengths,  and 
(2)  an  inner  amorphous  region  that  emits  continuum  radiation  at  all  wavelengths.  Walter  Baade 
(1893—1960)  took  this  photograph  of  the  expanding  filaments  in  the  wavelength  range  of  640  nm 
to  670  nm  using  the  5.0  meter  (200  inch)  telescope  on  Mount  Palomar,  California  in  1955. 
A  Type  II  supernova  explosion  observed  nearly  1,000  years  ago,  in  1054,  ejected  the  filaments. 
The  continuum  glow  that  is  concentrated  in  the  inner  parts  of  the  nebula  is  the  nonthermal 
radiation  of  high-speed  electrons  spiraling  in  magnetic  fields  (see  Fig.  13.9).  This  continuum 
emission  is  powered  by  a  spinning  neutron  star,  the  southwesternmost  ( bottom  right )  of  the  two 
central  stars.  The  neutron  star  is  the  crushed,  ultradense  core  of  the  exploded  star.  It  also  is  a  radio 
pulsar  that  acts  like  a  lighthouse  spinning  30  times  a  second.  (Courtesy  of  Hale  Observatories.) 


distinct  parts  that  emit  radiation  differently  (Baade  1942).  A  tangled,  oval-shaped 
network  of  red  and  green  filaments,  seen  in  the  light  of  bright  emission  lines  from 
ionized  atoms,  encases  the  inner  blue  and  milk-white  continuum  radiation.  The 
filamentary  remnants  of  the  explosion  contain  about  4  solar  masses  of  material, 
consisting  mostly  of  ionized  helium  and  hydrogen  along  with  lesser  amounts  of 
carbon,  oxygen,  nitrogen,  iron,  neon,  and  sulfur.  The  progenitor  star  that  was 
annihilated  to  make  the  explosion  had  an  estimated  mass  of  between  9  and  1 1  solar 
masses,  some  of  that  mass  was  sent  into  invisibility  within  surrounding  space  and 
the  rest  remained  within  a  central  neutron  star. 

In  addition  to  an  expanding  shell,  the  radio  nebula  has  a  filled  center  that 
coincides  with  the  inner  visible  light.  These  types  of  supernova  remnants  are 
named  pterions,  from  the  Greek  word  pleres  for  “full”  or  “filled.” 

The  inner  radiation  contains  practically  all  of  the  energy  emitted  by  the  Crab 
Nebula,  and  it  is  1,000  times  more  intense  at  radio  wavelengths  than  at  optically 
visible  wavelengths.  It  is  impossible  to  reconcile  the  observed  radio  emission  with 
the  optical  emission  through  the  thermal  radiation  of  a  hot  gas  at  any  plausible 
temperature.  As  pointed  out  by  the  Russian  astronomer  Iosif  Shklovskii 
(1916-1985),  both  the  radio  and  optical  emission  of  the  Crab  Nebula  come  from 
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Table  13.9  Physical  properties  of  the  Crab  Nebula  supernova  remnant 

Date  of  explosion  =  1054  AD 

Ms  =  progenitor  star  mass  ^ j  9M0 

Mejec  =  ejected  mass  =  2-3  M&  «  5  x  103°  kg 

Mn  =  neutron  star  mass  =1.4  M0 

D  =  distance  «  6,500  light-years  «  2.0  kpc  «  6.2  x  1019  m 
9  =  angular  extent  =  4.5'  x  1.0' 

R  =  radius  =  6D/2  =  4.1  light-years  x  6.1  light-years  =  3.9  x  1016  m  x  6.1  x  1016  m 

V  =  velocity  of  ejected  material  &  1,450  km  s-1  «  1.45  x  106  m  s_1 

L  =  total  luminosity  =  1031'14  J  s_1 

S  =  radio  flux  density  =  1,040  Jy  at  1  GHz 

Lx  =  x-ray  luminosity  =  1030'38  J  s_1 

Ne  =  electron  density  %  4  x  107  m-3 

_ o 

B  =  magnetic  field  strength  «  3  x  10  tesla 

Mns  =  neutron  star  mass  «  1.4  M0  w  2.8  x  103°  kg 

Eb  =  binding  energy  released  in  forming  neutron  star  ^ j  GM%/Rm  w  104S  J 

P  =  radio  pulsar  period  =  0.033326  s 

dP/dt  =  radio  pulsar  period  time  derivative  =  421.288  x  10“ 15  s  s-1 
T  =  P/(2dP/dt)  =  approximate  age  of  pulsar  %  3.95  x  1010  s  «  1,250  year 


synchrotron  radiation  emitted  by  high-energy  electrons  spiraling  around  magnetic 
fields  at  nearly  the  speed  of  light  (Shklovskii  1953). 

The  Crab  Nebula  was  the  first  supernova  remnant  known  to  emit  nonthermal 
synchrotron  radiation,  although  the  radio  emission  of  the  Milky  Way  had  been 
attributed  to  the  synchrotron  radiation  of  cosmic  ray  electrons  spiraling  about  the 
interstellar  magnetic  field  (Sect.  11.3).  Electrons  of  extremely  high  energy  emit 
optically  visible  light,  whereas  electrons  of  slightly  lower  energy  radiate  at  radio 
wavelengths.  Because  the  more  energetic  electrons  lose  their  energy  faster  and 
also  radiate  at  shorter  wavelengths,  the  synchrotron-radiation  mechanism  provides 
a  natural  explanation  for  the  nonthermal  spectrum  of  the  Crab’s  radiation,  which  is 
more  intense  at  longer  wavelengths.  At  every  wavelength  of  observation,  from 
x-rays  to  radio  waves,  the  bulk  of  radiation  from  the  Crab  Nebula  is  accounted  for 
by  the  synchrotron-radiation  mechanism.  It  explains  the  nonthermal  spectrum  of 
the  optical  and  radio  emission,  as  well  as  the  polarization  of  the  optical  continuum 
(Fig.  13.9,  Baade  1957).  The  polarized  radiation  has  a  preferred  orientation  or 
direction  due  to  the  high-speed  electrons  moving  in  a  large-scale,  well-ordered 
magnetic  field. 

Despite  the  successes  of  the  synchrotron-radiation  theory,  explaining  the  origin 
of  the  energetic  electrons  that  gave  rise  to  the  radiation  remained  a  fundamental 
difficulty.  The  electrons  radiating  at  optically  visible  wavelengths  will  dissipate 
their  energy  by  synchrotron  radiation  in  about  180  years,  and  the  more  energetic 
electrons  that  produce  the  short-wavelength  x-rays  should  lose  their  energy  and 
disappear  in  less  than  a  year.  Because  the  supernova  radiation  was  emitted  more 
than  900  years  ago,  the  high-speed  electrons  producing  the  synchrotron  radiation 
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Fig.  13.9  Polarized  light  of  the  Crab  Nebula  The  inner  continuum  emission  of  the  Crab 
Nebula  supernova  remnant  should  be  polarized,  or  emitted  in  a  preferred  direction,  if  it  is  caused 
by  nonthermal  synchrotron  radiation.  This  was  confirmed  in  detail  by  these  visible-light 
photographs  taken  in  the  wavelength  range  of  540-640  nm  through  a  polarized  filter  by  Walter 
Baade  (1893-1960)  in  1955  using  the  5.0  meter  (200  inch)  telescope  on  Mount  Palomar, 
California.  The  arrows  indicate  the  direction  of  the  electric  vector  of  the  light  recorded.  The  south 
westernmost  of  the  two  central  stars  is  the  remnant  neutron  star  and  a  radio  pulsar.  This  inner 
amorphous  region  of  the  Crab  Nebula  supernova  remnant  is  a  powerful  source  of  radio  radiation, 
while  also  emitting  optically  visible  light  and  x-rays.  This  radiation  is  most  intense  at  the  longer, 
radio  wavelengths,  a  characteristic  of  non-thermal  radiation.  It  has  been  attributed  to  the 
synchrotron  radiation  of  relativistic  electrons  whirling  at  nearly  the  speed  of  light  around 
magnetic  fields.  Because  the  more  energetic  electrons  lose  their  energy  faster  and  also  radiate  at 
shorter  wavelengths,  the  synchrotron  radiation  mechanism  provides  a  natural  explanation  for  the 
nonthermal  emission  of  the  Crab’s  inner  regions  and  for  the  fact  that  its  radio  emission  is  a 
thousand  times  more  intense  than  its  visible  light.  (Courtesy  of  Hale  Observatories.) 
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cannot  be  survivors  of  the  original  explosion.  Instead,  some  unknown  source  must 
be  continuously  replenishing  these  energetic  electrons. 


Example:  Expansion  age,  pulsar  age,  and  x-ray  synchrotron  lifetime  for 
the  Crab  Nebula 

The  material  ejected  from  the  explosion  that  gave  rise  to  the  Crab  Nebula 

/r  _ i 

supernova  remnant  is  now  expanding  at  a  velocity  of  V  =  1 .45  x  10°  m  s  . 
The  largest  angular  extent  of  the  supernova  remnant  is  6  =  1.0'  = 
420"  «  2.035  x  10-3  radians,  where  1  radian  =  2.063  x  105  At  the  Crab 
Nebula’s  distance  of  D  =  6,500  light-years  =  6.15  x  1019  m,  where  1  light- 
year  =  9.46  x  1015  m,  this  angular  extent  corresponds  to  a  radius  of 
R  =  6D/2  x  6.26  x  10  m.  If  the  ejected  material  has  been  moving  at  a 
constant  velocity  across  this  radius,  the  expansion  age  is  T  =  R/V  « 
4.3  x  1010  s  «  1,367  years,  where  1  year  =  3.1557  x  107  s. 

The  supernova  explosion  observed  by  the  Chinese  in  this  region  of  the  sky 
occurred  in  1054  AD,  or  about  955  years  ago,  which  is  slightly  younger  and 
might  indicate  that  the  debris  initially  expanded  at  a  faster  rate  than  that 
observed  today. 

The  Crab  pulsar,  described  in  the  next  section,  has  a  period  of 
P  =  0.033326  s,  and  that  period  is  increasing  at  the  rate  of  dP/dt  = 
421.288  x  10“ 15  s  s_1.  The  approximate  pulsar  age  is  T  =  P/(2dP/dt) 
=  3.95  x  1010  s  =  1,250  years.  This  upper  limit  to  the  age  is  consistent 
with  the  historical  age  of  about  950  years,  for  the  periods  lengthen  with  time 
and  the  initial  pulsar  period  might  have  been  shorter.  Setting  P  =  0.025  s  for 
that  initial  period  would  give  the  correct  age. 

The  Crab  Nebula  emits  synchrotron  radiation  at  radio,  visible  light,  and 
x-ray  wavelengths.  It  is  emitted  by  energetic  electrons  spiraling  in  a  mag- 

_ o 

netic  field  of  strength  B  «  3  x  10  tesla.  At  a  soft  x-ray  photon  energy  of 
E  =  10  keV  =  1.602  x  10  15  J,  the  frequency  of  the  radiation  is  = 
E/h  =  2.42  x  1018  Hz,  where  the  Planck  constant  h  =  6.626  x 
10-34  J  s.  The  wavelength  of  the  radiation  is  =  c/vs  «  1.23  x  10“10  m, 

O  _  1 

where  the  speed  of  light  c  =  2.9979  x  10  m  s  .  The  Lorentz  factor  y  of 
high-speed  electrons  giving  rise  to  synchrotron  radiation  at  this  frequency  or 
wavelength  is  given  by  y  =  ([vs/(2.8  x  10 10  B )]1//2  «  5.34  x  107,  and  the 
synchrotron  lifetime  of  the  electron  radiating  this  x-ray  radiation  is 

.  o  o 

ts  =  8.187/(2?  y)  «  1.7  x  10  s  «  5.4  years.  More  energetic  x-rays  cor¬ 
respond  to  a  higher  y  and  a  shorter  lifetime  for  electrons  emitting  this 
radiation;  synchrotron  radiation  emitted  at  gamma-ray  wavelengths  has  even 
briefer  electron  lifetimes.  The  electrons  emitting  synchrotron  x-rays  and 
gamma  rays  from  the  Crab  Nebula  could  not  have  been  accelerated  in  the 
supernova  explosion  that  occurred  nearly  1,000  years  ago. 
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An  early  clue  to  the  energy  source  of  the  Crab  Nebula  was  provided  when  the 
American  astronomer  Carl  Lampland  (1873-1951)  observed  moving  wisps  and  knots 
that  originated  at  the  center  of  the  nebula  (Lampland  1921).  Walter  Baade  subse¬ 
quently  showed  that  these  features  move  at  up  to  a  tenth  of  the  speed  of  light,  sug¬ 
gesting  that  the  central  star  was  injecting  high-speed  particles  into  the  nebula.  It  was 
eventually  realized  that  a  neutron  star  was  bom  in  the  cmshed  center  of  the  explosion. 

The  Italian  astronomer  Franco  Pacini  (1939-  )  pointed  out  that  gravitational 
energy  released  during  the  collapse  of  a  normal  star  into  a  neutron  star  will  be 
converted  into  rotational  energy  (Pacini  1967).  It  follows  from  the  conservation  of 
angular  momentum  that  the  neutron  star  will  be  spinning  rapidly,  with  a  rotation 
period  of  less  than  a  second,  and  it  will  also  have  an  intense  magnetic  field,  owing 
to  the  conservation  of  magnetic  flux  in  collapse.  Provided  that  the  magnetic  axis 
and  rotation  axis  are  not  aligned,  a  rotating  dipole  magnetic  field  will  convert  the 
rotational  energy  into  electromagnetic  energy,  thereby  providing  the  luminosity  of 
the  Crab  Nebula  for  its  entire  lifetime. 

In  the  same  year  as  Pacini’s  prescient  publication,  the  first  radio  pulsar  was 
discovered.  The  following  year,  pulsars  were  attributed  to  rapidly  rotating  neutron 
stars,  and  a  pulsar  was  found  at  the  center  of  the  Crab  Nebula,  spinning  30  times  a 
second.  The  Crab  was  the  first  object  whose  luminous  output  was  related  to  a 
central  pulsar. 

The  nebula  is  powered  by  the  pulsar  wind  that  is  composed  of  charged  particles 
accelerated  to  nearly  the  speed  of  light  by  the  rapidly  rotating,  intense  magnetic 
field  of  the  spinning  pulsar.  Short-lived,  flaring  bursts  of  radiation  from  the  Crab 
Nebula  at  gamma  ray  wavelengths  may  be  due  to  sudden  restructuring  of  the 
pulsar  magnetic  field,  accelerating  electrons  to  energies  more  than  100  times 
greater  than  can  be  attained  by  any  particle  accelerator  on  Earth. 

Pulsar  winds  are  found  inside  the  shells  of  other  filled  supernova  remnants,  but 
Cassiopeia  A  is  an  exception.  Although  the  x-ray  emission  of  a  central  neutron  star 
has  been  located  within  this  supernova  remnant,  using  instruments  aboard  the 
Chandra  X-ray  Observatory  (Lu  et  al.  2011),  the  neutron  star  is  relatively  quiet  and 
does  not  emit  any  detectable  pulsar  wind  activity.  One  possible  explanation  is  that 
the  magnetic  fields  are  so  extremely  strong  that  they  have  stifled  pulsar  wind  activity 
rather  than  enhanced  it.  We  now  turn  to  early  speculations  about  the  possible 
existence  of  neutron  stars  and  the  subsequent  discovery  of  radio  and  x-ray  pulsars. 


13.7  Neutron  Stars  and  Pulsars 
13.7.1  Neutron  Stars 

Walter  Baade  (1893-1960)  and  Fritz  Zwicky  (1898-1974)  proposed  the  possibility 
that  neutron  stars  might  exist  just  two  years  after  James  Chadwick’s  (1891-1974) 
discovery  of  the  neutron  (Chadwick  1932a,  b).  They  speculated  that  a  supernova 
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Table  13.10  Physical  properties  of  neutron  stars 
Mns  —  mass  of  neutron  star  %  2  M0  =  3.978  x  1030  kg 

on 

MCns  —  critical  upper  mass  limit  for  neutron  star  ^ j  3  Mq  =  5.967  x  10JU  kg 
RNS  =  radius  of  neutron  star  12  km  =  1.2  x  104  m 

1  >~j  _ o 

pNS  =  mass  density  of  neutron  star  %  5  x  10  kg  m 

Vesc  =  escape  velocity  of  neutron  star  =  (2 GMNS/RNS)* 1^2  =  210,000  km  s-1  =  0.70  c 
Ens  —  binding  energy  released  to  form  a  neutron  star  —  G 1VI j R ^  8.8  x  1046  J 
P  =  rotation  period  =  0.001-10  s 

o 

B  =  magnetic  field  strength  =10  tesla 


explosion  is  driven  by  the  gravitational  energy  released  when  a  massive  star  runs 
out  of  fuel  and  its  core  collapses,  but  the  explosion  may  not  completely  destroy  the 
stellar  core.  A  dense  cinder  could  remain  at  the  center  or,  in  their  prescient  words: 
“With  all  reserve  we  advance  the  view  that  a  super-nova  represents  the  transition 
of  an  ordinary  star  into  a  neutron  star ,  consisting  mainly  of  neutrons.  Such  a  star 
may  possess  a  very  small  radius  and  an  extremely  high  density.”  (Baade  and 
Zwicky  1934b). 

We  now  know  that  if  a  collapsing  star  is  more  massive  than  1.4  solar  masses, 
the  inward  force  of  its  gravitation  will  overcome  the  outward  degenerate  electron 
pressure  that  halts  the  collapse  at  the  white-dwarf  stage.  The  young  Russian 
physicist  Lev  Landau  (1908-1968)  speculated  that  dead  stars  with  a  mass  above 
this  white-dwarf  limit  might  collapse  until  neutron  degeneracy  pressure  would  halt 
the  crush  of  gravity  (Landau  1938).  Baym  and  Pethic  (1979)  provide  an  early 
review  of  the  physics  of  neutron  stars. 

The  electrons  are  pushed  into  direct  contact  with  the  atomic  nuclei,  and  thus 
packed  together  at  nuclear  densities.  The  enormous  pressure  of  the  rapid  collapse 
would  create  neutrons  when  the  electrons  merged  with  the  nuclear  protons, 
forming  a  nuclear  gas  with  a  mass  density  surpassing  that  of  the  Sun,  and  water,  by 

i  n  _ o 

a  factor  of  a  million,  billion,  at  5  x  10  kg  m  (Table  13.10).  At  these  densities, 
a  normal  star  like  the  Sun  would  have  collapsed  to  a  radius  of  only  12  km.  Such  a 
neutron  star  has  a  powerful  gravity,  with  an  escape  velocity  of  about  0.7  or  70  % 
of  the  speed  of  light. 

Within  a  year  of  Landau’s  speculations,  George  Gamow  (1904-1968)  pointed 
out  that  the  high  central  mass  densities  of  neutron  stars  can  be  expected  only 
in  those  stages  of  stellar  evolution  subsequent  to  the  exhaustion  of  the  thermo¬ 
nuclear  fuel  in  normal  stars,  and  that  only  the  very  massive  stars  would  have 
evolved  rapidly  enough  to  reach  this  stage  during  the  lifetime  of  the  universe 
(Gamow  1939). 

At  about  the  same  time,  the  American  physicist  J.  Robert  Oppenheimer 
(1904-1967)  and  a  Canadian  graduate  student  George  M.  Volkoff  (1914-2000) 
found  that  neutron  stars  have  a  limiting  mass  of  their  own  (Oppenheimer  and 
Volkoff  1939).  They  showed  that  at  these  large  mass  densities  both  the  degenerate 
neutron  pressure  and  the  effects  of  gravitation  on  space-time  must  be  considered. 
The  equation  of  state  of  the  nuclear  material  was  obtained  in  the  first 
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approximation  by  identifying  it  with  a  degenerate,  relativistic  gas  of  neutrons 
fulfilling  Fermi  statistics.  The  macroscopic  structure  of  the  star,  its  mass,  radius, 
and  density  distributions,  were  determined  from  Einstein’  General  Theory  of 
gravitation.  Calculating  equilibrium  configurations  along  these  lines,  Oppenheimer 
and  Volkoff  found  that  a  stable  neutron  star  could  exist  only  in  a  finite  range  of 
masses. 

Just  as  degenerate  electron  pressure  supports  a  white  dwarf  star,  it  is  degenerate 
neutron  pressure  that  supports  a  neutron  star.  This  means  that  the  radius,  RNS ,  of  a 
neutron  star  of  mass,  MNS ,  can  be  estimated  by: 


R 


m 


WD 


n 


(13.21) 


or 


Rns  «  1O4(M0/M^)1/3  m,  (13.22) 

_ Q  1 

where  the  electron  mass  me  =  9.1094  x  10  kg,  the  neutron  mass 

_ 

mn  =  1.6749  x  10  ~  kg,  Rwd  denotes  the  radius  of  a  white  dwarf  star, 
Z/A  =1.0  for  a  neutron  star,  and  the  Sun’s  mass  M0  =  1.989  x  10  kg.  So,  for 
a  neutron  star  of  mass  MNS  =1.4  M0  and  radius  RNS  =  10  km,  the  star’s  mass 

O  1  Q 

density,  Pns  —  3  Mns/(4tiRns )  «  7  x  10  kg  m  ,  which  is  comparable  to  the 
mass  density  of  the  nucleus  of  an  atom. 

Although  the  radius  of  a  neutron  star  shrinks  with  increasing  mass,  you  can’t 
increase  the  mass  of  a  neutron  star  without  limit  any  more  than  you  can  for  a  white 
dwarf  star,  and  the  mass  of  a  neutron  star  is  now  believed  to  be  between  1.4  and 
3.0  solar  masses. 

Theoretical  considerations  of  a  dense  neutron  gas  suggest  that  it  could  be 
superfluid,  with  no  resistance  to  flow,  and  superconducting,  without  electrical 
resistance.  Chandra  x-ray  observations  of  the  neutron  star  at  the  center  of  the 
Cassiopeia  A  supernova  remnant  indicate  a  rapid  decline  in  the  temperature  of  the 
ultra-dense  neutron  core,  of  about  4  %  over  a  10  year  period,  suggesting  that  it  is 
made  of  superfluid  and  superconducting  material  (Page  et  al.  2011;  Shternin  et  al. 
2011). 

To  sum  up,  when  an  isolated  massive  star  can  no  longer  support  its  own 
crushing  weight,  the  center  collapses,  and  obtains  energy  from  its  in-fall.  If  the 
collapsing  stellar  core  weighs  between  1.4  and  3.0  solar  masses,  it  is  compacted  to 
nuclear  density,  and  forms  a  neutron  star.  When  the  core  weighs  more  than  3  solar 
masses,  it  collapses  into  a  black  hole. 

Although  important  in  hindsight,  these  early  considerations  about  the  possi¬ 
bility  of  neutron  stars  did  not  evoke  much  interest  at  the  time.  They  would  have 
remained  a  speculative  curiosity  if  it  were  not  for  the  serendipitous  discovery  of 
radio  pulsars. 
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13.7.2  Radio  Pulsars  from  Isolated  Neutron  Stars 

Before  discussing  the  discovery  of  pulsars,  we  provide  references  to  modern 
reviews  for  further  reading.  Lyne  and  Graham-Smith  (2012)  have  discussed  many 
aspects  of  pulsar  astronomy,  and  Taylor  and  Stinebring  (1986)  provided  an  earlier 
review  of  our  understanding  of  pulsars.  Gaensler  and  Slane  (2006)  have  reviewed 
the  evolution  and  structure  of  pulsar  wind  nebulae;  and  Phinney  and  Kulkarni 
(1994)  have  reviewed  binary  and  millisecond  pulsars. 

Pulsars  were  discovered  accidentally  during  a  survey  of  the  scintillations ,  or 
“twinkling”,  caused  when  radio  radiation  from  cosmic  sources  passes  through  the 
Sun’s  winds.  When  the  radio  waves  are  viewed  through  the  wind-driven  material, 
they  blink  on  and  off,  varying  on  time-scales  of  a  few  tenths  of  a  second  -  in  much 
the  same  way  that  stars  twinkle  when  seen  through  the  Earth’s  varying  atmo¬ 
sphere.  Repeated  observations  of  several  scintillating  radio  sources  at  different 
angles  in  relation  to  the  Sun  provide  information  about  the  properties  of  the  solar 
wind  and  the  angular  structure  of  the  radio  sources.  The  fluctuations  are  greatest 
for  the  smaller  emitters,  just  as  stars  twinkle  more  than  the  Moon  or  planets,  which 
have  larger  angular  extents. 

To  study  these  effects,  Antony  He  wish  (1924- )  and  his  colleagues  at  Cambridge 
University  built  a  large  array  of  2048  dipole  antennas,  spread  over  four  and  a  half 
acres  and  operated  at  a  long  radio  wavelength  of  3.7  m,  since  the  scintillating 
fluctuations  were  known  to  be  more  prominent  at  longer  wavelengths.  The  combined 
signals  from  all  the  antennas  were  connected  to  a  radio  receiver  and  chart  recorder 
with  a  time  constant  of  0.1  s,  the  time-scale  of  the  scintillations.  When  examining 
the  charts  in  July  1967,  graduate  student  Jocelyn  Bell  (1943-  )  found  a  strong 
fluctuating  signal  in  the  middle  of  the  night,  when  the  array  was  pointed  away  from 
the  Sun  and  the  effects  of  the  solar  wind  should  have  been  small. 

Further  investigations  led  to  the  astonishing  detection  of  periodic  radio  pulses, 
with  an  exceedingly  precise  repetition  period  of  1.3372795  s.  The  first  radio  pulsar 
had  been  detected  (He wish  et  al.  1968).  No  one  had  foreseen  its  existence,  and  no 
other  known  astronomical  object  kept  time  so  accurately. 

By  the  time  the  discovery  was  ready  for  publication,  in  1968,  evidence  of  other 
radio  pulsars  was  found  in  the  existing  chart  recordings;  within  three  weeks,  a 
second  paper  announced  the  discovery  of  three  additional  radio  pulsars  (Pilkington 
et  al.  1968).  This  triggered  searches  for  other  previously  unknown  pulsars  with 
large  radio  telescopes  using  rapid  time  sampling  rather  than  the  long  integration 
times  formerly  used.  In  less  than  a  year,  the  list  of  pulsars  was  expanded  to  more 
than  two-dozen,  and  a  pulsar  was  detected  at  the  position  of  the  very  star  thought 
to  be  the  neutron  star  remnant  of  the  Crab  Nebula  supernova  explosion. 

We  now  know  that  the  term  pulsar  is  misleading  for  the  compact  stars  do  not 
pulsate  -  they  rotate  -  but  the  name  has  stuck.  It  designates  repeating  pulses  of 
radio  emission  rather  than  a  pulsating  star. 

The  physical  properties  of  radio  pulsars  are  given  in  Table  13.11. 


454 


13  Stellar  End  States 


Table  13.11  Physical  properties  of  radio  pulsars 
PRP  =  period  of  radio  pulsar  =  0.001-4  s 

dPRP/dt  =  rate  of  increase  of  radio  pulsar  period  =  10_15-10-12  s  s-1 

Q  1  A 

T  =  P Rp / (2dP RP / dt)  =  characteristic  radio  pulsar  age  =  10  —10  years  (where 
1  year  =  3.156  x  107  s) 

BRP  =  magnetic  field  strength  of  radio  pulsar  =  109— 1013  G  =  105— 109  tesla 
LRP  =  radio  luminosity  =  1018-1024  J  s_1 

MRP  =  mass  of  binary  pulsar  PSR  1913  +  16  =  1.44  M0  w  2.85  x  1030  kg 


The  pulsars  probably  could  have  been  discovered  many  years  earlier,  when 
other  large  radio  antennae  were  constructed,  but  radio  astronomers  were  used  to 
adding  up  signals  over  long  time  intervals  to  detect  faint  cosmic  radio  signals.  The 
long  time  resolutions  precluded  detection  of  the  pulsars.  They  are  relatively  faint 
radio  sources  when  averaged  over  their  period  because  there  is  no  emission 
between  the  brief  radio  pulses.  If  time  resolutions  comparable  to  the  pulsar  burst 
durations  of  milliseconds  had  been  used,  the  intense  radio  bursts  would  have  been 
detected  easily.  It  is  because  Hewish  specifically  designed  a  new  type  of  radio 
telescope  for  a  study  of  the  rapidly  changing  solar  wind  effects  that  the  radio 
pulsars  were  discovered  accidentally. 

The  extreme  regularity  of  the  periodic  radio  bursts  suggested  that  they  are 
controlled  by  the  rotation  of  a  massive  body,  and  the  short  duration  of  the  pulsar 
bursts  suggested  that  their  radiation  originates  in  a  body  that  cannot  be  much  larger 
than  the  Earth.  That  is,  the  size  should  be  smaller  than  the  product  of  the  speed  of 
light  and  the  burst  duration;  otherwise,  it  may  be  violating  nature’s  upper  speed 
limit,  the  speed  of  light.  A  rotating  white  dwarf  was  initially  suspected,  but  it  could 
not  spin  faster  than  about  once  per  second.  Rotation  with  a  shorter  period  would 
tear  the  white  dwarf  apart,  since  its  outer  atmosphere  would  be  rotating  faster  than 
the  star’s  escape  velocity. 

The  Austrian-born  American  astronomer  Thomas  Gold  (1920-2004)  proposed 
that  radio  pulses  are  produced  by  a  rapidly  rotating  neutron  star  with  an  intense 
magnetic  field  (Gold  1968).  He  assumed  that  a  pulsar  would  emit  radio  radiation  in 
a  beam,  like  a  lighthouse,  oriented  along  the  magnetic  axis  (Fig.  13.10).  An 
observer  sees  a  pulse  of  radio  radiation  each  time  the  rotating  beam  flicks  across 
the  Earth.  Because  the  neutron  star’s  beam  could  be  oriented  at  any  angle,  the 
beams  of  many  pulsars  would  miss  the  Earth  and  would  remain  forever  unseen. 

Gold  suggested  definitive  observational  tests  of  his  ideas.  He  noticed  that  a 
spinning  neutron  star  gradually  loses  its  rotational  energy  and  slows  down,  suc¬ 
cessfully  predicting  that  this  would  cause  a  slow  lengthening  of  the  radio  pulsar 
periods  with  time.  He  also  predicted  that  radio  pulsars  with  much  shorter  periods 
would  be  found,  as  they  were. 

If  a  slowly  rotating  star  collapses  down  to  a  small  size,  the  rate  of  rotation 
increases.  It’s  a  result  of  the  conservation  of  angular  momentum,  which  means  that 
the  period  of  rotation  is  proportional  to  the  square  of  the  radius.  When  an  ordinary 
visible  star  runs  out  of  nuclear  fuel  and  is  compressed  to  the  size  of  a  neutron  star, 


13.7  Neutron  Stars  and  Pulsars 


455 


Fig.  13.10  Radio  pulsar  A  spinning  neutron  star  has  a  powerful  magnetic  field  the  axis  of  which 
intersects  the  north  and  south  magnetic  poles.  The  rotating  fields  generate  strong  electric  currents 
and  accelerate  electrons,  which  emit  an  intense,  narrow  beam  of  radio  radiation  from  each 
magnetic  polar  region.  Because  the  magnetic -field  axis  can  be  inclined  to  the  neutron  star’s 
rotation  axis,  these  beams  can  wheel  around  the  sky  as  the  neutron  star  rotates.  If  one  beam 
sweeps  across  the  Earth,  a  bright  pulse  of  radio  emission,  called  a  pulsar ,  is  observed  once  every 
rotation  of  the  neutron  star 


its  rotation  period  would  speed  up  from  days  to  milliseconds.  Since  the  time 
between  successive  radio  pulses  is  the  same  as  the  rotation  period  of  the  neutron 
star,  it  has  to  be  initially  spinning  at  a  very  fast  rate. 

A  neutron  star  is  also  a  powerful  magnet,  and  the  magnetism  is  attached  to  the 
material  within  the  star.  When  the  star  collapses  it  carries  its  magnetism  with  it, 
packing  it  into  a  smaller  volume  and  amplifying  its  strength  by  factors  of  billions. 
The  magnetic  field  strength  increases  in  inverse  proportion  to  the  surface  area, 
which  is  itself  proportional  to  the  square  of  the  radius.  So  a  typical  surface 
magnetic  field  on  a  visible-light  star,  with  a  strength  of  0.01  tesla,  would  be 

o 

strengthened  by  a  factor  of  a  ten  thousand  million,  or  to  10  tesla,  if  collapsing  to  a 
neutron  star. 


Example:  Period  and  magnetic  field  of  a  rotating  neutron  star  or  white 
dwarf  star 

We  can  infer  the  rotation  period  of  a  neutron  star  using  the  conservation  of 
angular  momentum.  For  a  sphere  of  mass,  M,  radius,  R ,  rotation  period,  P, 
and  rotation  velocity  V  =  2nR/P,  the  conservation  of  angular  momentum  in 
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gravitational  collapse  requires  that  MVR  =  2nMR^/P  =  constant,  or  that 
the  rotation  period  varies  as  R ~  if  the  mass  is  not  changed.  For  the  Sun, 
M  =  M0  =  1.989  x  1030  kg,  R  =  RQ  =  6.955  x  108  m,  P  =  25.67  days 
at  the  solar  equator,  where  one  day  =  86,400  s,  and  V  =  1 ,97 1  m  s_  1  at  the 
solar  equator.  If  a  star  of  this  mass  and  rotation  period  collapsed  to  form  a 
neutron  star  of  radius  RNS  =  10  km,  the  rotation  period  would  be 
PNS  «  4.6  x  10-4  s. 

Conservation  of  magnetic  flux  in  gravitational  collapse  provides  an  esti¬ 
mate  for  the  magnetic  field  of  a  neutron  star.  For  a  sphere  with  a  surface 
magnetic  field  strength,  B ,  and  radius,  R ,  the  conservation  of  magnetic  flux  in 

gravitational  collapse  requires  that  the  magnetic  flux  BR ~  =  constant,  or  that 

_ 

the  magnetic  field  strength  B  varies  as  R  .  For  the  Sun, 
B  =  Bq  =  10-2  tesla  and  R  =  RQ  =  6.955  x  108  m.  If  a  star  of  this 
magnetic  field  strength  and  radius  collapsed  to  a  neutron  star  of  radius 
Rns  =10  km  it  would  have  a  surface  magnetic  field  strength  of  BNS  = 
4.8  x  107  tesla. 

The  same  formulae  apply  for  a  white  dwarf  star,  which  would  collapse  to 
a  radius  of  about  0.01  solar  radii,  to  give  a  rotation  period  of  PWD  =  221  s 
and  a  magnetic  field  strength  of  100  tesla.  Although  such  strong  magnetic 
fields  have  been  detected  for  white  dwarfs,  many  white  dwarfs  rotate  with 
periods  of  days  or  even  years  rather  than  seconds  to  minutes.  The  slow 
rotation  of  white  dwarf  stars  may  be  attributed  to  the  powerful  winds  that 
created  their  surrounding  planetary  nebulae,  removing  rotational  energy  and 
slowing  the  white  dwarf. 


The  association  of  radio  pulsars  with  neutron  stars  in  supernova  remnants 
became  accepted  when  Australian  radio  astronomers  found  a  pulsar  with  an 
extremely  short  period  -  89  ms  -  in  the  center  of  the  extended  radio  source 
Vela  X,  believed  to  mark  the  debris  of  a  supernova  explosion.  Soon  afterward  the 
Crab  Nebula  was  found  to  contain  a  radio  pulsar  with  an  even  shorter  period, 
33  ms  (Reifenstein  et  al.  1969)  and  the  neutron  star  theory  received  further 
impressive  support  when  optically  visible  pulsed  light  was  observed  (Nather  et  al. 
1969)  from  the  very  star  that  Walter  Baade  and  Rudolph  Minkowski  had  identified 
in  1942  as  the  central  stellar  remnant  of  the  Crab  Nebula  supernova  explosion 
(Baade  1942;  Minkowski  1942).  Astronomers  have  used  the  powerful  Chandra 
X-ray  Observatory  to  trace  out  the  jets,  rings,  winds,  and  shimmering  shock  waves 
generated  by  the  highly  magnetized,  rapidly  spinning  pulsar. 

As  Franco  Pacini  previously  demonstrated,  before  the  discovery  of  radio  pul¬ 
sars,  a  rapidly  spinning  and  highly  magnetized  neutron  star  is  a  powerful  source  of 
electromagnetic  radiation  (Focus  13.13),  which  could  keep  the  Crab  Nebula 
supernova  remnant  shining  for  thousands  of  years,  and  ever  since  the  observation 
of  the  supernova  explosion  (Pacini  1967).  Moreover,  the  rotational  periods  of  radio 
pulsars  are  increasing,  just  as  Gold  predicted.  If  P  denotes  the  radio  pulsar  period 
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and  dP/dt  =  P  designates  the  rate  of  increase  of  that  period  with  time,  then  the 
age,  t,  of  the  radio  pulsar  can  be  estimated  from: 


T  = 


PP 

~2m 


(13.23) 


The  ages  determined  from  the  observed  slow-down  rate  and  periods  of  radio 
pulsars  range  from  young  pulsars  with  ages  of  1,000-10,000  years  to  old  pulsars 
with  ages  up  to  100  million  years.  A  supernova  remnant  will  expand  and  dissipate 
into  interstellar  space,  becoming  undetectable  in  less  than  100  thousand  years. 
Most  observed  radio  pulsars  have  therefore  outlived  any  observable  supernova 
remnant  they  might  be  associated  with,  and  most  pulsars  are  not  found  in  one. 
Only  a  few  rare,  young  pulsars,  such  as  the  Crab  Nebula  pulsar  are  found  in  a 
supernova  remnant. 


Focus  13.3  Luminosity,  rotational  energy,  and  magnetic  field  strength  of 
a  radio  pulsar 

A  neutron  star  with  a  dipolar  magnetic  field  will  behave  as  a  rotating 
magnetic  dipole,  with  radiation  luminosity  LNS  given  by: 


2  4 

/u0m±co 
671c3  ’ 


(13.24) 


_ /T  _ ^ 

where  the  magnetic  constant  /to  —  1.2566  x  10  N  A  ,  the  symbol  m± 
denotes  the  component  of  the  magnetic  dipole  moment  perpendicular  to  the 
rotation  axis,  the  angular  rotation  velocity  co  =  2n/P  for  a  rotation  period  P, 

o  _ 1 

and  the  speed  of  light  c  =  2.9979  x  10  m  s  .  For  a  uniformly  magnetized 
neutron  star  of  radius  RNS  and  surface  magnetic  field  of  strength  BNS ,  the 
magnetic  dipole  moment  is: 


A-tc 

m±  = — BNsR3ns  sin6>,  (13.25) 

Po 

where  6  is  the  angle  between  the  rotation  axis  and  the  magnetic  axis.  We  can 
therefore  express  the  neutron  star  luminosity  by: 


(13.26) 


The  magnetic  dipole  radiation  extracts  rotational  energy,  Erot ,  from  the 
neutron  star.  The  rotational  energy  is  related  to  the  moment  of  inertia 

r\ 

I  =  2MR  /  5  for  a  uniform  rotating  sphere  of  mass  M  and  radius  R,  and  given 
by: 


In2 1  _  An2  MR 2 

~pT  ~  ~ 5  pT' 


(13.27) 
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For  a  neutron  star  M  =  MNS  ^1.4  M0,  where  the  Sun’s  mass 
M0  =  1.989  x  1030  kg,  and  the  radius  R  =  RNS  =  10  km  =  104  m. 

The  rate  of  change,  dErot / dt  of  the  rotational  energy  with  increasing  time, 
t ,  is  related  to  the  increase  dP/dt  =  P  o f  the  rotation  period  P,  and  given  by 
the  expression: 


dErot 

dt 


4jl2IP 

P3 


(13.28) 


The  pulsar  period  and  its  rate  of  change  can  be  combined  with  this 
equation  to  give  the  power  extracted  from  the  rotation  of  a  neutron  star,  with 
M  —  Mns  «  1 .4  M0,  where  the  Sun’s  mass  M0  =  1.989  x  1030  kg,  and 
the  radius  R  =  RNS  =10  km  =  104  m. 

These  measurements  can  also  be  used  to  provide  a  lower  limit  to  the 
magnetic  field  strength  B  at  the  surface  of  the  pulsar.  We  do  not  know  the 
inclination  angle  6  between  the  rotation  axis  and  the  magnetic  axis,  but  we 
do  know  that  6  is  less  than  90°  and  that  sin  6  is  less  than  or  equal  to  1 ,  or  that 
sin  6  <  1 .  Setting  the  loss  in  rotational  energy  dErot/ dt  equal  to  the  neutron 
star  luminosity  LNS  and  combining  terms  from  the  previous  equations  we 
obtain: 


B> 


(13.29) 


For  a  neutron  star  with  M  =  MNS  «  1.4  M0,  where  the  Sun’s  mass 
M0  =  1.989  x  1030  kg,  and  a  radius  R  =  RNS  =10  km  =  104  m,  this 
equation  becomes: 


B>  3  x  1015  (/>/>) 1/2  tesla. 


(13.30) 


As  demonstrated  in  the  next  example,  the  loss  of  rotational  energy  inferred  from 
the  period  increase  of  the  pulsar  is  exactly  what  is  needed  to  keep  the  Crab  Nebula 
shining  at  the  present  rate  for  about  1,000  years,  ever  since  the  observation  of  the 
supernova  explosion  that  was  associated  with  the  pulsar’s  birth.  So  it  is  the  central 
pulsar  that  makes  the  nebula  glow.  Moreover,  it  was  soon  shown  that  radio  pulsars 
could  efficiently  accelerate  particles  to  nearly  the  speed  of  light,  accounting  for 
both  the  synchrotron  radiation  of  the  Crab  Nebula  and  the  beamed  radio  waves  that 
are  observed  as  pulsars. 
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Example:  Energy  loss  and  magnetic  field  strength  of  the  Crab  Nebula 
pulsar 

The  pulsar  at  the  center  of  the  Crab  Nebula  has  a  period  P  =  0.033326  s  and 
a  period  increase  of  dP/dt  =  P  =  4.213  x  10“  s  s“  .  Assuming  that  this 
pulsar  has  a  mass  equal  to  the  mass  of  a  neutron  with  a  mass  M  =  MNS  ~ 
1.4  MQ,  where  the  Sun’s  mass  M0  =  1.989  x  1030  kg,  and  a  radius 
R  =  Rns  =10  km  =  104  m,  the  rate  of  change  of  the  rotational  energy  is 
dErot/dt  =  (8n2  /  5)  MR2  P  /  P3  «  5  x  1031  J  s_1.  This  rate  at  which  the  Crab 
Nebula  pulsar  loses  its  rotational  energy  is  comparable  to  the  energy 
requirements  of  the  surrounding  supernova  remnant,  including  its  non- 
thermal  synchrotron  radiation  and  its  expansion. 

We  can  also  provide  a  lower  limit  to  the  pulsar’s  surface  magnetic  field 

from  B>  [3Li0c3M/(80n}R4)}]/2(Pp)i/2x  3  x  10  l5(PP)'/2  tesla  *4x 

o 

10  tesla.  This  magnetic  field  strength  is  comparable  to  that  expected  from 
the  gravitational  collapse  of  the  Sun  to  a  neutron  star. 


When  the  observed  slow-down  rate  of  most  radio  pulsars  is  combined  with  their 
periods,  typical  ages  of  between  1  million  years  to  100  million  years  are  obtained. 
A  supernova  remnant  will  expand  and  dissipate  into  the  vastness  of  interstellar 
space,  becoming  unrecognizable  in  less  than  100  thousand  years,  removing  all 
signs  of  the  pulsar’s  birth.  Pulsars  therefore  outlive  their  supernova  remnants  and 
most  pulsars  are  not  found  in  one.  Only  a  few  rare,  young  pulsars,  such  as  the  Crab 
Nebula  Pulsar,  are  rotating  fast  enough  to  efficiently  accelerate  particles  to  very 
high  energies. 

Although  most  radio  pulsars  are  alone  in  space  without  a  nearby  companion, 
some  binary  pulsars  have  been  discovered,  the  most  famous  being  PSR  1913  +  16 
with  a  period  of  0.05898  s;  the  PSR  designates  pulsar  and  1913  +  16  specifies  its 
position  in  the  sky.  Russell  A.  Hulse  (1950-  )  and  Joseph  H.  Taylor,  Jr.  (1941-  ) 
found  it  as  a  result  of  a  deliberate,  high-sensitivity,  computerized  search  for  new 
radio  pulsars  at  the  Arecibo  Observatory  in  Puerto  Rico  (Hulse  and  Taylor  1975). 
The  discovery  of  a  radio  pulsar  that  is  a  member  of  a  double-star  system  indirectly 
suggested  the  emission  of  gravitational  waves,  which  had  never  been  seen  before 
(Focus  3.6,  Sect.  3.5). 

A  second  binary,  millisecond  radio  pulsar,  designated  PSR  J1 614-2230,  has 
now  been  found,  with  a  period  of  just  3.15  ms.  It  is  attributed  to  a  neutron  star  in 
orbit  around  a  white  dwarf  star  with  an  orbital  period  of  8.7  days,  and  has  been 
used  to  test  aspects  of  general  relativity  theory  other  than  gravitational  waves. 
These  investigations  have  shown  that  the  pulsar  is  the  most  massive  neutron  star 
known  so  far,  with  a  mass  of  1.97  solar  masses;  the  mass  of  the  white  dwarf 
companion  is  0.50  solar  masses  (Demorest  et  al.  2010). 
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Kramer  and  Stairs  (2008)  have  summarized  knowledge  of  the  double  pulsar; 
Hughes  (2009)  has  discussed  gravitational  waves  from  merging  compact  binaries; 
and  Joss  and  Rappaport  (1984)  have  reviewed  neutron  stars  in  interacting  binary 
systems. 

Individual  stars  are  bound  together  so  tightly  that  a  supernova  that  leads  to  the 
formation  of  a  neutron  star  in  a  binary-star  system  may  not  disrupt  its  companion; 
the  two  stars  can  remain  together,  as  evidenced  by  the  binary  pulsars.  This  is 
important  for  understanding  pulsars  that  have  been  detected  at  x-ray  wavelengths. 
Unlike  most  radio  pulsars,  they  are  members  of  close  binary-star  systems  rather 
than  single,  isolated  neutron  stars. 


13.7.3  X-ray  Pulsars  from  Neutron  Stars  in  Binary  Star 
Systems 

Because  x-rays  are  absorbed  in  our  atmosphere,  cosmic  x-ray  sources  must  be 
observed  with  instruments  launched  above  the  obscuring  air,  in  rockets  or  satel¬ 
lites.  By  the  mid-twentieth  century  brief,  5 -minute  rocket  flights  had  shown  that 
the  Sun  radiates  detectable  x-rays,  and  it  was  thought  that  lunar  material  also  might 
emit  them  when  illuminated  by  solar  x-rays. 

Riccardo  Giacconi’ s  (1931-  )  group  at  the  American  Science  and  Engineering 
Company  (AS&E)  concluded  that  x-rays  emitted  by  conventional  stellar  objects 
other  than  the  Sun  would  be  too  faint  to  be  detected  with  existing  instruments. 
They  designed  the  sensitive  equipment  needed  to  detect  the  Moon’s  x-rays  and  to 
search  for  other  unknown  sources  of  x-ray  radiation.  They  unexpectedly  found  the 
first  known  discrete  x-ray  source  outside  of  the  solar  system,  which  led  to  the 
discovery  of  a  new  class  of  cosmic  objects  and  new  physical  processes  (Giacconi 
et  al.  1962). 

This  pioneering  rocket  flight  set  the  stage  for  a  host  of  rocket  and  satellite 
observations  of  discrete  x-ray  sources,  including  x-ray  stars  that  are  1,000  times 
brighter  in  x-rays  than  the  Sun  at  all  wavelengths  and  that  are  1,000  times  more 
luminous  in  x-rays  than  in  visible  light.  The  x-rays  signaled  the  presence  of 
1 -million-degree  gas  spiraling  from  a  close  companion  star  into  a  neutron  star  or 
black  hole. 

Giacconi  was  awarded  the  2002  Nobel  Prize  in  Physics  for  these  pioneering 
contributions,  which  led  to  the  discovery  of  cosmic  x-ray  sources.  He  shared  the 
prize  with  Raymond  Davis,  Jr.  (1914-2006),  and  Masatoshi  Koshiba  (1926-  ), 
who  were  recognized  for  their  detection  of  cosmic  neutrinos. 

One  of  the  brightest  sources  in  the  newly  discovered  x-ray  sky,  designated 
Centaurus  X-3,  pulses  in  x-rays  every  4.84  s.  The  rapid  pulsating  x-ray  variations 
were  discovered  shortly  after  the  launch  of  the  first  dedicated  x-ray  satellite,  on 
December  12,  1970  from  the  offshore  San  Marco  platform,  near  the  Coast  of 
Kenya  (Giacconi  et  al.  1971).  Because  this  date  coincided  with  the  seventh 
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Table  13.12  Physical  properties  of  binary  x-ray  pulsars 
Pxp  =  period  of  x-ray  pulsar  =  0.7-800  s 

dPxp/dt  =  rate  of  decrease  of  x-ray  pulsar  period  =  —  (10_5-10-2)  PXp  year 

P0  =  orbital  period  =  5-10  days 

LXp  =  x-ray  luminosity  =  (0.1-10)  x  1030  J  s_1 

MXP  =  mass  of  x-ray  pulsar  =  1.05-1.87  M0  w  (2. 1-3.7)  x  103°  kg 


anniversary  of  the  independence  of  Kenya,  the  satellite  was  given  the  name  Uhuru , 
the  Swahili  word  for  freedom. 

After  analyzing  a  year  of  observations  of  Centaurus  X-3,  the  Uhuru  scientists 
found  a  regular  pattern  of  intensity  changes  of  the  x-ray  pulses,  which  increased 
and  decreased  in  strength  with  a  much  longer  period  of  2.087  days  and  systematic 
changes  in  the  timing  of  pulses  in  the  same  period.  These  effects  were  attributed  to 
a  companion  star,  which  was  orbiting  the  x-ray  source  and  regularly  eclipsing  its 
emission. 

During  the  next  year,  accurate  measurements  showed  that  the  average  pulsation 
period  of  Centaurus  X-3  was  getting  shorter,  which  meant  that  its  rotation  was 
speeding  up,  not  slowing  down  like  radio  pulsars.  This  indicated  that  the  rotational 
energy  of  the  x-ray  pulsar  was  increasing,  rather  than  decreasing,  with  time. 
The  physical  properties  of  such  binary  x-ray  pulsars  are  given  in  Table  13.12. 

The  gain  in  rotational  energy  is  attributed  to  matter  drawn  in  from  a  nearby 
optically  visible  companion  star  (Fig.  13.11).  Because  matter  is  being  pulled 
toward  the  surface  of  an  x-ray-emitting  neutron  star,  instead  of  being  expelled 
from  it,  the  neutron  star  is  knocked  up  to  a  faster  rotation.  The  material  spirals  in  at 
the  same  direction  as  the  neutron  star’s  rotation.  When  it  lands,  it  gives  the  neutron 
star  a  sideways  kick,  increasing  its  rotational  energy,  speeding  it  up,  and  causing 
the  rotation  period  to  become  shorter  as  time  goes  on.  Because  most  radio  pulsars 
are  not  members  of  binary-star  systems,  they  expel  material,  lose  rotational 
energy,  slow  down,  and  have  periods  that  lengthen. 

So  a  neutron  star  can  be  seen  in  x-rays  when  it  is  in  very  close  orbit  with  a 
normal  star  that  shines  in  visible  light  detected  with  an  optical  telescope.  This 
stellar  companion  does  not  radiate  detectable  x-rays,  but  gas  flowing  from  it  fuels 
the  x-ray  emitting  neutron  star. 

And  there  are  two  ways  of  looking  at  the  stellar  duo.  The  visible  picture 
portrays  only  the  normal  star,  and  the  x-ray  image  just  reveals  its  compact 
neighbor,  the  neutron  star.  A  complete  understanding  of  the  double-star  system, 
one  visible  and  the  other  unseen,  can  only  be  understood  when  the  two  perspec¬ 
tives  are  combined. 

Because  the  two  stars  are  rapidly  orbiting  around  one  another,  the  gas  from  the 
ordinary  star  does  not  fall  directly  onto  the  neutron  star  but  instead  shoots  past  the 
neutron  star,  swinging  around  it  to  form  a  whirling  disk  of  hot  gas,  known  as  an 
accretion  disk.  It  spirals  around  and  down  onto  the  central  neutron  star.  The  inner 
portions  of  the  swirling  accretion  disk  revolve  more  rapidly  than  the  outer 
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Fig.  13.11  X-ray  pulsar  The  outer  atmosphere  of  an  ordinary  star,  detected  in  optically  visible 
light,  spills  onto  its  companion,  an  invisible  neutron  star.  The  flow  of  gas  is  diverted  by  the 
powerful  magnetic  fields  of  the  neutron  star,  which  channel  the  in-falling  material  onto  the 
magnetic  polar  regions.  The  impact  of  the  gas  on  the  star  creates  a  pair  of  x-ray  hot  spots  aligned 
along  the  magnetic  axis  at  each  magnetic  cap.  Because  the  magnetic-field  axis  can  be  inclined  to 
the  neutron  star’s  rotation  axis,  the  x-ray  radiation  from  the  hot  spots  can  sweep  across  the  sky 
once  per  rotation,  which  is  observed  as  periodic  x-rays  if  one  of  the  hot  spots  intersects  the 
observer’s  line  of  sight 


portions,  just  as  the  closer  planets  orbit  the  Sun  at  faster  speeds  than  more  remote 
planets. 

The  rapidly  spinning  inner  parts  of  the  disk  constantly  rub  against  the  slower- 
moving  outer  parts.  This  viscous  friction  heats  up  the  accretion  disk  and  causes  the 
material  in  it  to  spiral  inward.  The  closer  the  material  moves  toward  the  central 
neutron  star,  the  hotter  the  in-falling  gas  becomes,  eventually  reaching  tempera¬ 
tures  of  millions  of  K  and  emitting  luminous  x-rays. 

The  intense  magnetic  field  of  a  rotating  neutron  star  acts  as  a  funnel  to  guide 
in-falling  matter  onto  a  neutron  star’s  magnetic  north  and  south  poles,  creating  an 
x-ray  pulsar.  As  the  accreting  material  heats  up  and  falls  onto  the  polar  surfaces  of 
the  neutron  star,  it  emits  two  beams  of  x-rays  that  flash  in  and  out  of  view  as  the 
neutron  star  rotates  and  one  or  two  of  the  beams  sweep  past  the  Earth. 

But  there  is  a  limit  to  the  x-ray  emission  produced  by  the  accreting  material, 
owing  to  the  radiation  pressure  it  develops.  The  maximum  luminosity  is  now 
called  the  Eddington  limit,  after  Arthur  Stanley  Eddington  (1882-1944)  who 
showed  that  greater  luminosity  would  blow  away  any  surrounding  matter  -  long 
before  the  discovery  of  any  x-ray  star  (Eddington  1926).  The  outward  force  of 
radiation  pressure  becomes  equal  to  the  inward  gravitational  force  on  the  accreting 
material  when  the  radiation  luminosity  is  at  the  Eddington  luminosity.  A  neutron 
star  that  is  accreting  material  at  close  to  the  Eddington  limit  will  heat  up  to  a 
temperature  of  about  20  million  K  and  emit  intense  x-ray  radiation  (Focus  13.4). 
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Focus  13.4  Accretion  luminosity  and  the  Eddington  limit 

The  luminous  output  resulting  from  mass  falling,  or  accreting,  onto  a 
compact  object  will  depend  upon  the  rate  of  mass  transfer,  denoted  by 
M  =  dM/dt ,  as  well  as  the  mass,  M,  and  radius,  R ,  of  the  compact  object. 
From  the  conservation  of  energy,  half  the  release  in  gravitational  potential 
energy  will  be  equal  to  the  gain  in  kinetic,  or  thermal,  energy  of  the  hot 
accreting  gas,  and  the  remaining  half  will  be  converted  into  heat.  The 
luminous  output,  Lacc ,  of  the  thermal  radiation  from  the  accreting  material 
will  be: 


L 


acc 


GMM 
2  R 


(13.31) 


The  temperature,  T,  of  the  radiating  gas  can  be  obtained  from  the  Stefan- 
Boltzmann  law,  which  results  in 
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where  the  gravitational  constant  G  =  6.61  A  x  10  m  kg  s  ,  and  the 
Stefan-Boltzmann  constant  a  =  5.6704  x  10-8  J  s-1  m-2  K-1. 

When  the  compact  receiving  object  is  a  white  dwarf  star,  the  mass  is  M  & 
M0  =  1.989  x  1030  kg,  the  radius  is  RWD  ^  6  x  106  m,  and  the  mass 
accretion  rate  M  =  10-9  M0  yr  \  where  1  year  =  3.1557  x  107  s.  The 
accretion  luminosity  is  Lacc  ^  7  x  10^  Js  ^2  L0,  and  the  temperature 
T  «  3  x  105  K.  For  a  neutron  star  with  the  same  rate  of  mass  transfer,  the 
luminosity  will  be  about  a  thousand  times  that  of  the  Sun  and  the  temper¬ 
ature  will  be  about  a  million  K,  but  higher  rates  of  mass  accretion  are 
possible. 

Arthur  Stanley  Eddington  (1882-1944)  showed  that  there  is  a  maximum 
luminosity,  now  called  the  Eddington  luminosity,  LEdd,  for  any  source  of 
radiation  before  it  blows  away  the  surrounding  matter.  In  effect,  the  outward 
force  of  radiation  pressure  from  the  compact  accreting  star  pushes  against 
the  inward  gravitational  force  on  the  accreting  material.  The  two  forces 
become  equal  when  the  radiation  luminosity  is  (Eddington  1926a,  b  Sect. 
10.1,  Focus  10.1): 


or 


4nGm„cM  ^  „  , 

LEdd  = - —  »  6.3M  J  s"1, 


G  j1 


(13.33) 
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where  M  is  the  mass  of  the  compact  accreting  star,  the  Thomson  scattering 
cross  section  for  the  electron  is  gt  =  6.65246  x  10  ^  m  ,  the  mass  of  the 
proton  is  mp  =  1.6726  x  10-27  kg,  and  the  speed  of  light  c  =  2.9979  x 
108  m  s-1. 

The  numerical  approximation  can  also  be  written: 


LEdd  ~  3.3  x  104  —  L0  J  s"1.  (13.35) 

1V1Q 

where  the  Sun’s  luminosity  L0  =  3.828  x  1026  J  s_1, 

The  Eddington  limit  sets  an  upper  limit  to  the  accretion  luminosity  of  a 
compact  object,  since  for  Lacc  greater  than  LEdd,  the  radiation  pressure  will 
inhibit  further  accretion.  The  maximum  accretion  rate,  MEdd,  is  obtained  by 
equating  the  accretion  luminosity  to  the  Eddington  limit,  or 


MEdd 


87 impcR 
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(13.36) 


or 


MEdd  ~  2  x  1011/?  kg  s-1  «3x  1O“12/?M0  yr-1,  (13.37) 

for  a  compact  object  of  radius  R.  At  the  Eddington  limit,  a  white  dwarf  star 
with  R  =  6  x  10  m,  the  accretion  rate  would  be  about  2  x  10  solar 
masses  per  year.  It  can  be  substantially  less  than  this  amount. 


Sometimes  a  compact,  invisible  object  with  a  visible  companion  can  be  force- 
fed  with  more  matter  than  it  can  consume,  and  it  hurls  the  in-falling  matter  out  in 
two  oppositely  directed  jets.  This  can  happen  when  the  invisible  star  is  a  black 
hole,  which  we  discuss  next. 


Example:  Accretion  luminosity  and  temperature  from  mass  transfer  to 
a  neutron  star 

The  maximum  mass  transfer  rate  MEdd  onto  a  neutron  star  of  radius 
Rns  =  10  km  =  104  m  will  be  MEdd  =  StwipcRns/ gt  «  2  x  1015  kg  s-1, 
where  the  proton  mass  mP  =  1.6726  x  10  ^  kg,  the  speed  of  light  c  = 

o  1 

2.9979  x  10  m  s  ,  and  the  Thomson  scattering  cross  section 
aT  =  6.65246  x  10  ~  nT.  The  accretion  luminosity  onto  a  solar-mass 
neutron  star  of  mass  M  =  1.4  M0  =  2.785  x  1030  kg  will  be  Lacc  = 

GMQMEdd/2RNs  «  1.25  x  10  J  s~  ,  where  the  Newtonian  constant  of 
gravitation  G  =  6.674  x  10"11  N  m2  kg-2.  The  temperature,  T,  of  the  gas 
can  be  estimated  from  the  Stephan-Boltzmann  law  Lacc  =  AugR2nsTa,  or 


T  —  Race  /  (4 TIGRIS ) 


1/4 


2  x  107  K,  where  the 


Stefan — B  oltzmann 
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constant  a  =  5.6704  x  10  Js  m  “  K  .  From  the  Wien  displacement 
law  (Sect.  2.4),  a  thermal  gas  at  this  temperature  will  emit  most  of  its 
radiation  at  a  wavelength  Xmax  =  0.0029/T  =  1.4  x  10“ 10  m,  or  at  x-ray 
wavelengths. 


13.8  Stellar  Black  Holes 
13.8.1  Imagining  Black  Holes 

John  Michell  (1724-1793),  a  British  clergyman  and  natural  philosopher,  suggested 
more  than  two  centuries  ago  that  certain  stars  could  remain  forever  invisible.  He 
reasoned  that  a  star  might  be  so  massive  and  its  gravitational  pull  so  powerful  that 
light  could  not  escape  it.  He  wrote:  “All  light  from  such  a  body  would  be  made  to 
return  to  it  by  its  own  proper  gravity”  (Michell  1784). 

The  French  astronomer  and  mathematician  Pierre  Simon  de  Laplace 
(1749-1827)  popularized  the  idea,  including  it  in  his  Exposition  du  systeme  du 
monde.  He  subsequently  showed  that  light  could  never  move  fast  enough  to  escape 
the  immense  gravitational  attraction  of  some  compact  stars  (Laplace  1796).  Their 
matter  might  be  so  concentrated,  and  the  pull  of  gravity  so  great,  that  light  could 
not  emerge  from  them,  making  these  stars  forever  dark  and  invisible. 

When  it  was  realized  that  light  travels  at  a  definite  speed  of 
2.9979  x  108  m  s  \  or  roughly  300,000  km  s-1,  a  black  hole  could  be  defined  as 
any  object  whose  escape  velocity,  the  velocity  required  for  escape  from  an  object’s 
gravitational  pull,  exceeds  the  velocity  of  light.  All  stars  that  we  can  observe  have 
escape  velocities  smaller  than  the  speed  of  light,  which  is  why  we  can  observe 
their  light.  The  Sun,  for  example,  has  an  escape  velocity  of  600  km  s-1.  Com¬ 
pressing  a  star  into  a  smaller  size  raises  its  escape  velocity.  When  a  dead  stellar 
core  of  approximately  the  Sun’s  mass  collapses  into  a  white  dwarf  star,  its  escape 
velocity  increases  to  about  9,000  km  s_1,  or  3  %  of  the  speed  of  light.  At  a 
neutron  star’s  radius,  the  escape  velocity  becomes  about  210,000  km  s  \  or  70 
percent  of  the  speed  of  light. 

If  a  massive  star  has  consumed  all  of  its  available  nuclear  fuel  and  its  core  mass 
exceeds  the  upper  limit  to  a  neutron  star’s  mass,  at  about  3  solar  masses,  the  stellar 
core’s  gravity  will  overcome  the  degenerate  neutron  pressure  of  even  nuclear 
matter.  At  this  point,  there  is  no  known  force  that  can  halt  the  collapse.  The 
smaller  the  collapsing  core  becomes,  the  larger  is  its  escape  velocity,  until  it 
exceeds  the  speed  of  light  and  a  stellar  black  hole  is  formed.  It  is  black,  or  rather 
invisible,  because  no  light  can  leave  it,  and  it  is  a  hole  because  nothing  that  falls 
into  it  can  escape. 
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In  other  words,  if  a  star  has  used  up  its  thermonuclear  fuel  and  its  core  is 
sufficiently  massive,  there  is  nothing  left  to  hold  back  the  inexorable  force  of 
gravity.  The  core  continues  to  collapse  forever,  vanishing  from  the  directly 
observable  universe. 


13.8.2  Observing  Stellar  Black  Holes 

Since  a  black  hole  is  invisible,  and  it  does  not  absorb,  emit,  or  reflect  radiation, 
how  do  we  know  it  is  there?  We  detect  a  black  hole  by  its  gravitational  effect  on 
the  motion  of  a  visible  star. 

With  remarkable  foresight,  the  Reverend  John  Michell  also  speculated,  in  1784, 
that  the  unseen  star  might  betray  its  presence  by  its  gravitational  effects  on  a 
nearby,  luminous  star  in  orbit  around  it  (Michell  1784).  In  modern  extensions  of 
this  idea,  a  black  hole  may  be  detected  if  it  is  in  a  tight,  close  orbit  with  a  visible 
star  whose  outer  atmosphere  spills  over  into  the  dominant  gravitational  influence 
of  the  black  hole.  This  material  swirls  around  and  down  into  the  black  hole, 
orbiting  faster  and  faster  as  it  gets  closer  -  as  a  result  of  the  ever-increasing 
gravitational  forces.  The  rapidly  moving  particles  collide  as  they  are  compressed 
to  fit  into  the  hole,  heating  the  material  to  temperatures  of  millions  of  K.  At  these 
temperatures,  the  gas  emits  almost  all  of  its  radiation  at  x-ray  wavelengths.  It  is 
analogous  to  the  accretion  disks  of  binary  x-ray  pulsars,  except  that  the  invisible 
companion  is  a  black  hole  rather  than  a  neutron  star. 

Remillard  and  McClintock  (2006)  have  reviewed  the  x-ray  properties  of  black- 
hole  binaries,  whereas  Eardley  and  Press  (1975)  provided  an  earlier  review  of 
astrophysical  processes  near  black  holes.  Mirabel  and  Rodriguez  (1999)  review 
black-hole  sources  of  relativistic  jets  in  our  Galaxy. 

So,  the  way  to  find  a  stellar  black  hole  is  to  look  for  two  stars  that  are  in  close 
orbit,  one  a  normal  visible  star  and  the  other  unseen  except  for  its  x-rays.  The 
mass,  velocity,  and  orbital  period  of  the  visible  star  can  be  used  to  determine  the 
mass  of  its  orbital  partner,  which  emits  no  visible  light.  If  that  mass  is  noticeably 
greater  that  the  upper  mass  limit  for  a  neutron  star,  set  at  about  3  solar  masses,  the 
unseen  star  is  thought  to  be  a  stellar  black  hole.  Any  normal  star  with  this  mass 
would  be  very  bright  and  easily  seen  through  a  telescope,  but  a  black  hole  is  dark, 
emitting  no  detectable  visible  light. 

The  archetype  of  a  stellar  black  hole  is  Cygnus  X- 1 ,  located  in  the  constellation 
Cygnus  and  one  of  the  first  x-ray  sources  to  be  discovered.  Rapid,  irregular  x-ray 
bursts  from  this  object  were  detected  from  the  Uhuru  satellite  in  1970.  The  x-rays 
flickered  on  and  off  as  rapidly  as  a  few  milliseconds,  and  because  nothing  travels 
faster  than  the  speed  of  light  the  emitter  had  to  be  less  than  300  km  across.  This 
meant  that  it  was  smaller  than  a  white  dwarf  star,  and  had  to  be  either  a  neutron 
star  or  a  black  hole. 
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Cygnus  X-l  is  accompanied  by  a  bright,  blue  supergiant  star  of  spectral  class  O, 
located  at  a  distance  of  about  6,000  light-years  from  the  Earth.  Observations  of  the 
continuously  and  periodically  shifting  spectral  lines  of  this  bright  visible  star  -  by 
the  English  astronomers  B.  Louise  Webster  (1941-1990)  and  Paul  Murdin 
(1942-  ),  and  independently  confirmed  by  the  Canadian  astronomer  Charles 
Thomas  Bolton  (1943-  )  -  indicated  that  it  is  revolving  every  5.60  days  about  an 
invisible  companion  of  more  than  eight  times  the  mass  of  the  Sun  for  the  unseen 
companion  (Webster  and  Murdin  1972;  Bolton  1972,  1975).  It  emits  no  light  and 
its  mass  is  greater  than  that  of  the  upper  mass  limit  for  a  neutron  star,  of  about  3.0 
solar  masses;  therefore,  by  elimination,  it  must  be  a  stellar  black  hole. 

Recent  measurements  of  the  Cygnus  X-l  binary  star  system  indicate  that  the 
visible  supergiant  star  has  a  mass  of  19.2  solar  masses  and  the  unseen  companion 
has  a  mass  of  14.8  solar  masses,  which  confirms  that  its  mass  is  well  above  the 
upper  mass  limit  for  a  neutron  star.  In  addition,  the  two  stars  are  separated  by  only 
0.2  AU,  or  20  percent  of  the  distance  from  the  Earth  to  the  Sun  and  about  half  the 
separation  of  Mercury  from  the  Sun  (Orosz  et  al.  2011;  Reid  et  al.  2011).  Like 
other  supergiant  stars  of  its  spectral  type,  the  visible  star  is  thought  to  be  shedding 
mass  in  a  stellar  wind  at  a  rate  of  about  2.3  solar  masses  every  1  million  years.  Due 
to  the  proximity  of  the  invisible  companion,  a  significant  portion  of  this  wind  is 
being  drawn  into  the  black  hole  to  form  its  x-ray- emitting  accretion  disk. 

We  now  know  of  many  stellar  black  holes  identified  in  this  way.  The  orbital 
properties  of  visible  companions  of  cosmic  x-ray  sources  indicate  masses  beyond 
the  neutron-star  limit.  Like  Cygnus  X-l,  many  of  these  black  holes  can  also  exhibit 
highly  luminous,  rapid,  and  irregular  x-ray  outbursts,  showing  that  they  are  very 
small  on  a  cosmic  scale.  The  transient  x-ray  flickering,  sometimes  brightening  a 
million-fold  in  milliseconds,  is  most  likely  emitted  as  in-falling  material  takes  the 
final  plunge  and  vanishes  into  the  black  hole. 


13.8.3  Describing  Black  Holes 

The  outer  edge  of  a  black  hole  can  be  defined  as  the  radius  at  which  the  escape 
velocity,  required  to  escape  from  its  gravitational  pull,  is  equal  to  the  speed  of  light, 
or  when  the  kinetic  energy  of  an  object  moving  at  this  speed  is  equal  to  the  gravi¬ 
tational  potential  energy  of  the  mass  holding  it  in.  This  radius  is  3,000  m  for  a  stellar 
black  hole  with  the  mass  of  the  Sun.  It  is  known  as  the  Schwarzschild  radius  in 
recognition  of  the  German  astronomer  Karl  Schwarzschild  (1873-1916),  who  first 
recognized  its  mathematical  significance,  and  also  sometimes  called  the  gravita¬ 
tional  radius.  While  serving  as  an  artillery  lieutenant  on  the  Russian  front  during 
World  War  I  (1914-1918),  Schwarzschild  derived  the  solution  to  Einstein’s  General 
Theory  of  Relativity  for  a  spherical,  non-rotating  black  hole.  His  publication,  titled 
“On  the  Field  of  Gravity  of  a  Point  Mass  in  the  Einsteinian  Theory”  can  be  used  to 
specify  the  space  time  intervals  outside  a  black  hole  (Schwarzschild  1916). 
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The  radius  of  a  black  hole  can  be  defined  as  the  radius,  R ,  at  which  the  escape 
velocity,  Vesc ,  of  a  particle  of  mass,  m,  from  a  larger  mass,  M,  and  radius,  R ,  is 
equal  to  the  speed  of  light,  c,  or  when  Vesc  =  (2 GM/R)1^2  =  c,  for  a  gravitational 
constant  G.  That’s  obtained  from  classical  physics  by  equating  the  kinetic  energy 
of  a  moving  object  to  the  gravitational  potential  energy  of  the  mass  holding  it  in, 
with  Vesc  =  c.  That  is  the  kinetic  energy  mV2sc/2  =  GMm/R ,  the  gravitational 
potential  energy.  Solving  for  the  radius,  we  obtain  the  Schwarzschild  radius  Rg 
given  by 


2  GM 


2.95  x  103 


m, 


(13.38) 


_  11  O  _  1  _ O 

where  the  gravitational  constant  G  =  6.674xl0  mkg  s  ,  the  speed  of 
light  c  =  2.9979  x  108  m  s-1,  and  the  solar  mass  MQ  =  1.989  x  1030  kg. 


Example:  Schwarzschild  radius  of  the  Earth  and  Sun 

The  Schwarzschild  radius,  also  known  as  the  gravitational  radius,  of  the 
Earth  is  RSE  =  2 GME/c  «  8.87  x  10“^  m,  where  the  Newtonian  gravi- 

_  11  o  _ o 

tational  constant  G  =  6.674  x  10  N  m^  kg  ,  the  mass  of  the  Earth 
Me  =  5.974  x  1024  kg,  and  the  speed  of  light  c  =  2.9979  x  108  m  s_1. 
The  physical  radius  of  the  Earth  is  RE  =  6.378  x  106  m,  which  is  almost  a 
billion  times  larger  than  the  planet’s  gravitational  radius.  For  the  Sun,  the 

o 

gravitational  radius  Rs&  ~  2.95  x  10  m  «  3  km,  for  a  solar  mass  M0  = 
1.989  x  1030  kg.  The  physical  radius  of  the  Sun  R&  =  6.955  x  108  m  is 
more  than  a  hundred  thousand  times  larger  than  its  gravitational  radius. 
At  the  time  that  Schwarzschild  derived  the  metric  involving  this  term,  there 
was  no  known  physical  object  whose  linear  size  was  smaller  than  its  grav¬ 
itational,  or  Schwarzschild,  radius. 


In  the  Special  Theory  of  Relativity ,  space  and  time  were  combined  to  define  a 
metric,  ds ,  or  space  time  interval,  given  by  (Minkowski  1908;  Hargreaves  1908) 

ds 2  =  c2dt 2  —  dx2  —  dy2  —  dz 2,  (13.39) 

for  time  interval  dt  and  space  coordinates  v,  y,  z.  The  speed  of  light,  c,  has  been 
added  to  give  dt  the  units  of  distance.  This  metric  applies  to  Euclidean  space 
without  gravity  and  no  curvature  of  space. 

In  the  General  Theory  of  Relativity,  for  which  gravity  manifests  itself  in  the 
curvature  of  space  time,  the  metric  outside  a  non-rotating  mass  in  a  vacuum  is  the 
Schwarzschild  metric  given  by  (Schwarzschild  1916): 


2  GM 
c2r 


c2dt2 


dr2 

"|  2GM~ 
c2r  . 


r2dd2  —  r2  sin2  6d(j>2. 


(13.40) 
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Here  r,  6,  <f>  are  spherical  coordinates  whose  origin  is  at  the  center  of  the 
massive  object,  and  M  is  the  mass.  The  trajectories  of  free-falling  particle  and 
radiation  are  described  by  a  null  geodesic,  for  which  ds  =  0. 

A  proper  time  x  is  defined  by  the  proper  time  interval  dx  given  by: 

(13.41) 

For  a  clock  at  rest,  the  space  coordinates  are  not  changing,  so  dr  =  d6  = 
d<j)  =  0,  and  the  proper  time  interval  is: 

1/2 

dt.  (13.42) 

The  proper  time  interval  approaches  zero  times  dt  when  the  radius  R  approaches 

r\ 

the  Schwarzschild  radius  Rs  =  2 GM/c  .  A  clock  placed  on  a  collapsing  star  will 
appear,  to  a  distant  observer,  to  tick  more  slowly  as  the  star’s  radius  approaches 
the  Schwarzschild  radius.  This  is  known  as  gravitational  time  dilation.  The  clock 
will  appear  to  stop,  at  this  critical  radius,  when  time  will  seem  to  go  on  forever  and 
the  star  has  become  “frozen”. 

The  decline  in  the  observed  radiation  due  to  gravitational  time  dilation 
is  expressed  as  a  decrease  in  the  number,  Nph,  of  photons  observed,  with 
(Oppenheimer  and  Snyder  1939): 


dx  = 


1  - 


2  GM 

c2R 


Nph  oc  exp 


or  with  a  characteristic  observed  free-fall  time,  Tff,  given  by: 


Tff 


2 Rs 
c 


4  GM  o 

— —  ss  2.94  x  103 
cl 


where  the  Sun’s  mass  M0  =  1.989  x  1030  kg. 


(13.43) 


(13.44) 


Example:  A  star’s  core  collapse  time  to  become  a  black  hole 

If  a  sufficiently  massive  star  consumes  all  the  available  thermonuclear  fuel, 
and  its  core  is  more  massive  than  about  3  solar  masses,  it  will  collapse  to  a 
black  hole  in  a  free-fall  time  given  by  xff  =  4 GM/ c  ,  with  the  gravitational 

_  11  O  _  1  _ o 

constant  G  =  6.674xl0  m  kg  s  ^  and  the  speed  of  light 

o  1 

c  =  2.9979  x  10  m  s  .  For  a  5  solar  mass  star  with  M  =  5  M0,  where  the 
Sun’s  mass  M0  =  1.989  x  1030  kg,  the  free-fall  time  is  Tff  «  104  s. 

During  this  time,  the  number  of  observed  photons  will  have  decreased  by 
exp  (—1,000),  or  e~ 100°,  a  very  small  number,  so  the  star  will  have  effec¬ 
tively  disappeared,  becoming  invisible  and  black. 
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The  Schwarzschild  metric  has  a  one  over  zero  term,  a  singularity,  which  blows 
up  at  the  Schwarzschild,  or  gravitational,  radius  Rg  =  2 GM/ c  ,  which  is  the  radius 
at  which  the  escape  velocity  becomes  equal  to  the  speed  of  light.  In  1963  the  New 
Zealand  mathematician  Roy  Kerr  (1934- )  described  the  space-time  metric  outside 
a  rotating  black  hole,  and  it  reduces  to  Schwarzschild’ s  metric  when  there  is  no 
rotation.  The  gravitational  field  outside  black  holes  are  uniquely  determined  by 
their  mass  and  angular  momentum  and  described  by  Kerr  metrics. 

The  singularity  can  be  moved  down  to  the  center  of  the  black  hole  by  a  clever 
change  in  geometry,  but  it  never  disappears.  It  is  a  location  where  matter  is 
compressed  into  an  undefined  state  of  infinite  mass  density.  That  is,  in  1960  two 
mathematicians,  the  American  Martin  Kruskal  (1925-2006)  and  the  Hungarian- 
Australian  George  Szekeres  (1911-2005),  defined  a  line  element  that  removes  the 
singularity  at  the  Schwarzschild  radius  through  a  coordinate  transformation,  but 
that  didn’t  mean  that  the  singularity  disappeared  -  you  can’t  form  a  black  hole 
within  the  constraints  of  General  Relativity  without  having  one;  the  English 
astrophysicist  Roger  Penrose  (1931-  )  demonstrated  that  (Penrose  1965).  The 
singularity  was  just  moved  down  to  the  center  of  the  black  hole  in  the  Kruskal- 
Szekeres  transformation. 

Nevertheless,  the  defining  notion  of  a  black  hole,  in  terms  of  an  escape  velocity 
that  exceeds  the  speed  of  light,  and  the  method  of  inferring  its  presence  in  an  x-ray 
emitting  binary  star  system  with  a  close  visible  companion,  do  not  depend  on 
Einstein’s  theory  and  can  be  determined  from  classical  Newtonian  physics  without 
any  singularity. 

The  Schwarzschild  radius,  which  can  be  located  outside  the  singular  center,  has 
not  lost  its  significance.  It  marks  the  event  horizon  -  literally,  a  horizon  in  the 
geometry  of  space-time  beyond  which  no  event  can  be  seen,  just  as  the  Earth’s 
horizon  is  the  boundary  for  our  vision.  Nearby  space  then  is  said  to  curl  into  a 
black  hole,  carrying  light  and  matter  and  any  other  form  of  energy  with  it.  They  are 
so  intensely  wrapped  around  a  black  hole  that  it  becomes  a  cocoon  disconnected 
from  the  outside  and  cut  off  forever  from  the  rest  of  the  universe. 

Black  holes  are  mysterious  objects.  They  cannot  be  observed  directly,  because 
any  radiation  they  might  emit  cannot  escape.  A  black  hole’s  presence  can  be 
inferred  only  from  indirect,  circumstantial  evidence,  using  measurements  in  the 
accessible  parts  of  the  universe  —  the  visible  stars  —  to  make  inferences  about  the 
dark  places  that  cannot  be  observed. 


Chapter  14 

A  Larger,  Expanding  Universe 


14.1  The  Milky  Way 

14.1.1  A  Fathomless  Disk  of  Stars 

On  a  clear,  moonless  night,  we  can  look  up  and  see  a  hazy,  faintly  luminous  band 
of  light  that  stretches  across  the  sky  from  one  horizon  to  the  other;  it  is  known  as 
the  Milky  Way  (Fig.  14.1).  According  to  ancient  Greek  myth,  the  goddess  Hera, 
Queen  of  Heaven,  spilled  milk  from  her  breast  into  the  sky.  The  Romans  called  the 
spilt  milk  the  Via  Lactea ,  or  the  “Milky  Way.”  It  also  is  designated  as  our  Galaxy, 
derived  from  the  Greek  word  galakt-  for  “milk,”  the  celestial  milk  from  Hera’s 
breast. 

We  are  immersed  within  the  Milky  Way ,  viewing  it  edgewise  from  inside.  When 
gazing  directly  into  the  band  of  starlight,  we  cannot  see  through  to  stars  at  the 
center  or  distant  edges  of  the  Milky  Way,  but  if  we  look  up  and  outside  the  thin 
disk  of  stars,  we  can  look  beyond  them.  It  is  similar  to  living  in  a  city:  We  notice 
buildings  all  around  us,  but  none  when  we  look  up  into  the  sky. 

When  Galileo  Galilei  (1564-1642)  turned  one  of  the  first  telescopes  toward  the 
Milky  Way,  he  found  that  it  contains  many  otherwise  unseen  stars,  which  are  too 
faint  to  be  seen  by  the  unaided  human  eye  (Galilei  1610).  Astronomers  subse¬ 
quently  built  increasingly  larger  telescopes,  which  collect  more  starlight  and 
enable  us  to  see  the  dim,  golden  beacons  of  fainter  stars.  They  discovered  more 
dim  stars  located  between  or  beyond  the  brighter  ones,  which  make  the  Milky  Way 
look  like  a  continuously  distributed  band  of  light  when  observed  by  the  unaided 
eye. 

The  German-born  English  astronomer  William  Herschel  (1738-1822)  spent 
much  of  his  life  trying  to  determine  the  shape  and  size  of  the  Milky  Way;  he 
constructed  the  biggest  telescopes  at  the  time,  with  the  largest  mirrors  and  greatest 
light-gathering  power.  By  counting  the  number  of  stars  of  different  observed 
brightness  in  various  directions  in  the  night  sky,  he  hoped  to  determine  the  places 
at  which  the  stars  disappeared,  thereby  determining  the  depths  of  the  Milky  Way. 


K.  R.  Lang,  Essential  Astrophysics,  Undergraduate  Lecture  Notes  in  Physics, 
DOI:  10. 1007/978-3-642-35963-7_14,  ©  Springer- Verlag  Berlin  Heidelberg  2013 
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Fig.  14.1  The  Milky  Way  A  panoramic  telescopic  view  of  the  Milky  Way,  the  luminous 
concentration  of  bright  stars  and  dark  intervening  dust  clouds  that  extends  in  a  band  across  the 
celestial  sphere.  We  live  in  this  disk  and  look  out  through  it.  Our  view  is  eventually  blocked  by 
the  buildup  of  interstellar  dust,  and  the  light  from  more  distant  regions  of  the  disk  cannot  get 
through.  The  center  of  the  Milky  Way  is  located  at  the  center  of  the  image,  in  the  direction  of  the 
constellation  Sagittarius.  Although  the  disk  appears  wider  in  that  direction,  the  center  is  not 
visible  through  the  dust.  The  large  and  small  magellanic  clouds  can  be  seen  as  bright  swirls  of 
light  below  the  plane  to  the  right  of  center  (this  map  of  the  Milky  Way  was  hand-drawn  from 
many  photographs  by  Martin  and  Tatjana  Keskula  under  the  direction  of  Knut  Lundmark; 
courtesy  of  the  Lund  Observatory,  Sweden) 

But  the  giant  telescopes  were  not  big  enough  to  fathom  the  profundity  or  depth 
of  the  Milky  Way.  By  collecting  greater  amounts  of  light  than  a  smaller  telescope, 
Herschel’s  biggest  telescope  brought  fainter  stars  into  view  and  pushed  the  edge  of 
the  known  universe  further  into  space.  These  stars  were  concentrated  in  a  flattened 
disk  with  the  Sun  at  the  center,  and  whose  greatest  extent  is  in  the  plane  of  the 
Milky  Way  (Herschel  1785). 

Although  Herschel  concluded  that  the  Sun  is  in  the  center  of  a  flattened  disk  of 
stars  with  a  disk  diameter  five  time  times  its  thickness,  he  had  no  way  to  determine 
its  size.  Early  in  the  twentieth  century,  the  Dutch  astronomer  Jacobus  C.  Kapteyn 
(1851-1922)  and  his  colleagues  resumed  the  star  counts,  arriving  at  a  similarly 
flattened,  Sun-centered  distribution  of  stars  with  the  greatest  extent  in  the  Milky 
Way.  Measurements  of  the  distance  of  some  of  these  stars,  using  their  parallax, 
provided  a  scale  to  Kapteyn’ s  universe  of  about  1.5  kpc  by  12  kpc,  where  1  kpc  is 
1,000  pc  or  3.0857  x  1019  m  (Kapteyn  1922). 

However,  astronomers  have  never  succeeded  in  deciphering  the  true  extent  of 
the  Milky  Way  by  observing  its  stars,  even  when  looking  much  farther  into  it  using 
larger  telescopes  and  photographic  or  electronic  techniques  that  permitted  long 
exposures.  This  is  because  the  most  powerful  telescopes  can  discern  only  the 
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Fig.  14.2  Globular  star  cluster  More  than  a  hundred  thousand  stars  are  collected  together  in 
this  globular  star  cluster  designated  NGC  362,  which  is  located  about  27,700  light-years  away  in 
the  southern  sky.  It  is  one  of  many  star  clusters  that  are  located  in  an  extensive,  spherical  halo 
around  our  Milky  Way.  They  formed  in  the  early  evolution  of  the  Milky  Way,  and  contain  stars 
that  are  more  than  10  billion  years  old.  (Courtesy  of  Royal  Observatory,  Edinburgh.) 


visible  parts  of  our  stellar  system,  not  its  most  distant,  invisible  parts  that  lie 
behind  an  opaque  veil  of  interstellar  dust. 

This  dust  blocks  our  view  when  we  look  deep  into  the  plane  of  the  Milky  Way. 
The  total  amount  of  dust  through  which  we  are  looking  builds  up  with  distance  and 
eventually  makes  an  impenetrable  barrier;  it  becomes  so  thick  and  dense  that  it 
blocks  the  light  of  distant  stars.  We  can  see  only  that  far;  more  distant  objects  are 
hidden  from  view.  New  perspectives  were  required  to  look  outside  and  eventually 
beyond  this  barrier  to  the  heavens. 


14.1.2  The  Sun  is  Not  at  the  Center  of  Our  Stellar  System 

The  true  enormity  of  our  stellar  system  was  discovered  by  using  Cepheid  variable 
stars  to  gauge  the  distances  of  globular  star  clusters  (Fig.  14.2)  located  outside  the 
plane  of  the  Milky  Way.  These  yellow  supergiant  Cepheids  periodically  brighten 
and  dim  with  a  period  that  increases  with  a  star’s  luminosity.  Measurements  of  this 
period  and,  therefore,  the  stellar  luminosity  can  be  combined  with  observations  of 
a  star’s  brightness  to  determine  its  distance  (Focus  14.1).  Because  the  Cepheid 
variable  stars  are  very  luminous,  they  are  conspicuous  and  can  be  seen  to 
exceptional  distances,  where  conventional  parallax  methods  of  determining  stellar 
distance  do  not  work.  At  large  distances,  the  parallax  angles  are  too  small  to  be 
reliably  measured,  even  from  space. 
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Focus  14.1  Cepheid  variable  stars 

The  luminosity  of  some  stars  does  not  remain  constant  but  instead  fluctuates 
over  regular  periods.  These  stars  do  not  only  turn  on  and  off,  like  a  switched 
house  light,  but  instead  gradually  vary  from  dimmer  to  brighter  and  then 
back  to  dimmer  again  with  periods  ranging  from  a  few  days  to  a  few  months. 

The  very  luminous  variable  stars  are  know  as  Cepheid  variable  stars ,  from 
their  prototype,  Delta  Cephei.  The  deaf  English  astronomer  John  Goodricke 
(1764-1786)  first  noticed  its  variability  (Goodricke  1785).  Edward  Pigott 
(1753-1825),  another  Englishman,  discovered  the  Cepheid  variable  Eta 
Aquilae  a  few  months  earlier  in  the  same  year  (Pigott  1785).  The  North  Star, 
Polaris,  is  also  a  Cepheid  variable  star,  the  closest  one  known. 

The  Cepheids  have  luminosities  up  to  100,000  times  that  of  the  Sun  and 
masses  of  4-20  times  the  solar  mass.  Because  they  are  so  luminous,  these 
stars  can  be  seen  over  a  wide  range  of  distances,  from  Delta  Cephei,  located 
at  only  about  272  parsecs,  or  887  light-years,  from  the  Earth  (Benedict  et  al. 
2002),  to  galaxies  100  million  light-years  away  (Freedman  et  al.  2001). 

The  more  luminous  a  Cepheid  variable  star  is,  the  more  slowly  it  varies 
and  the  longer  the  period  of  its  luminosity  change.  This  period-luminosity 
relationship  was  first  discovered  from  observations  of  variable  stars  in  the 
Large  and  Small  Magellanic  clouds,  which  are  nearby  satellites  of  the  Milky 
Way.  These  stellar  systems,  visible  from  the  Earth’s  Southern  Hemisphere, 
are  named  for  the  Portuguese  explorer  Ferdinand  Magellan  (1480-1521), 
who  observed  them  when  his  ships  were  circumnavigating  the  world  for  the 
first  time. 

At  the  end  of  the  nineteenth  century,  Harvard  College  established  an 
observatory  at  Arequipa,  Peru,  with  a  0.6  m  (24  inch)  refractor  that  was  used 
in  a  photographic  survey  of  the  southern  sky,  including  the  Magellanic  Clouds. 
Because  of  their  proximity,  the  clouds  could  be  resolved  into  stars.  From  these 
photographs,  Henrietta  Swan  Leavitt  (1868-1921),  a  researcher  in 
Cambridge,  Massachusetts,  found  an  extraordinary  total  of  1,777  variable 
stars.  She  reported  that  the  brighter  stars  tended  to  have  the  longer  cycles  of 
variation  (Leavitt  1908).  Because  the  extent  of  the  Magellanic  Clouds  is  small 
compared  to  their  distance,  the  relationship  of  period  to  apparent  brightness 
also  implied  a  real  connection  with  luminosity.  Four  years  later,  Leavitt  had 
obtained  precise  apparent  brightness  and  period  data  for  25  variable  stars  in 
the  Small  Magellanic  Cloud,  thereby  establishing  the  important  period- 
luminosity  relation  for  the  Cepheid  variables  (Leavitt  1912). 

Once  this  relation  is  calibrated  suitably  by  the  measurement  of  a  precise 
distance  to  one  Cepheid  variable  star  using  independent  methods,  observa¬ 
tion  of  the  variation  period  leads  to  determination  of  the  star’s  luminosity. 
Then,  using  the  observed  brightness,  the  distance  of  the  star  can  be  calcu¬ 
lated.  This  technique  of  measuring  distances  with  Cepheid  variable  stars  has 
been  used  to  demonstrate  the  vast  extent  of  the  Milky  Way  -  as  well  as  the 
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Sun’s  place  within  it  -  and,  subsequently,  to  discover  the  extragalactic  nature 
of  spiral  nebulae. 

The  early  history  of  the  period-luminosity  relation  and  its  calibration  has 
been  reviewed  by  Fernie  (1969)  and  Sandage  (1972),  and  includes  the  initial 
calibration.  The  relation  has  been  used  to  place  constraints  on  the  size  and 
shape  of  the  Milky  Way,  and  to  determine  slightly  incorrect  distance  esti¬ 
mates  for  nearby  spiral  nebulae. 

There  are  the  brilliant,  younger,  and  more  massive  classical  Cepheids,  of 
stellar  Population  I,  which  are  found  in  the  arms  of  spiral  galaxies,  and  the 
older,  fainter  Cepheids  of  Population  II,  located  in  globular  star  clusters. 

Leavitt  had  no  idea  why  the  luminous  output  of  these  stars  varies,  but 
within  a  few  years  the  inquisitive  English  astronomer  Arthur  Stanley  Edd¬ 
ington  (1882-1944)  showed  how  (Eddington  1918,  1919).  The  stars  are 
pulsating  with  a  regular  beat,  expanding  out  and  contracting  in,  rising  and 
falling  back.  And  since  the  star’s  size  is  changing,  the  period-luminosity 
relation  can  also  be  expressed  as  a  period-radius  relation.  The  radial  oscil¬ 
lations  have  been  observed  using  the  Doppler  shift  of  spectral  lines  that  arise 
in  the  stellar  atmospheres. 

It  follows  from  Eddington’s  theory  that  the  pulsation  period  is  inversely 
proportional  to  the  square  root  of  the  star’s  average  mass  density,  so  more 
rarefied  stars  pulsate  more  slowly  than  dense  ones.  This  also  means  that  the 
more  massive  stars,  which  are  also  the  larger  and  more  luminous  stars, 
possess  longer  pulsation  periods. 

Once  it  was  realized  that  stars  are  primarily  composed  of  hydrogen  and 
helium,  Eddington  could  show  that  stellar  pulsations  might  originate  in  an 
outer  convective  zone  where  hydrogen  is  alternately  ionized  and  neutral, 
acting  as  a  valve  that  absorbs  and  releases  heating  radiation  during  the 
course  of  the  pulsation. 

Over  a  decade  later,  the  Russian  astronomer  Sergei  A.  Zhevakin 
(1916-2001)  showed  that  the  convective  ionized  hydrogen  zone  could  not 
maintain  the  pulsations  because  it  does  not  absorb  sufficient  energy  during 
the  contraction  of  the  star.  He  found  that  an  outer  region  of  doubly  ionized 
helium  acts  as  a  valve  for  the  heat  engine  that  drives  the  pulsations  of 
Cepheid  variable  stars  (Zhevakin  1963;  Cox  1980).  Lang  (1999)  provides 
references  to  individual  papers  on  stellar  pulsations.  The  varying  star  absorbs 
the  outward  flow  of  energy  from  the  star’s  center  during  stellar  contraction 
and  repeatedly  returns  it  during  expansion. 

Because  the  pulsation  depends  upon  a  critical  stage  of  ionization,  a  star 
can  maintain  them  only  for  a  specific  combination  of  size  and  temperature  in 
a  narrow  range  of  mass  densities.  This  defines  a  strip  of  instability  in  the 
Hertzsprung-Russell  diagram. 
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At  optically  visible  wavelengths,  astronomers  establish  the  zero  point,  a,  and 
slope,  b ,  in  the  period-luminosity  relation  for  a  variable  star: 

My  =  —a  —  b  log  P.  (14.1) 

where  Mv  is  the  absolute  visual  magnitude  and  P  is  the  period.  A  precise  cali¬ 
bration  of  the  period-luminosity  relation  involves  an  accurate  determination  of  the 
stellar  distances  by  some  independent  method.  Benedict  et  al.  (2007),  for  example, 
used  the  Hubble  Space  Telescope  to  obtain  the  parallax  and  distance  of  10  nearby 
Cepheid  variables,  obtaining  the  relation: 


Mv  =  -1.62  -  2.43  log  P ,  (14.2) 

where  P  is  the  period  in  days  and  the  uncertainty  in  the  absolute  visual  magnitude 
Mv  is  ±0.10.  This  is  the  period-luminosity  relation  for  classical  Cepheid  variables , 
also  known  as  Population  I  Cepheids ,  in  the  Milky  Way. 


Example:  Distance  to  Delta  Cephei 

The  prototype  Cepheid  variable  star  is  Delta  Cephei,  whose  variation  period  is 
P  =  5.36634  days.  The  period-luminosity  relation  Mv  =  —1.62  —  2.43  log 
P  results  in  an  absolute  visual  magnitude  Mv  «  —3.40  ±  0.01.  One  could 
infer  the  star’s  distance,  D ,  from  the  apparent  magnitude,  mv,  and  the  relation 
mv  —  Mv  =  5  log  D  —  5,  where  the  distance  is  in  parsecs  (Sect.  10.1).  Unlike 
most  stars,  however,  Delta  Cephei  is  a  variable  star  with  an  apparent  magnitude 
that  ranges  from  mv  =  3.48  to  4.37,  resulting  in  uncertainties  in  the  distance 
determination  unless  a  mean  apparent  magnitude  is  used.  Instead,  the  Hubble 
Space  Telescope  has  been  use  to  determine  the  parallax,  nA  = 
(3.66  =b  0.15)  x  10-3  s  of  arc  and  a  distance  of  D  =  l/nA  =  273  d=  10 
parsecs  and  about  887  light-years  (Benedict  et  al.  2002).  This  distance  corre¬ 
sponds  to  an  apparent  visual  magnitude  mv  «  3.8  ±  0.1. 


It  was  the  American  astronomer  Harlow  Shapley  (1885-1972)  who  observed 
Cepheids  in  globular  star  clusters  outside  the  plane  of  the  Milky  Way.  He  showed 
that  they  are  distributed  within  a  roughly  spherical  system,  which  is  centered  far 
from  the  Sun  in  the  direction  of  the  constellation  Sagittarius  (Shapley  1918, 
Fig.  14.3). 

These  dense  stellar  clusters  orbit  the  core  of  the  Milky  Way  in  great  elongated 
ellipses,  passing  through  the  Milky  Way  every  100  million  years  or  so.  In  contrast, 
the  stars  within  the  plane  of  the  Milky  Way  go  around  its  center  in  roughly  circular 
orbits  with  comparable  periods  for  stars  about  as  distant  from  the  core  as  the  Sun  is. 

Most  of  the  stars  and  interstellar  gas  in  the  Milky  Way  are  located  within  this 
flattened,  plate-shaped,  rotating  disk.  Its  center,  known  as  the  galactic  center ,  is 

on 

located  at  a  distance  D0  ^8.5  kpc,  or  about  2.5  x  10“  m  and  27,700  light-years, 
from  the  Sun.  This  distance  is  1 .7  billion  times  the  distance  between  the  Earth  and 
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Fig.  14.3  Edge-on  view  of  the  Milky  Way  As  shown  in  1918  by  the  American  astronomer 
Harlow  Shapley  (1885—1972),  the  globular  star  clusters  are  distributed  in  a  roughly  spherical 
system  whose  center  coincides  with  the  core  of  our  Milky  Way.  The  Sun  is  located  in  the  disk, 
about  27,700  light-years  away  from  the  center.  The  disk  and  central  bulge  are  shown  edge-on  in  a 
negative  print  of  an  infrared  image  taken  from  the  InfraRed  Astronomical  Satellite.  The  infrared 
observations  can  penetrate  the  obscuring  veil  of  interstellar  dust  that  hides  the  distant  Milky  Way 
from  observation  at  optically  visible  wavelengths.  It  is  this  dust  that  limited  astronomers’  view  of 
stars  to  a  much  smaller  Kapteyn  Universe,  centered  on  the  Sun 

the  Sun.  Reid  (1993)  has  reviewed  determinations  of  this  distance  that  were 
available  at  the  time.  Even  relatively  recent  estimates  indicate  that  it  is  quite 
uncertain  with  D©  =  8.33  =b  0.35  kpc  (Gillessen  et  al.  2009). 

The  disk  has  a  radius  of  about  50,000  light-years  or  15  kpc,  and  a  thickness 
around  3,000  light-years  or  1  kpc.  The  bright,  massive  young  stars  are  found  in  a 
disk  that  is  only  120  pc  or  so  thick,  despite  being  over  30,000  pc  across.  Other 
older  types  of  stars  define  thicker  disks;  some  of  them  form  disks  that  are  2,000  pc 
thick.  Assuming  a  disk  thickness  of  about  1,000  pc  and  a  disk  radius  of  15,000  pc, 
the  Milky  Way  has  a  volume  of  about  700  billion  (7  x  1011)  cubic  parsecs. 

How  many  stars  are  in  the  Milky  Way?  The  distribution  of  stars  mapped  by  the 
HIPPARCOS  mission  indicates  that  the  mass  density  of  the  stars  near  the  Sun  is 

_ nn  _ o  o 

Pdisk  ~  0.515  x  10  ^  kg  m  «  0.076  M0pc  ,  which  is  probably  accurate  to 
within  a  factor  of  two.  The  local  mass  density  of  main-sequence  stars  has,  for 

_ Q 

example,  been  estimated  at  about  0.03 1  M©  pc  ,  including  those  of  spectral  type 
M  that  account  for  the  overwhelming  majority  of  stars  in  the  Milky  Way  (Reid 
et  al.  2002).  Assuming  a  uniform  distribution  of  stars  in  the  Milky  Way  disk  and 
multiplying  the  lower  mass  density  estimate  by  the  disk  volume  of  700  billion 
cubic  parsecs,  we  obtain  a  lower  limit  of  at  least  20  billion  stars  with  a  mass  equal 
to  that  of  the  Sun,  which  is  designated  M©.  Because  the  stars  are  more  concen¬ 
trated  toward  the  central  region,  the  Milky  Way  most  likely  contains  at  least  50 
billion  to  100  billion  stars  like  the  Sun,  and  we  can  see  only  about  5,000  of  them 
with  the  unaided  eye. 

The  physical  parameters  of  the  galactic  disk  are  listed  in  Table  14.1. 
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Table  14.1  Physical  properties  of  the  Milky  Way  disk 

Rdisk  —  radius  of  disk  =  50,000  light-years  %  15,000  pc  «  4.6  x  1020  m 

Ldisk  =  thickness  of  disk  =  3,000  light-years  w  1,000  pc  «  3.0857  x  1019  m 

Dq  =  R0  =  Sun’s  distance  from  the  center  =  27,700  light-years  =  8.5  kpc  =  2.6  x  102()  m 

VQ  =  V o  =  Sun’s  orbital  velocity  about  center  =  220  km  s-1 

PQ  =  P0  =  Sun’s  orbital  period  about  center  =  7.6  x  1015  s  =  2.4  x  108  years 

Mdisk  =  mass  of  disk  =  1011  M0  w  2  x  1041  kg 

Ndisk  =  number  of  stars  in  Milky  Way  =  100  billion  or  1011  stars  like  the  Sun 

LBdisk  =  luminosity  in  blue  band  =  1.9  x  101()  LBQ  &  7.2  x  1036  J  s-1 

_ _ o 

pdisk  =  mass  density  of  disk  near  the  Sun  0.515  x  10  kg 
=  O.O76M0pc~3  =  0.0022 M0  (light-year)-3. 

_ Cl  _ Q 

Ndisk  =  number  density  of  stars  in  disk  near  Sun  =  2.59  x  10-J1  m 

Sdisk  =  separation  of  adjacent  stars  in  disk  %  6.5  light-years  «  2  pc  %  6.2  x  1016  m 

Age  =  oldest  disk  stars  =  (6—13.5)  x  109  year  =  6  to  13.5  Gyr,  young  stars  are  still  forming 

Oort’s  constants  A  =  (+14.82  ±  0.84)  km  s-1  kpc-1  and  B  =  (—12.7  ±  0.64)  km  s-1  kpc-1. 

Center  of  the  Milky  Way:  right  ascension  a(2,000)  =  17  h  45  m  40.04  s, 

Declination  d(2,000)  =  -29°  00'  28.1" 


Relatively  young  stars  are  found  in  the  Milky  Way  disk;  some  are  seen  even  in 
the  earliest  stages  of  formation  and  they  are  all  less  than  10  billion  years  old.  These 
stars  are  designated  Population  I  stars.  In  addition  to  their  cosmic  youth,  they 
contain  a  relatively  high  abundance  of  the  heavier  elements,  commonly  called  the 
metals.  The  Sun  is  a  Population  I  star;  the  open  star  clusters  found  in  the  Milky 
Way  contain  Population  I  stars. 

The  Population  II  stars  are  found  mainly  outside  the  Milky  Way,  in  globular 
star  clusters.  They  include  the  oldest  known  stars  of  up  to  almost  14  billion  years 
old,  and  these  stars  have  a  relatively  low  abundance  of  elements  heavier  than 
hydrogen  or  helium.  A  spherical  aggregation  of  Population  II,  metal-poor  stars 
also  is  found  near  the  center  of  the  Milky  Way.  It  is  mainly  closer  to  the  galactic 
center  than  the  Sun.  Ivezic  et  al.  (2012)  have  reviewed  our  recent  knowledge  of 
galactic  stellar  populations.  The  first  stars  to  be  formed  in  the  observable  universe 
have  been  designated  Population  III ;  Heger  (2012)  has  provided  a  review  of  metal 
enrichment  by  Population  III. 

Freeman  and  Bland-Hawthom  (2002)  have  reviewed  the  formation  of  our 
Galaxy;  Putman  et  al.  (2012)  summarized  our  knowledge  of  the  gaseous  galactic 
halo;  and  Van  den  Berg  et  al.  (1996)  have  reviewed  estimates  for  the  age  of  the 
galactic  globular  cluster  system. 

Physical  properties  of  this  galactic  spheroid  are  listed  in  Table  14.2. 


Table  14.2  Physical  properties  of  the  globular  cluster  spheroid 

Rgs  =  radius  of  spheroid  >  130,000  light-years  =  40  kpc  %  1.2  x  1021  m 
Mgs  =  mass  of  spheroid  =  (2-10)  x  109  MQ  w  (4-20)  x  1039  kg 
pgs  =  mass  density  of  spheroid  «  0.000  26  M0  pc-3  ^  1.9  x  10-23  kg  m-3 
LBgs  =  luminosity  of  spheroid,  blue  band  =  (1-2)  x  109  L0  %  (4-8)  x  1035  J  s-1 
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14.1.3  The  Rotating  Galactic  Disk 

As  suggested  by  the  German  philosopher  Immanuel  Kant  (1724-1804)  near  the 
end  of  the  eighteenth  century,  the  flattened  shape  of  the  Milky  Way  can  be 
attributed  to  its  formation  from  a  large,  collapsing,  rotating  nebula,  much  like  the 
origin  of  our  solar  system  from  a  considerably  smaller  nebula  (Kant  1755). 
Observations  of  the  motions  of  nearby  stars  and  interstellar  gas  in  the  galactic  disk 
indicate  that  the  entire  stellar  system  indeed  is  rotating  around  a  remote  axis  that 
pierces  the  center  of  the  Milky  Way.  The  enormous  mass  at  this  central  hub  steers 
stars  into  circular  orbits  with  an  orbital  speed  that  decreases  with  increasing  dis¬ 
tance  from  the  center  -  all  in  accordance  with  Kepler’s  third  law. 

It  is  rotation  that  has  flattened  the  Milky  Way.  Observations  of  the  motions  of 
nearby  stars  and  interstellar  gas  indicate  that  the  entire  system  is  whirling  about  a 
remote,  massive  center.  The  enormous  mass  at  this  central  hub  steers  the  stars  into 
circular  orbits  with  an  orbital  speed  that  decreases  with  increasing  distance  from 
the  center,  at  least  as  far  as  the  Sun,  all  in  accordance  with  Kepler’s  third  law 
(Lindblad  1925). 

Stars  in  orbits  inside  the  solar  orbit  travel  faster  than  the  Sun,  thereby  forging 
ahead  of  it,  whereas  the  stars  moving  in  orbits  outside  the  Sun’s  orbit  are  falling 
behind.  When  viewed  from  the  Earth,  nearby  stars  that  are  a  little  closer  than  the 
Sun  to  the  center  therefore  seem  to  move  in  one  direction,  whereas  those  a  little 
farther  away  appear  to  move  in  the  opposite  direction,  in  two  star  streams 
(Kapteyn  1905;  Joy  1939). 

We  can  measure  the  radial,  or  line-of-sight,  component  of  a  star’s  motion  by 
observing  the  Doppler  shift  in  the  wavelength  of  its  spectral  lines.  Moreover,  radio 
astronomers  can  use  the  same  Doppler  effect  with  the  spectral  line  of  interstellar 
hydrogen  atoms,  emitted  at  a  wavelength  of  21  cm,  to  trace  out  the  motions  of 
interstellar  gas.  Both  techniques  indicate  that  the  Sun  and  nearby  gas  and  stars  are 
revolving  about  the  distant  center  of  the  Milky  Way  at  a  speed  of 
V©  «  220  km  s"1. 

The  period  P&  for  one  rotation  of  the  Sun  around  the  center  is  PQ  =  2tlR0/Vq 
«  7.6  x  1015  s  «  2.4  x  108  years,  where  1  year  =  3.1557  x  107  sec  and 
R0  =  Dq  =  8.5  kpc  =  2.6  x  1020  m  =  27,700  light-years. 

The  stars  are  moving  with  speeds  that  are  greater  than  those  of  the  planets  that 
orbit  them.  The  orbital  speed  of  the  Earth  around  the  Sun,  for  example,  is  about 
30  km  s'1  and  that  of  Mercury  is  just  48  km  s-1.  The  orbiting  planets  accompany 
their  planet  star  in  its  faster  motion  through  space. 

Stars  and  interstellar  gas  rotate  differentially,  revolving  about  the  galactic 
center  in  independent  orbits  at  speeds  that  vary  with  distance.  Stars  that  are  nearer 
to  the  center  of  the  Milky  Way  than  the  Sun  revolve  about  the  center  at  faster 
speeds  and  take  less  time  to  circle  it.  Astronomers  describe  this  differential  orbital 
motion  by  specifying  Oort’s  constants  (Focus  14.2). 
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Focus  14.2  Differential  rotation  of  the  Milky  Way 

The  Dutch  astronomer  Jan  Oort  (1900-1992)  provided  observational 
evidence  for  the  differential  rotation  of  stars  and  interstellar  matter  in  the 
Milky  Way,  and  described  their  circular  motion  about  a  distant  galactic 
center  (Oort  1927,  1928).  The  Doppler  shift  velocity  along  the  line-of-sight, 
or  the  observed  radial  velocity,  Vr,  of  a  galactic  object  at  the  distance,  R , 
from  the  galactic  center  is  given  by: 


Vr  =  Ro[co(R)  —  m(7?0)]sin  /, 


(14.3) 


where  R0  is  the  distance  of  the  Sun  from  the  galactic  center,  co(R)  is  the 
circular  velocity  of  the  Milky  Way  at  R ,  and  /  denotes  the  galactic  longitude 
(Fig.  14.4).  The  observed  proper  motion  //  of  the  celestial  object  in  galactic 
longitude  and  at  distance  D  less  than  R  or  R0  is: 


1 


4.74 


1  (dco\  1  fdco 

”  2Ro { dRj R=Rocos  ~2Ro V Jr 7  R=Ro 


—  oj(Ro) 


Vl 

D  ’ 
(14.4) 


where  the  velocity  Vj_  transverse  to  the  line  of  sight  is  in  units  of  km  s_1,  the 
distance  is  in  parsecs  and  the  proper  motion  is  in  seconds  of  arc  (Sect.  4.2). 
Oort’s  constants,  A  and  B ,  are  given  by: 
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=  —12.7  dz  0.64  km  s  1  kpc  1 , 


where  the  numerical  values  are  derived  from  HIP  PARC  OS  observations 
(Feast  and  Whitelock  1997),  which  also  give  the  distance  of  the  Sun  from  the 
galactic  center,  R0 ,  as 


Ro  =  8.5  ±  0.5  kpc  «  2.6  x  1020  m  ~  27,700  light-years,  (14.7) 
and  a  rotation  velocity  of: 


V0  =  R0{A  -  B)  «  234  km  s-1.  (14.8) 

Blauuw  et  al.  (1960)  reviewed  the  IAU  system  of  galactic  coordinates. 
In  1986,  the  International  Astronomical  Union  adopted  the  standard  values 
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of  R0  =  8.5  kpc  and  a  rotational  velocity  of  the  Sun  about  the  galactic 
center,  V0,  given  by 


V0  =R0(A  -B)  «220kms-1,  (14.9) 

(Kerr  and  Lynden-Bell  1986),  which  implies  A  —  B  =  25.9  km  s-1  kpc-1, 
not  quite  in  accord  with  the  HIPPARCOS  result  that  would  give 
Vo  =  231  km  s-1  at  R0  =  8.5  kpc. 

Oort’s  constants  can  be  used  to  determine: 

Vr  = -2A(R  -  R0)sin  l  forR-R0^:Ro  (14.10) 

and 


1 

4774 


B  +  Acos  2/]. 


(14.11) 


For  a  nearby  object  in  the  galactic  plane  at  a  distance  D  of  less  than  one 
kiloparsec  from  the  Sun,  the  radial  component  of  velocity  due  to  differential 
galactic  rotation  is 


Vr=  AD  sin2 /, 

and  its  transverse  velocity  due  to  differential  galactic  rotation  is: 

V±  =  D [Acos (2/)  +  B  . 


(14.12) 

(14.13) 


Sun 


Galactic  Center 


Fig.  14.4  Differential  rotation  of  the  Milky  Way  The  Sun  rotates  about  the  galactic  center, 
located  at  a  distance  Rq  of  about  8.5  kpc  or  27,700  light-years  and  a  speed  of  about  220  km  s-1. 
Another  star  that  lies  closer  to  the  center  revolves  about  it  at  a  distance  R  and  a  different,  faster 
speed.  This  diagram  provides  the  geometry  for  deriving  Oort’s  constants  A  =  15  km  s-1  kpc-1 
and  B  =  — 10  km  s-1  kpc-1  that  describe  this  differential  rotation  about  the  center  of  our  galaxy 
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We  can  use  Kepler’s  third  law  with  measurements  of  the  Sun’s  distance  from 
and  velocity  about  the  remote  center  of  the  Milky  Way  to  infer  the  mass  that 
gravitationally  controls  solar  motion.  That  is,  the  galactic  disk  within  the  orbit  of 
the  Sun  has  a  central  mass,  Mcdisk,  of: 


Mcdisk 
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1.9  x  1041  kg 


(14.14) 


_  11  Q  _  1  _ O 

where  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  ,  Pq  «  7.6  x 
1015  s,  Vq  =  V0  «  220  km  s_1,  and  D0  =  R0  «  2.6  x  1020  m.  That  is  equiva¬ 
lent  to  about  100  billion  solar  masses,  or  1011  M0,  where  the  mass  of  the  Sun  is 

on 

Mq  =  1.989  x  10“  kg.  So  there  is  a  mass  equivalent  to  about  100  billion  stars 
like  the  Sun  within  the  Sun’s  galactic  orbit.  Fish  and  Tremaine  (1991)  have  pro¬ 
vided  a  review  of  the  mass  of  the  Galaxy. 

The  luminosity  of  our  galactic  disk,  LBdisk,  in  blue  light  is  about  25  billion  times 
that  of  the  Sun,  or  LBdisk  =  2.5  x  1010  LBQ ,  where  the  blue  luminosity  of  the  Sun 
is  Lbq  =  3.0  x  1026  J  s_1.  These  stars  shine  with  the  light  of  25  billion  Suns. 
Their  combined  absolute  magnitude  is  MBdisk  =  —20.5,  and  their  mass  to  lumi¬ 
nosity  ratio,  Mdisk/Ldish  «  4  M0/L0. 


14.1.4  Whirling  Coils  of  the  Milky  Way 

The  stars  do  not  reside  in  a  uniform  whirling  disk.  They  instead  are  concentrated  into 
arms  that  coil  out  from  the  center  of  the  Milky  Way,  giving  our  stellar  system  a  spiral 
shape.  These  features  are  delineated  by  relatively  young,  very  luminous,  and  mas¬ 
sive  stars  (Morgan  et  al.  1952;  Georgelin  and  Georgelin  1976;  Paladini  et  al.  2004), 
which  light  up  the  nearby  arms  (Fig.  14.5).  They  coincide  with  the  well-known 
emission  nebulae,  or  H II  regions,  which  are  less  than  a  few  million  years  old  and  at 
least  a  thousand  times  younger  than  the  oldest  stars  in  the  Milky  Way.  This  suggests 
that  recent  star  formation  takes  place  in  the  spiral  arms  of  the  Milky  Way. 

Because  the  Sun  is  embedded  in  one  of  the  arms,  astronomers  must  look 
through  that  arm  to  see  the  rest  of  the  Milky  Way.  This  obscures  their  distant 
vision,  hiding  most  of  our  stellar  system  from  view  in  optically  visible  light. 
However,  radio  waves  pass  unimpeded  through  the  obscuring  material,  permitting 
the  detection  of  most  of  the  Milky  Way.  This  is  because  long  radio  waves  are  not 
absorbed  by  the  relatively  small  particles  of  interstellar  dust. 

By  observing  the  radio  emission  of  interstellar  hydrogen  atoms  at  a  wavelength 
of  21  cm,  radio  astronomers  constructed  a  face-on  view  of  the  Milky  Way,  which 
we  might  see  if  we  were  transported  into  distant  space  and  looked  down  on  the 
plane  of  the  Milky  Way  from  above  (Fig.  14.6).  They  delineated  extensive,  arm¬ 
like  concentrations  that  extend  out  from  the  short  segments  defined  by  young 
massive  stars  in  the  vicinity  of  the  Sun  (Oort  et  al.  1958). 
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Fig.  14.5  Spiral  arms  of  the  Milky  Way  from  H  II  regions  Luminous  emission  nebulae, 
known  as  H  II  regions,  act  like  beacons  that  mark  out  the  spiral  structure  of  the  Milky  Way.  The 
H  II  regions  have  lifetimes  of  just  a  few  million  years,  which  is  thousands  of  times  less  than  the 
ages  of  the  oldest  stars  in  our  Milky  Way.  This  suggests  that  stars  are  now  formed  in  the  spiral 
arms  of  the  Milky  Way.  The  center  of  both  diagrams  coincides  with  the  center  of  the  Milky  Way, 
labeled  as  the  galactic  center  or  GC,  and  the  galactic  longitude  is  indicated  along  the  figure  edges 
with  180°  at  center  top  and  0°  at  center  bottom.  The  linear  scales,  shown  in  the  lower  right  of 
each  diagram,  are  set  at  5  kpc  «  16,000  light  years  {right)  and  4  kpc  %  13,000  light-years  {left). 
The  Sun  is  located  at  the  upper  center  of  both  diagrams,  and  H  II  regions  are  denoted  by  filled 
circles  and  squares  {right)  and  diamonds  and  triangles  for  550  objects  {left).  [Adapted  from 
{right)  Y.  M.  Georgelin  and  V.  P.  Georgelin,  “The  spiral  structure  of  our  Galaxy  determined  from 
H  II  regions,”  Astronomy  and  Astrophysics  49,  57-69  (1976)  and  {left)  R.  Paladini  R.  D.  Davies 
and  G.  DeZotti,  “Spatial  Distribution  of  Galactic  H  II  regions,”  Monthly  Notices  of  the  Royal 
Astronomical  Society  347,  237-245  (2004).] 

The  Sun  has  circled  the  center  of  the  Milky  Way  more  than  19  times  during  the 
Sun’s  4.6-billion-year  lifetime.  So,  the  spiral  arms  should  have  wrapped  around  the 
massive  center  many  times  during  the  lifetime  of  the  Sun.  A  persistent  dilemma 
has  been  why  they  haven’t  wound  up  forming  a  featureless  ball  of  gas,  dust  and 
stars.  The  explanation  seems  to  be  density  waves  that  control  the  concentrations  of 
stellar  and  interstellar  material  (Lin  and  Shu  1967). 

The  wave  pattern  orbits  the  galactic  center  at  a  steady  rate  and  does  not  wind 
up;  it  moves  independently  of  the  motions  of  individual  stars,  which  follow  their 
own  orbit  around  the  center.  The  spiral  arms  are  places  where  the  interstellar 
material  and  stars  linger  -  like  traffic  at  a  stoplight  -  and  they  mark  the  loca¬ 
tions  where  new  stars  tend  to  form  and  hot,  massive,  luminous,  young  stars  are 
found. 
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Fig.  14.6  Structure  of  our 
stellar  system  This  drawing 
depicts  our  Milky  Way  as 
viewed  from  above  its  plane. 
The  stars  and  interstellar 
material  are  concentrated 
within  spiral  arms.  The  Sun 
lies  within  one  of  these  spiral 
arms  at  a  distance  of  27,700 
light-years,  from  the  center, 
designated  here  as  8,500  pc, 
or  8.5  kpc.  This  distance  is 
1.75  billion  times  the  distance 
between  the  Earth  and  the 
Sun 
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14.1.5  A  Central  Super-Massive  Black  Hole 

Radio  astronomers  have  looked  right  through  interstellar  dust  and  detected  an 
exceptionally  powerful  and  compact  radio  source  at  the  center  of  the  Milky  Way. 
It  is  in  the  direction  of  the  constellation  Sagittarius,  and  therefore  been  named 
Sagittarius  A*  (abbreviated  Sgr  A*  and  pronounced  “Sag  A”  star). 

Radio  interferometer  measurements  with  very  long  baselines  (VLB  I)  show  that 
Sgr  A*  is  smaller  than  our  planetary  system,  with  a  radius  of  about  half  the 
distance  between  the  Earth  and  the  Sun,  which  is  1  AU  =  1.496  x  1011  m 
(Doeleman  et  al.  2008;  Reynolds  2008).  It  seems  likely  that  an  exceptionally 
massive  black  hole  is  energizing  the  extremely  bright,  compact  radio  source. 

Because  such  a  black  hole  is  very  massive,  dense,  and  compact  on  a  cosmic 
scale,  its  formidable  gravity  can  dominate  a  star’s  motion  if  it  is  close  enough.  The 
super-massive  black  hole  guides  nearby  stars  into  rapid  orbital  motion,  betraying 
its  presence.  These  stars  can  be  seen  at  infrared  wavelengths  that  also  penetrate 
interstellar  dust. 

Astronomers  have  used  large  visible-light  telescopes  in  Chile  and  Hawaii  to 
detect  individual  stars  in  the  infrared,  although  they  shine  too  faintly  at  radio 
wavelengths  to  be  observed  with  radio  telescopes.  By  watching  the  motions  of 
infrared  stars  that  are  near  the  center  of  the  Milky  Way  and  orbit  it,  a  central  super- 
massive  black  hole  has  been  found. 

Melia  and  Falcke  (2001)  reviewed  evidence,  available  at  the  time,  for  a  super- 
massive  black  hole  at  the  galactic  center,  whereas  Genzel  and  Townes  (1987) 
provided  an  earlier  review  of  the  center  of  our  Galaxy. 

After  an  unprecedented  study  lasting  more  than  a  decade,  researchers  were  able 
to  track  the  full  revolution  of  one  infrared  star,  designated  S2,  around  the  invisible 
black  hole.  This  star  moves  within  17  light-hours  of  the  unseen  center,  with  speeds 
of  up  to  5,000  km  s-1  (Ghez  et  al.  2005,  2008).  These  orbital  parameters  imply 
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that  the  mass  of  the  central  black  hole  is  a  colossal  4  million  times  the  mass  of  the 
Sun  -  that  is,  4  million  invisible  solar  masses  not  shining  but  rather  gravitationally 
confining  the  observed  stellar  orbit. 


Example:  Super-massive  black  hole  at  the  galactic  center 

After  more  than  a  decade  of  observations,  Andrea  Ghez  (1965-  )  and  her 
colleagues  reported  measurements  from  the  W.M.  Keck  10  m  telescopes 
describing  the  elliptical  orbit  of  the  star  SO-2  about  the  galactic  center  (Ghez 
et  al.  2005,  2008).  Its  orbit  passes  within  a  distance  of  D  =  17  light- 
hours  =  1.83  x  10  m  of  the  black  hole,  at  which  time  it  accelerates  to  a 

z  _ 1 

speed  of  V  =  5  x  10  m  s  .  If  we  assume  that  this  velocity  is  equal  to  the 
escape  velocity,  Vesc,  of  the  black  hole,  of  mass,  MBH ,  controlling  the  orbit  at 
that  time,  then  we  can  estimate  the  mass  from 

DV2 

Mbw  =  ^g£’  (14-15) 

_  1  1  Q  _  1  _ n 

where  the  gravitational  constant  G  =  6.674  x  10“  ■ 11  m  kg  s“  .  We 
obtain  MBH  «  3.4  x  1036  kg  «  1.7  x  106  M0  where  the  Sun’s  mass 
Mq  =  1.989  x  1030  kg. 

Detailed  comparison  with  the  stellar  orbit  indicate  that 
Mbh—  (4.1  db  0.6)  x  10  Mq.  Using  this  mass,  we  can  determine  the 
Schwarzschild  radius,  Rs ,  of  the  black  hole  from: 

Rs  =  2GAfg//  pa  1.21  x  1010m  »  0.08  AU,  (14.16) 

O  _  1 

where  the  speed  of  light  c  =  2.9979  x  10  ms  and  the  mean  distance 
between  the  Earth  and  the  Sun  is  1  AU  =  1 .496  x  10 1 1  m.  VLBI  observations 
at  1.3  mm  wavelength  indicate  that  the  central  radio  source  Sagittarius  A* 
subtends  an  angle,  6,  of  6  «  4  x  10  5  s  of  arc  2x10  10  radians,  where 
1  radian  =  2.063  x  105  s  of  arc  (Doeleman  et  al.  2008).  The  linear  radius 
R  corresponding  to  this  angular  size  is  R  =  6D  «  1.7  x  10-6 
pc  «  5  x  10lom  «  0.34  AU  at  the  D  &  8.5  kpc  distance  of  the  galactic 
center.  That  size  is  about  4  times  the  gravitational  radius  of  4  million  solar 
masses. 


Where  did  this  super-massive  black  hole  come  from  and  why  is  it  located  at  the 
center  of  the  Milky  Way?  Its  formation  probably  coincided  with  the  origin  of  our 
stellar  system,  by  the  collapse  of  a  huge  rotating  mass  with  a  nucleus  at  the  center 
and  the  flattened  Milky  Way  spinning  around  it.  The  globular  star  clusters  prob¬ 
ably  date  back  to  the  beginning  of  the  collapse,  about  14  billion  years  ago. 

The  super-massive  black  hole  may  have  originated  at  the  central  nucleus, 
perhaps  as  the  result  of  the  gravitational  collapse  of  an  exceptionally  massive 
cloud  of  gas  located  there.  Alternatively,  it  may  have  grown  by  the  coalescence  of 
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smaller  stellar  black  holes,  each  formed  at  the  end  of  the  lifetime  of  the  first 
massive  stars.  The  central  black  hole  would  have  continued  to  gather  in  nearby, 
smaller  black  holes  and  surrounding  stars  and  gas,  with  an  active  youth  and  a  more 
sedate  old  age. 


14.1.6  Dark  Matter  Envelops  the  Milky  Way 

As  the  result  of  differential  rotation,  stars  and  gas  near  the  Sun  should  revolve 
about  the  galactic  center  at  faster  speeds  than  those  at  greater  distances  from  the 
center.  However,  the  stars  and  gas  observed  near  the  apparent  edges  of  the  Milky 
Way  rotate  at  speeds  that  do  not  decrease  with  distance.  This  means  that  the  Milky 
Way  does  not  end  where  the  light  does,  and  that  there  are  appreciable  amounts  of 
dark  unseen  matter  well  outside  the  boundary  of  the  visible  Milky  Way.  That  dark 
invisible  matter  keeps  the  fast-spinning  visible  material  connected  to  our  stellar 
system. 

The  mass  of  the  Milky  Way  within  the  Sun’s  orbit  around  the  center  of  the 
Milky  Way  is  roughly  100  billion,  or  1011,  Suns.  However,  the  rapid  motions  of 
dwarf  satellite  collections  of  stars,  which  revolve  about  the  Milky  Way  at  distances 
of  up  to  1  million  light-years,  indicate  that  a  great  reservoir  of  unseen  matter 
envelops  the  observed  disk  of  stars.  A  total  mass  of  roughly  1  trillion,  or  10  , 
times  the  mass  of  the  Sun  and  about  10  times  the  mass  of  its  visible  stars,  is 
required  to  hold  onto  these  dwarf  systems.  This  invisible,  massive,  outer  region  is 
known  as  the  dark  halo.  It  surrounds  the  Milky  Way  and  outweighs  it  by  a  factor 
of  about  10. 

Example:  Mass  of  the  dark  halo 

We  can  infer  the  total  mass,  MG,  of  our  stellar  system  under  the  assumption 
that  distant,  small  companions  are  gravitationally  bound  to  it.  (Here  we  use 
the  subscript  G  to  denote  our  Galaxy,  which  is  a  term  that  is  introduced  later 
in  the  book.)  The  dwarf  spheroid  Leo  I  is,  for  example,  located  at  a  distance 
of  D  =  230  kpc,  where  1  kpc  =  3.0857  x  1019  m,  and  moves  with  a  radial 
velocity  Vr  —  111  km  s-1  =  1.77  x  105  m  s_1.  This  velocity  must  be  less 
than  or  equal  to  the  escape  velocity,  Vesc,  of  our  stellar  system  at  this 
distance,  so 


(14.17) 


or 


(14.18) 


14.1  The  Milky  Way 


487 


_ 11  o  _ i  _ o 

where  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  .  Inserting 
the  distance  and  radial  velocity  of  Leo  I  into  this  equation,  we  obtain 
Mg  >  1.67  x  1042  kg  >  0.8  x  1012  M0  where  A/0  =  1.989  x  1030  kg. 
That  is  about  20  times  the  total  mass  of  stars  in  the  Milky  Way.  The  mass-to- 
luminosity  ratio  is  about  one  hundred  times  that  of  stars  like  the  Sun,  or 
Mg/Lg  «  100  Mq/Lq  ,  indicating  that  most  of  the  mass  is  dark  and  unseen 
(Zaritsky  et  al.  1989;  Kulessa  and  Lynden-Bell  1992;  Peebles  1995). 


Thus,  our  stellar  system  is  held  together  by  the  gravity  of  dark  matter ,  which  is 
beyond  the  range  of  our  vision.  A  similar  darkness  pervades  and  controls  much  of 
the  universe,  giving  off  neither  light  nor  any  other  radiation  to  let  us  know  it  is 
there.  Dark  matter  is  studied  by  its  gravitational  influence  on  the  motions  of  the 
stars  that  we  can  see. 

Moreover,  even  the  observable  universe,  the  part  we  can  see  directly,  is  not 
limited  to  our  stellar  system,  the  Milky  Way,  but  instead  is  populated  by  more  than 
100  billion  galaxies,  each  composed  of  about  100  billion  stars  and  perhaps  con¬ 
taining  10  times  as  much  mass  in  unseen  dark  matter.  These  galaxies  stretch  as  far 
as  the  largest  telescope  can  see  -  and  perhaps  beyond. 


14.2  The  Discovery  of  Galaxies 

Long  before  the  discovery  of  dark  matter,  Edwin  Hubble  (1889-1953),  showed 
that  the  Milky  Way  does  not  contain  everything  there  is,  and  settled  an  ongoing 
controversy  about  the  nature  of  spiral  nebulae.  The  issue  was  presented  during  the 
now-famous  Shapley-Curtis  debate  over  “The  Scale  of  the  Universe”  during  a 
meeting  of  the  National  Academy  of  Sciences  on  26  April  1920  at  the  Smithsonian 
Institution  in  Washington,  DC  (Shapley  and  Curtis  1921).  Harlow  Shapley 
(1885-1972),  of  the  Harvard  College  Observatory,  defended  his  novel  conception 
of  a  much  larger  Milky  Way  than  previously  had  been  supposed,  with  a  distant 
center  and  the  Sun  at  its  periphery,  but  he  supposed  that  the  spiral  nebulae  are 
embedded  in  the  Milky  Way.  In  contrast,  Heber  D.  Curtis  (1872-1942),  of  the  Lick 
Observatory,  attempted  to  defend  a  smaller  Sun-centered  stellar  system  but  pro¬ 
vided  cogent  arguments  that  the  spiral  nebulae  are  distant  stellar  systems  located 
far  beyond  the  Milky  Way  (Curtis  1919).  Shapley  was  correct  about  the  shape  and 
size  of  the  Milky  Way,  and  Curtis  was  correct  in  supposing  that  the  spiral  nebulae 
are  distant  “island  universes”  composed  of  numerous  stars. 

The  argument  over  the  location  of  the  spiral  nebulae  was  finally  and  definitely 
resolved  when  Hubble  used  the  2.5  m  (100  inch)  Hooker  telescope  on  Mount 
Wilson  to  photograph  the  spiral  nebula  Andromeda,  or  M  31  (Fig.  14.7),  night 
after  night.  He  compared  hundreds  of  photographs  to  find  Cepheid  variable  stars 
whose  brightness  waxed  and  waned  like  clockwork  in  a  period  of  several  days. 
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Fig.  14.7  The  Andromeda  Nebula  The  nearest  spiral  galaxy,  the  Andromeda  Nebula,  also 
known  as  M  3 1  and  NGC  224,  is  located  at  a  distance  of  about  800  kpc  or  2.6  million  light-years, 
so  its  light  takes  about  2.6  million  years  to  reach  us.  Both  the  Andromeda  Nebula  and  our  Galaxy 

i  o 

are  spiral  galaxies  with  total  masses  of  about  1  million  million,  or  10  ,  solar  masses,  and  roughly 
100  billion,  or  1011,  optically  visible  stars.  The  several  distinct  stars  surrounding  the  diffuse  light 
from  Andromeda  are  stars  within  our  own  Galaxy;  these  stars  lie  well  in  front  of  Andromeda. 
Two  smaller  galaxies  also  are  shown  in  this  image:  M  32,  also  designated  NGC  221,  at  the  edge 
of  the  Andromeda  Nebula,  and  NGC  205,  that  is  located  somewhat  farther  away.  These  are 
elliptical  systems  at  about  the  same  distance  as  M  31  but  with  only  about  1/ 100th  of  its  mass. 
(Courtesy  of  Karl-Schwarzschild  Observatorium,  Tautenburg.) 

On  New  Year’s  Day  1925,  Hubble’s  results  were  read  in  absentia  at  a  meeting  of 
the  American  Astronomical  Society  in  Washington,  DC  and  caused  an  over¬ 
whelming  sensation.  The  landmark  paper,  titled  Cepheids  in  Spiral  Nebulae 
(Hubble  1925),  combined  the  known  period-luminosity  relation  of  Cepheids  with 
observations  of  the  variable  stars  in  M  31  and  M  33,  another  spiral  nebula,  to 
derive  a  distance  of  0.275  Mpc  =  275  kpc  for  the  two  spiral  nebulae,  where 
1  Mpc  =  3,260,000  light-years  =  1,000  kpc  =  3.09857  x  1022  m.  Their  size  or 
linear  extent,  determined  from  this  distance  and  their  angular  extents,  was  roughly 
comparable  to  that  of  our  Milky  Way.  They  had  to  be  remote  objects  ablaze  with 
stars,  galaxies  in  their  own  right  and  separated  from  the  Milky  Way  by  wide  gulfs 
of  apparently  empty  space. 

It  took  so  long  to  establish  the  true  nature  of  the  spiral  nebulae  because  the 
method  of  establishing  the  distances  of  the  remote,  luminous  Cepheid  variable 
stars  needed  to  be  developed,  and  a  large  powerful  telescope  was  needed  to  detect 
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the  stars  in  spiral  nebulae  and  collect  enough  of  their  faint  starlight  for  a  reliable 
measurement  of  their  periodic  brightness  variation  and,  therefore,  distance. 

The  notable  Estonian  astronomer  Ernst  Opik  (1893-1985)  had  already  con¬ 
tributed  to  the  notion  that  spiral  nebulae  lie  outside  the  confines  of  the  Milky  Way 
(Opik  1922).  He  used  F.  G.  Pease’s  (1881-1938)  measurements  of  the  rotation 
velocities  of  M  3 1  to  show  that  its  distance  has  to  be  about  0.480  Mpc  if  its  mass  to 
luminosity  ratio  is  comparable  to  that  of  stars  in  our  the  Milky  Way.  Opik  was 
much  more  nearly  correct  than  Hubble  in  estimating  the  distance  to  Andromeda, 
whose  current  distance  estimate  is  about  0.788  Mpc.  As  Walter  Baade 
(1893-1960)  showed  nearly  three  decades  later,  the  period-luminosity  relation  that 
Hubble  used  had  been  incorrectly  calibrated. 


_  •  •  _ 

Example:  Opik’s  calculation  of  the  distance  to  M  31 

If  a  star  in  the  Andromeda  spiral  nebula  is  located  at  a  distance  R  from  the 

center  of  the  nebula,  with  a  rotational  velocity  Vrot ,  then 


r2  _  2 GM  _  4 GM  _  AGID 2  (M\ 

rot  ~  r  ~  eb  ~  eb  \l ) ’ 


(14.19) 


where  M  is  the  mass  of  the  galaxy  within  radial  distance  R,  the  angular 
diameter  of  the  nebula  is  6 ,  the  distance  of  the  Andromeda  nebula  from  the 
Earth  is  D,  so  6  =  2 R/D,  and  its  apparent  luminosity  is  l  =  L/D2  for  an 
absolute  luminosity  L.  Collecting  terms,  we  can  specify  the  distance  to 
Andromeda  by 


D  = 


OV 2 

u  v  rot 

\Gl 


(14.20) 


By  assuming  the  mass  to  luminosity  ratio  M/L  was  the  same  as  that  of  the 
Sun,  Opik  (1922)  was  able  to  derive  a  distance  of  D  =  0.480  Mpc  using  the 
angular  diameter  and  rotational  velocity  of  the  Andromeda  Nebula  measured 
by  F.  G.  Pease  (1918),  and  the  apparent  magnitude  of  Andromeda,  at 
m  =  3.44  to  infer  the  apparent  luminosity. 


Once  the  observed  periodic  variations  of  the  Cepheid  variable  stars  was  cor¬ 
rectly  calibrated,  the  distance  to  Andromeda  turned  out  to  be  2.54  million  light- 
years,  about  three  times  farther  away  than  Hubble  initially  supposed  -  but  his 
dramatic  conclusion  remained  unchanged.  Hubble  broke  through  the  stars,  and 
moved  the  outer  boundaries  of  the  universe  far  out  into  space,  enlarging  our 
horizons.  The  universe  was  no  longer  limited  to  the  objects  our  unaided  eyes 
perceive,  and  our  stellar  system  had  become  just  one  of  myriad  galaxies  located  far 
beyond  the  Milky  Way  -  which  became  our  Galaxy,  written  with  an  uppercase  G 
to  show  that  it  is  special.  All  of  the  other  galaxies  were  shown  to  be  extragalactic, 
or  outside  of  our  Galaxy. 
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Radius  (thousands  of  light-years) 


Fig.  14.8  Dark  matter  envelops  a  spiral  galaxy  The  rotation  velocity  of  the  spinning  spiral 
galaxy  NGC  3198  plotted  as  a  function  of  radius  from  its  center  {bottom).  The  observed  neutral 
hydrogen  21  cm  data  are  attributed  to  an  optically  luminous  disk,  containing  all  the  visible  stars, 
and  a  dark  halo  that  contributes  most  of  the  mass  at  distant  regions  from  the  center.  The  visible 
stars  and  surrounding  halo  of  dark  matter  are  illustrated  in  a  hypothetical  drawing  of  a  spiral 
galaxy  seen  from  above  {top).  The  fact  that  the  rotational  speed  of  the  cool  hydrogen  gas  remains 
high  even  at  the  largest  distances  indicates  that  the  outermost  gas  must  be  constrained  and  held  in 
by  the  gravitational  pull  of  dark  matter  far  outside  the  visible  part  of  the  galaxy.  [Adapted  from  T. 
S.  Van  Albada  et  al.  “Distribution  of  dark  matter  in  the  spiral  galaxy  NGC  3198,”  Astrophysical 
Journal  295,  305  (1985).] 


Hubble  called  the  spirals  extragalactic  nebulae,  but  they  are  not  large  gaseous 
nebulae.  Hubble  (1926,  1936)  has  described  our  early  knowledge  of  extra-galactic 
nebulae,  now  known  as  the  galaxies. 

The  spectral  characteristics  of  the  light  from  the  spirals  are  similar  to  that  of  our 
Sun,  indicating  stellar  temperatures  of  thousands  of  K.  If  a  spiral  were  filled  with  gas 
at  this  temperature  throughout  its  enormous  dimensions,  the  nebula  would  be  more 
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than  1  billion,  billion  times  more  luminous  than  the  Sun,  rather  than  the  much  smaller 
luminosity  of  Andromeda  at  about  100  billion  times  the  solar  luminosity.  This  means 
that  light  is  not  coming  from  the  entire  surface  of  a  spiral  nebula  but  instead  from 
individual  stars,  separated  by  vast  spaces  without  any  stars. 

Therefore,  we  now  have  dropped  the  nebula  designation  and  use  the  term  spiral 
galaxy  or  elliptical  galaxy ,  depending  on  their  shape.  Each  galaxy  contains  about 
100  billion  stars,  just  like  our  Galaxy.  The  designation  nebula  now  is  reserved  for 
cloudy,  gaseous  material  enveloping  bright  stars.  An  exception  is  the  famous 
Andromeda  Nebula,  the  closest  spiral  galaxy. 

Like  our  Milky  Way  Galaxy,  the  other  galaxies  contain  about  ten  times  more 
dark  matter  then  optically  visible  matter  (Fig.  14.8).  Sofue  and  Rubin  (2001)  have 
reviewed  rotation  curves  of  spiral  galaxies;  references  to  individual  papers  about 
dark  matter  that  envelops  galaxies  are  given  in  Lang  (1999). 

When  looking  up  at  the  night  sky,  we  see  only  stars  and  the  black  spaces 
between  them.  The  galaxies  are  out  there,  but  we  cannot  see  them  without  a 
telescope.  They  are  so  far  away  that  their  brightness  is  below  the  detection 
threshold  of  the  human  eye.  This  meant  that  the  universe  was  no  longer  limited  to 
the  things  our  unaided  eyes  can  focus  on,  and  our  stellar  system  became  just  one  of 
myriads  of  galaxies. 

Blanton  and  Moustakas  (2009)  have  summarized  our  knowledge  of  the  physical 
properties  and  environments  of  nearby  galaxies.  Faber  and  Gallagher  (1979) 
provided  an  earlier  review  of  the  masses  and  mass-to-light  ratios  of  galaxies,  while 
Binggeli  et  al.  (1988)  reviewed  their  luminosity  function.  Binney  (1982)  has 
reviewed  the  dynamics  of  elliptical  galaxies.  Mateo  (1998)  provided  a  review  of 
dwarf  galaxies  in  the  Local  Group  of  galaxies.  Van  Der  Kruit  and  Freeman  (2011) 
have  reviewed  galaxy  disks. 

Some  properties  of  these  galaxies  are  listed  in  Table  14.3. 

The  light  we  receive  from  the  most  distant  galaxies  was  emitted  before  the 
Earth  and  the  Sun  were  formed.  Even  more  fantastic,  they  are  all  in  flight,  rushing 
away  from  us  at  speeds  that  increase  with  their  distance.  Nearly  a  decade  before 
Hubble’s  determination  of  the  distance  of  Andromeda,  Vesto  M.  Slipher 
(1875-1969)  already  had  helped  us  move  beyond  the  stars  in  an  entirely  unsus¬ 
pected  way. 


14.3  The  Galaxies  are  Moving  Away  from  us 
and  from  Each  Other 

At  the  time  of  their  discovery  in  enormous  numbers,  most  astronomers  thought 
that  the  spiral  nebulae  were  nascent  planetary  systems,  not  galaxies.  The  bright 
center  was  supposed  to  be  a  newborn  star,  and  the  spiral  arms  surrounding  it  were 
thought  to  be  developing  planets,  whirling  and  rotating  around  the  central  star  just 
as  the  Earth  revolves  around  the  Sun. 
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Table  14.3  Physical  properties  of  galaxies21 

Rg  =  radius  of  a  spiral  or  elliptical  galaxy  «  33  light-years  to  326  light-years  =  10  kpc  to 
100  kpc  «  (3-30)  x  1020  m 

Ms  =  mass  of  a  spiral  galaxy  «  (KW-IO12)  M0  «  (2-20)  x  1041  kg 
Me  =  mass  of  elliptical  galaxy  %  (1012-1013)  MQ  %  (2-20)  x  1042  kg 
Ms/Lb  =  mass  to  light  ratio  of  spiral  galaxy  within  radius  R  ~  60  h  (R/0.1  Mpc)  «  42 
(R/0.1  Mpc) 

Me/Lb  =  mass  to  light  ratio  of  elliptical  galaxy  within  radius  R  «  200  h  (R/0.1  Mpc)  «  140 
(R/0.1  Mpc) 

o  _ o 

Lb  =  mean  galaxy  luminosity  density  in  blue  band  =  1.93  x  10  h  LBQ  Mpc 
«  3  x  1034  J  s_1  Mpc-3 

O 

Lg  =  mean  galaxy  luminosity  density  =  (2-3)  x  10  h 
L@  Mpc-3  «  (5-8)  x  1034  J  s_1  Mpc-3 

Q  1  Q  C  _  1 

Lx  =  x-ray  luminosity  of  spiral  or  elliptical  galaxy  =  10'  -10  Js 
MSgas  —  mass  of  cool  gas  of  hydrogen  atoms  and  molecules  in  spiral  galaxy  109-1010 
M0  «  (2-20)  x  1039  kg 

M Egas  =  mass  of  hot  gas  in  elliptical  galaxy  «  109-1010  M0  w  (2-20)  x  103°  kg 

Ng  =  average  volume  density  of  galaxies  =  5.52  x  1 0  2  h3  Mpc-  1.89  x  1(T2  Mpc-3 

_ O/T  Q  _ Q  _ ry-j  _ Q 

—  mass  density  of  galaxies  0.37  x  10  kg  m  "  ^ j  1.5  x  10  kg  m  (for  visible 

12 

stars  and  dark  matter  with  a  galaxy  mass  of  about  10  solar  masses.) 
a  For  Hubble  constant  H0  =  100  h  km  s-1  Mpc-1  «  75  km  s_1  Mpc-1  with  h  %  0.75.  The 

op 

mass  of  the  Sun  is  M0  =  1.989  x  10  kg  and  the  total  luminosity  of  the  Sun  is 

O/T  I 

Lq  =  3.828  x  10  J  s  and  its  luminosity  in  the  blue  region  of  the  spectrum  is 
Lbq  =  1.9  x  1026  J  s”1 


Percival  Lowell  (1855-1916),  a  wealthy  Bostonian,  had  built  an  observatory  in 
Tucson,  Arizona,  primarily  to  detect  canals  on  Mars  supposedly  built  by  parched, 
industrious  Martians.  Lowell  also  believed  that  the  spiral  nebulae  resemble  our 
solar  system  in  its  early  formative  stages;  he  therefore  instructed  his  staff 
astronomer,  Vesto  M.  Slipher  (1895-1916),  to  measure  their  rotations,  hoping  to 
gather  insight  about  our  own  planetary  system. 

When  using  the  0.6  m  (24  inch)  refractor  at  the  Lowell  Observatory  in  Flag¬ 
staff,  Arizona,  to  record  the  spectra  of  bright  spiral  nebulae  and  measure  their 
rotations,  he  found  that  they  almost  unanimously  are  moving  away  from  us  at  high 
velocities.  They  were  also  rotating,  and  a  few  were  approaching  -  but  at  modest 
speeds  in  comparison  to  the  outward  motion  of  most  spirals.  The  Andromeda 
Nebula,  for  example,  was  moving  toward  the  Earth  at  an  apparent  velocity  of 
300  km  s'1  (Slipher  1914).  However,  the  other  bright  spirals  were  moving  in  the 
opposite  direction,  usually  with  higher  velocities  of  up  to  1,100  km  s  \  much 
faster  than  any  star  in  the  Milky  Way  (Slipher  1917). 

By  1917,  Slipher  had  accumulated  spectra  of  25  spiral  nebulae,  using  the 
Doppler  effect  to  measure  their  radial  velocities,  and  he  showed  that  none  of  them 
are  at  rest  (Slipher  1917).  A11  but  three  were  rushing  away  from  us  and  from  each 
other,  dispersing,  moving  apart,  and  occupying  an  ever-increasing  volume. 

According  to  Slipher,  the  observed  motions  of  the  majority  of  spirals  indicated 
a  general  fleeing  from  the  Milky  Way  or  us.  It  certainly  was  difficult  to  believe  that 
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objects  with  such  enormous  speeds  could  long  remain  a  part  of  our  stellar  system. 
The  combined  gravitational  pull  of  the  entire  100  billion  stars  in  the  Milky  Way  is 
not  enough  to  retain  any  spiral  nebula  moving  at  speeds  in  excess  of  1,000  km  s_1. 


Example:  Escape  velocity  of  the  Milky  Way 

The  escape  velocity  Vesc  =  ( 2GM/R j1^2  of  the  stellar  Milky  Way  can  be 
determined  by  using  R  =  R0  =  8.5  kpc  «  2.6  x  10  m,  the  distance  of  the 
Sun  from  the  center  of  the  Milky  Way,  and  M  —  10 1 1 M0  where  the  Sun’ s  mass 

OA 

Mq  =  1.989  x  10  kg,  as  inferred  from  the  Sun’s  orbital  velocity 
around  the  center.  The  gravitational  constant  G  =  6.674  x 

_  11  O  _  1  _ A 

10  m  kg  s  .  The  inferred  escape  velocity  of  the  stellar  Milky  Way  is 
about  Vesc  «  3.2  x  105  m  s”1  «  320  km  s_1.  The  recession  velocities 
measured  by  Vesto  Slipher  for  spiral  nebulae  exceeded  this  escape  velocity  by 
up  to  four  times.  When  the  dark  matter  that  envelops  the  Milky  Way  is  taken 

1  A 

into  account,  the  total  mass  increases  to  M  ^  10  “  M0  and  the  escape  velocity 
to  about  1,000  km  s-1,  but  most  spiral  nebulae  were  soon  found  to  have 
recessional  velocities  exceeding  this  amount. 


By  1929,  Hubble  showed  that  the  measured  distances  of  spirals,  which  he  had 
established  using  the  superb  light-gathering  power  of  the  2.5  m  (100  inch)  Hooker 
telescope,  were  roughly  correlated  with  Slipher’ s  velocities  (Hubble  1929).  The 
comparison  indicated  that  the  farther  a  spiral  is  the  faster  it  is  moving  away  from  us. 
This  relationship  now  is  attributed  to  the  expanding  universe,  which  no  one  had 
anticipated  at  the  time  Slipher  made  his  measurements. 

In  his  publication  of  these  results,  titled  A  Relation  between  Distance  and 
Radial  Velocity  Among  Extra-Galactic  Nebulae ,  Hubble  drew  a  straight  line 
through  a  plot  of  the  observed  data.  However,  there  was  a  wide  dispersion  between 
the  plotted  points  and  only  a  mild  tendency  for  velocity  to  increase  with  distance 
(Fig.  14.9).  Nevertheless,  his  conclusion  subsequently  was  confirmed  by  more 
comprehensive  observations  of  a  much  greater  number  of  galaxies. 

The  discovery  of  the  expanding  universe  also  explained  why  the  night  sky  is 
dark,  resolving  Olbers’  paradox,  named  for  the  German  astronomer  Heinrich 
Wilhelm  Olbers  (1758-1840).  He  realized  that  the  night  sky  in  an  infinite,  uni¬ 
form,  non-expanding  universe  should  be  covered  with  stars  shining  as  bright  as  the 
Sun  (Jaki  1969).  The  expansion  of  the  universe  redshifts  the  most  intense  light  of 
distant  galaxies,  and  their  stars,  out  of  the  visible  part  of  the  spectrum;  therefore, 
we  do  not  see  them  and  the  paradox  is  resolved. 

The  connection  between  velocity  and  distance  is  known  now  as  the  Hubble  law, 
and  the  ratio  of  the  velocity  of  recession  of  any  galaxy  and  its  distance  from  us  is 
called  now  the  Hubble  constant,  a  fundamental  measure  of  the  universe.  It  is 
designated  by  the  symbol  H0 ,  in  which  the  H  is  for  Hubble  and  the  zero  subscript 
denotes  its  current  value. 
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Fig.  14.9  Discovery  diagram  of  the  expanding  universe  A  plot  of  the  distance  of  extragalactic 
nebulae,  or  galaxies,  versus  the  radial  velocity  at  which  each  galaxy  is  receding  from  the  Earth, 
published  in  1929  by  the  American  astronomer  Edwin  Hubble  (1889—1953).  The  linear 
relationship  between  the  distance  and  radial  velocity  indicates  that  the  universe  is  expanding. 
Vesto  M.  Slipher  (1875—1969)  determined  most  of  these  velocities  more  than  a  decade  before 
this  diagram  was  drawn.  Here,  the  velocity  is  in  units  of  kilometers  per  second,  abbreviated 
km  s-1,  and  the  distance  is  in  units  of  millions  of  parsecs,  or  Mpc,  where  1  Mpc  is  equivalent  to 
3.26  million  light-years.  Hubble  underestimated  the  distances  of  the  spiral  nebulae;  therefore,  the 
distance  scale  for  modern  versions  of  this  diagram  is  about  seven  times  larger.  The  filled  circles 
and  solid  line  represent  the  solution  for  individual  nebulae;  the  open  circles  and  dashed  line  are 
for  groups  of  them 


The  units  of  the  Hubble  constant  are  given  in  kilometers  per  second  per 
Megaparsec,  abbreviated  km  s-1  Mpc-1,  and  Hubble’s  initial  estimate  was  pegged 
at  530  in  these  units.  The  radial  velocity  is  in  units  of  km  s-1,  and  the  distance  is  in 

f\  99 

Megaparsecs,  abbreviated  Mpc,  in  which  1  Mpc  =10  pc  =  3.08568  x  10  m  is 
equivalent  to  3.26  million  light-years.  Galaxies  typically  are  separated  by  a  few 
Mpc,  or  about  10  million  light-years,  which  is  about  100  galaxy  diameters.  So,  the 
universe  is  largely  empty  space  relative  to  galaxies. 

According  to  the  now-famous  Hubble  law,  the  radial  velocity,  Vn  of  a  galaxy, 
as  measured  by  the  Doppler  effect,  is  given  by  the  linear  relation: 


Hubble’s  Law  =  cz  =  Hq  x  D,  (14.21) 

o  _ 1 

where  the  speed  of  light  c  =  2.9979  x  10  m  s  ,  and  z  is  the  redshift.  The 
Hubble  constant,  H0,  is  the  ratio  of  the  speed  with  which  distant  galaxies  are 
receding  from  us  to  their  distance,  D.  It  is  as  though  the  expanding  universe  started 
with  a  gigantic  explosion,  with  the  fastest-moving  parts  having  traversed  the 
greatest  distances.  The  redshift  is  an  observational  parameter  defined  by: 


z 


A observed  A emitted 

A emitted 


which  means  that 


1  +  z  = 


A 


observed 


A 


emitted 


(14.22) 


(14.23) 
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for  a  spectral  line  emitted  at  the  wavelength  A, emitted  and  observed  at  Aobserved.  For 
relatively  low  velocities 


Vr  =  cz  =  H0D  for  Vr  <C  c , 


and  for  velocities  comparable  to  the  speed  of  light, 


or  equivalently: 


c 


(z+l)2-l 

(z+  1)2+1 


for  Vr  «  c, 


1  +z  = 


(14.24) 


(14.25) 


(14.26) 


In  just  2  years,  Hubble  extended  the  velocity-distance  relationship  to  substan¬ 
tially  greater  distances,  with  the  help  of  Milton  Humason  (1891-1972),  who  made 
the  velocity  measurements  (Hubble  and  Humason  1931,  1934).  Because  of  the 
impossibility  of  measuring  distances  to  such  faint  objects,  Hubble  and  Humason 
simply  assumed  that  all  galaxies  have  the  same  intrinsic  luminosity.  They  inferred 
a  velocity-distance  relationship  by  comparing  the  observed  velocities  of  the  gal¬ 
axies  to  their  apparent  brightness.  Thus,  they  reformulated  Hubble’s  law  and 
showed  that  a  linear  relationship  between  distance  and  recession  velocity  is  valid, 
within  the  observational  uncertainties,  to  distances  as  far  as  100  Mpc  and  radial 
velocities  of  nearly  20,000  km  s-1. 

Since  distances  can  only  be  measured  for  relatively  nearby  extragalactic 
objects,  Hubble’s  law  for  remote  objects  is  sometimes  expressed  by  the  redshift- 
magnitude  relation  in  which  the  apparent  magnitude,  m,  is  given  by: 


m  =  51ogfc) +M  +  25,  (14.27) 

where  M  is  the  absolute  magnitude,  the  Hubble  constant  H0  is  given  in  units  of 
km  s'1  Mpc-1  and  the  factor  of  +  25  arises  because  the  distance  unit  is  in 
Megaparsecs,  abbreviated  Mpc. 

Humason  then  teamed  up  with  Nicholas  May  all  (1906-1993)  at  the  Lick 
Observatory  in  an  ambitious  25  year  project  of  painstakingly  measuring  the 
Doppler-effect  redshifts  of  nearly  a  thousand  galaxies  visible  from  the  northern 
hemisphere.  Mayall  used  the  venerable  0.9  m  (36  inch)  Crossley  reflector  on 
Mount  Hamilton  to  observe  the  brighter  galaxies,  while  Humason  observed  the 
fainter  ones  using  the  2.5  m  (100  inch)  Hooker  telescope  on  Mount  Wilson. 

Although  Hubble  had  initiated  the  project,  he  died  before  the  work  was  fin¬ 
ished,  and  the  analysis  was  left  in  the  hands  of  his  young  protege  Allan  Sandage 
(1926-2010).  He  obtained  a  value  for  the  Hubble  constant  of  H0  = 
180  km  s'1  Mpc-1,  or  about  one  third  the  value  previous  found  by  Hubble,  whose 
distance  scale  was  in  error.  His  mistake  was  not  discovered  until  the  early  1950s 
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when  red-sensitive  photographic  plates,  developed  for  military  reconnaissance  in 
World  War  II  (1939-1945),  became  routinely  available.  Sandage  then  used  them 
to  discover  that  the  brightest  stars,  which  Hubble  used  to  infer  distances,  are  much 
more  luminous  emission  nebulae. 

The  resulting  redshift-apparent  brightness  diagram  of  474  extra-galactic  neb¬ 
ulae,  as  they  preferred  to  call  them,  had  a  large  scatter.  But  a  straight  line  could  be 
drawn  through  it,  out  to  a  radial  velocity  of  100,000  km  s-1,  or  one-third  the  speed 
of  light  (Humason  et  al.  1956).  So  Hubble’s  law  connecting  radial  velocity  and 
distance  still  held  in  every  direction  as  far  as  one  could  see.  This  meant  that  the 
entire  universe  is  expanding  swiftly  and  evenly  in  all  directions,  with  the  fastest- 
moving  parts  having  traversed  the  greatest  distance. 

The  Hubble  constant,  H0 ,  quantifies  the  current  rate  of  expansion  of  the  uni¬ 
verse,  and  it  has  often  been  quantified  in  the  form: 

Hq  =  100/zkms-1  Mpc-1.  (14.28) 

There  is  an  ongoing  controversy  about  the  exact  value  of  this  important  con¬ 
stant,  with  estimates  ranging  between  50  and  100  km  s-1  Mpc-1,  and  current 
observational  constraints  of  h  lying  between  0.50  and  0.85,  with  a  favored  value  of 
about  0.75  (Fig.  14.10).  The  systematic  uncertainties  in  H0  have  decreased  as  a 
result  of  observations  with  the  Hubble  Space  Telescope  and  the  Spitzer  Space 
Telescope  with  recent  determinations  of  H0  =  73.8  zb  2.4  km  s  1  Mpc  1  (Reiss 
et  al.  2011)  and  H0  =  74.3  db  2.1  km  s-1  Mpc-1  (Freedman  et  al.  2012).  Lang 
(1999)  and  Freedman  and  Madore  (2010)  have  reviewed  determinations  of  the 
Hubble  constant,  and  Feast  and  Walker  (1987)  have  discussed  Cepheids  as  dis¬ 
tance  indicators. 

The  Hubble  constant  sets  the  physical  scale  of  the  universe,  with  the  distance  to 
any  galaxy  given  by: 


D  =  0  =  2997.90  Mpc.  (14.29) 

The  most  distant  objects  exhibit  larger  redshifts,  denoted  by  the  lowercase 
letter  z.  The  largest  observed  redshift  corresponds  to  the  greatest  distance  and 
looks  the  farthest  back  in  time.  Astronomers  currently  are  detecting  galaxies  out  to 
a  redshift  as  great  as  z  =  8.6,  the  radiation  of  which  was  emitted  about  13.1  billion 
years  ago  (Lehnert  et  al.  2010). 

Once  the  distance  to  a  galaxy  is  determined  from  its  redshift  and  Hubble’s 
constant,  one  can  infer  the  galaxy’s  luminosity,  L,  from  it  apparent  brightness  or 
equivalently  its  absolute  magnitude,  M,  from  its  apparent  magnitude  m.  When  this 
is  done,  a  wide  range  of  luminosities  is  determined.  The  number  of  galaxies  at 
different  luminosities  peaks  at  a  blue  luminosity,  L*,  given  by: 

L*  «  10 l0LBQh~2  «  2  x  1O'%0,  (14.30) 

where  h  «  0.75  and  the  luminosity  of  the  Sun  in  the  blue  spectral  region  is 

oz:  _ i 

Lbq  =  1.9  x  10“  Js  corresponding  to  an  absolute  blue  magnitude  for  the  Sun 
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Fig.  14.10  Hubble  diagram  for  Cepheid  variable  stars  This  plot  of  galaxy  distance  versus 
recession  velocity  is  analogous  to  that  obtained  by  Edwin  Hubble  in  his  1929  discovery  of  the 
expansion  of  the  universe  (see  Fig.  14.9).  The  slope  of  the  linear  fit  {solid  line )  to  the  data  {dots) 
measures  the  expansion  rate  of  the  universe,  a  quantity  called  the  Hubble  constant,  designated  H0. 
The  data  shown  here  summarize  1 1  years  of  effort  to  measure  this  constant  by  using  the  Hubble 
Space  Telescope  to  measure  the  distances  and  velocities  of  Cepheid  variable  stars  in  nearby 
galaxies.  The  distance  is  in  units  of  1  million  parsecs,  or  Mpc,  where  1  Mpc  is  equivalent  to  3.26 
million  light-years;  the  radial  velocity  is  given  in  units  of  kilometers  per  second,  denoted  as 
km  s'1.  The  fit  to  these  data  indicate  that  H0  =  75  ±  10  km  s-1  Mpc-1  and  that  this  constant 
lies  well  within  the  limits  of  50  and  100  in  the  same  units  {dashed  lines).  [Adapted  from  Wendy 
L.  Freedman  et  al.  “Final  results  from  the  Hubble  Space  Telescope  Key  Project  to  measure  the 
Hubble  constant,”  Astrophysical  Journal  553,  47-72  (2001).] 

of  Mbq  =  5.48.  At  this  luminosity,  each  galaxy  contains  at  least  20  billion  stars 
like  the  Sun. 

But  galaxies  can  be  more  or  less  luminous  than  L*,  and  the  range  in  luminosity 
is  described  by  a  luminosity  function  <p{L)  that  specifies  the  number  of  galaxies 
with  luminosities  between  L  and  L  +  dL.  It  is  also  called  the  Schechter  luminosity 
function,  and  it  given  by  (Schechter  1976): 

=  *.(£)«  p  (-£)£• 

where  <fi*  is  a  normalization  constant  for  L  =  L*  and  given  by: 

0*  ~  0.02  h3  Mpc-3  ~  0.0084  Mpc-3 

for  the  total  galaxy  population,  a  is  the  slope  of  the  luminosity  function  at  low 
luminosity  L  <  L*  where  cj)(L)  oc  La  and  a  =  —0.8  to  —1.3.  At  high  luminosity 
L  >  L*  there  is  an  exponential  cutoff  with  <p{L)  oc  exp  (— L).  In  other  words,  the 
number  of  galaxies  falls  off  on  the  low  side  of  L*  and  on  the  high  side  of  it,  in  ways 
described  by  the  two  terms  in  the  luminosity  distribution. 


(14.31) 


(14.32) 
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The  wide  range  in  galaxy  luminosity  corresponds  to  a  wide  range  in  absolute 
blue  magnitude  MB  with  MB  =  —7.5  to  —22.5,  and  a  peak  value  MB*  given  by: 

Mb*  «  -19.7  +  5  log  (h)  «  -20.3,  (14.33) 

where  Hubble’s  constant  H0  =  100  h  km  s-1  Mpc-1  and  h  «  0.75. 

The  space,  or  volume,  density  of  galaxies,  Ng ,  their  blue  luminosity  density, 
LBg ,  and  their  total  number  out  to  redshift  z  can  be  determined  by  integrating 
the  observed  luminosity  distribution  function  obtained  from  redshift  surveys 
(Focus  14.3).  The  results  indicate  that: 


Ng  «  0.02  Mpc  3 

(14.34) 

LBg  «  3  x  1034  J  s-1  Mpc-3 

(14.35) 

Nt  «  6  x  109z3. 

(14.36) 

Altogether,  there  are  least  10  billion  galaxies  in  the  volume  of  space  that 
modern  telescopes  can  detect  and  there  is  no  end  in  sight. 


Focus  14.3  Density  and  total  number  of  galaxies 

The  space  density  of  galaxies,  Ng,  or  their  density  per  unit  volume,  can  be 
determined  by  adding  up,  or  integrating,  the  contribution  to  the  luminosity 
function  <p(L)  at  different  luminosities,  L.  Expressed  mathematically: 


oo 


0 


4>(L)dL  pa  0*r(a  +  1) 


(14.37) 


Q  _  1  _ O 

where  the  normalization  constant  0*  «  0.02  /r  Mpc-  «  0.0084  Mpc-  at 
L*  =  1010  Lb  h~2  %  2  x  1010  Lb_  ,  r  is  a  gamma  function,  and  a  is  low- 
luminosity  slope  of  the  luminosity  function. 

The  blue  luminosity  density,  LBg ,  can  similarly  be  defined  by: 


oo 


0 


L(j)(L)dl  ~  (j)*L*T( a  +  2). 


(14.38) 


Redshift  surveys  can  be  used  to  determine  these  parameters  for  field 
galaxies  that  lie  beyond  the  local  concentration  of  galaxies,  known  as  the 
Local  Group,  with  the  results  (Loveday  et  al.  1992): 

Ng  =  0.0552 h3  Mpc"3  «0.019Mpc"3  (14.39) 
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Lg  x  2  x  10 %hhBQ  Mpc-3  «  3  x  1034  Js_1  Mpc-3,  (14.40) 

where  Hubble’s  constant  Ho  =  100  h  km  s-1  Mpc-1  and  h  x  0.75. 

1  9 

With  a  total  galaxy  mass  of  10  “  M0,  in  both  visible  and  unseen  dark 
matter,  this  corresponds  to  a  galaxy  mass  density  pg  of: 

pg  =  10 i2NgMe  X  10“27kg  m-3.  (14.41) 

From  the  Hubble  law,  the  distance  of  a  galaxy  at  redshift,  z,  is 
D  =  cz/Hq ,  where  the  speed  of  light  c  =  2.9919  x  108  m  s_1.  So  the  total 
number  of  galaxies,  NT ,  out  to  redshift  z  is: 

Nt  =  Y  (jf)  Ngx6x  lOV,  (14.42) 

which  is  independent  of  h  and  the  exact  value  of  the  Hubble  constant. 

Since  galaxies  have  been  observed  out  to  redshifts  greater  than  one,  there 
are  at  least  10  billion,  or  1010,  galaxies  in  the  observable  universe.  The  exact 
shape  and  form  of  the  universe  complicates  the  precise  calculations,  since 
the  curvature  of  space  changes  the  distances  at  large  redshifts.  Still,  our 
estimate  should  be  correct  to  within  an  order  of  magnitude,  or  a  factor  of  ten. 


As  Hubble  realized,  astronomers  see  only  as  far  as  their  telescopes  permit, 
eventually  reaching  a  limit  -  a  dim  boundary  to  the  observable  universe  where 
they  measure  the  shadows.  Even  now,  there  is  no  telescope  powerful  enough  to 
detect  the  edge  where  the  galaxies  might  end.  There  is  no  edge  to  the  observable 
universe  and  it  has  no  detectable  center. 


Example:  Can  visible  or  invisible  matter  stop  the  expansion  of  the 
universe? 

Throughout  most  of  the  past  decades,  it  has  been  assumed  that  it  is  the  mass  of 
the  universe  that  curves  its  shape,  establishes  its  geometry,  and  determines  its 
fate.  Under  this  assumption,  which  ignores  the  more  recent  discovery  of  dark 
energy,  the  mass  density  of  galaxies,  pg ,  determines  the  ultimate  destiny  of  the 
universe.  If  this  mass  density  exceeds  a  certain  critical  value,  pc ,  then  gravity 
eventually  overcomes  expansion.  Imagine  the  most  distant  galaxy  with 
mass,  mG ,  distance,  DG,  and  velocity,  VG  =  H0DG.  Gravity  will  just  balance 
the  expansion  of  this  galaxy  if  its  kinetic  energy  of  expansion  is  equal  to  the 
gravitational  potential  energy  of  all  of  the  rest  of  the  universe,  or  if: 

rngVl  _  _  GmqMu 
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where  Mv  is  the  total  mass  of  all  the  rest  of  the  universe  inside  distance  DG 
and  the  gravitational  constant  G  =  6.614  x  10  m  kg  s  .  In  other 
words,  the  velocity,  VG,  of  the  most  distant  galaxy  is  just  equal  to  the  escape 
velocity,  Vesc,  of  the  entire  universe,  Vesc  =  (2 GMjj/Dg)1^2. 

Collecting  terms  we  obtain  a  critical  mass  density  of: 

Pc  =  — — \  =  — — -  ~  1.88  x  10  26  h 2  m3  ~  1.06  x  10  26  kgm-3, 
4nD3G  8ttG  5 

(14.44) 

where  H0  =  100  h  km  s_1  Mpc-1  =  3.24  x  10-18  h  s-1  «  2.43  xlO-18  s_1 
for  h  =  0.75. 

O  _ Q 

The  space  density  of  galaxies  is  Ng  =  0.055  h  Mpc  (see  Focus  14.3), 
where  1  Mpc  =  3.0857  x  10^~  m.  Assuming  that  each  galaxy  has  a  mass  in 
both  visible  and  unseen  dark  matter  of  MG  =  1012  M0  =  1.989  x  1042  kg 
and  h  =  //0/100  =  0.75,  then  the  mass  density  of  visible  galaxies, 

_ 0^7  _ Q 

pG  =  NgMG  k,  1.57  x  10  kg  m  .  This  is  about  a  factor  of  10  less  than 
the  critical  mass  density  needed  to  stop  the  expansion  of  the  universe,  even 
when  we  have  assumed  there  is  ten  times  more  dark  matter  than  visible 
matter  in  galaxies. 


14.4  Galaxies  Gather  and  Stream  Together 
14.4.1  Clusters  of  Galaxies 

Astronomers  have  been  mapping  the  distribution  of  galaxies  for  about  a  century, 
determining  the  shape  and  form  of  the  larger  universe.  The  first  cosmic  maps  were 
two-dimensional,  constructed  from  catalogues  giving  the  celestial  positions  of  the 
brightest  nebulae.  Although  a  foreground  and  background  galaxy  might  sometimes 
coincide,  the  concentrations  were  too  pronounced  to  be  solely  due  to  such  a 
superposition  (Charlier  1922). 

The  galaxies  are  not  placed  randomly  throughout  expanding  space.  They  are  not 
uniformly  strewn  here  and  there  or  isolated  from  one  another  but  instead  knot 
together  in  great  clusters  that  are  millions  of  light-years  across  (Fig.  14.11).  They 
also  contain  large  additional  quantities  of  unseen  matter.  A  rich  cluster  of  galaxies 
typically  spans  10  million  light-years  to  20  million  light-years  and  contains  hun¬ 
dreds  and  even  thousands  of  individual  galaxies.  They  move  within  the  cluster  at 
velocities  of  about  1,000  km  s  \  on  average,  and  the  amount  of  time,  Tc,  required 
for  a  galaxy  to  cross  the  cluster  moving  at  this  speed  is  about  a  two  billion  years. 
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Fig.  14.11  Cluster  of  galaxies  The  Hubble  Space  Telescope  provided  this  dramatic  view  of  the 
center  of  a  massive  cluster  of  galaxies  known  as  Abell  1689,  located  2.2  billion  light-years  away. 
The  gravity  of  the  cluster’s  million  million,  or  trillion,  stars,  plus  any  unseen  matter,  acts  as  a 
gravitational  lens  in  space,  bending  and  magnifying  the  light  of  galaxies  located  far  behind  it  into 
radiant  arcs.  (Courtesy  of  NASA,  the  ACS  Science  team  of  the  HST,  STScI,  and  ESA.) 

In  1958,  George  Abell  (1927-1983),  then  a  graduate  student  at  Caltech,  was 
able  to  describe  nearly  2,712  rich  clusters  of  galaxies  (Table  14.4),  using  photo¬ 
graphic  surveys  that  included  both  faint  and  bright  galaxies  (Abell  1958).  Even 
today,  these  dense  concentrations  of  galaxies  are  referred  to  simply  as  “Abell 
clusters”  -  designated  by  the  word  “Abell”  or  the  letter  “A”  followed  by  the 
number  in  his  catalogue. 

In  addition,  as  Abell  noticed,  the  clusters  gather  and  congregate  together  into 
larger  superclusters,  which  he  called  second-order  clustering.  Our  Galaxy  lies  in 
the  outskirts  of  one,  known  as  the  Local  Supercluster  (de  Vaucouleurs  1953), 
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Table  14.4  Physical  properties  of  rich  clusters  of  galaxies21 

NT  =  total  number  of  galaxies  in  a  rich  cluster  of  galaxies  =  30-300 

RcI  =  central  radius  of  galaxy  cluster  «  (1-2)  h~l  Mpc  «  (4.4-8. 8)  x  1022  m 

_ Q 

Nci  =  volume  density  of  galaxies  in  galaxy  cluster  «  100  Mpc 

<jv  =  velocity  dispersion  of  galaxy  motions  «  100-1,400  km  s-1  =  105— 1.4  x  106  m  s-1 
Mc  =  virial  mass  of  galaxy  cluster  =  cr2  Rci/G  «  (1014-2  x  1015)  h~l 
Mq  %  (2.8-57)  x  1044  kg 

Tc  =  cluster  crossing  time  &  2  Rci/ov  %  6  x  1016  s  «  2  x  109  year 

ii  i  p  _ o 

Lb  =  luminosity  of  galaxy  cluster  in  blue  band  =  (6  x  1011  to  6  x  io1")  hr * 

Lq  %  (4.6-46)  X  1038  J  s-1 

Me/ Lb  —  mass  to  light  ratio  of  galaxy  cluster  ~  300  h  Mq/Lq  %  210  Mq/Lq 

Lx  =  x-ray  luminosity  of  galaxy  cluster  =  (1035  5-1038)  h~2  J  s-1  «  2.0  (1035  5— 1038)  J  s-1 

nci  =  cluster  number  density  «  (10-5-10-6)  h3  Mpc-3  «  0.34  (10-5-10-6)  Mpc-3 

a  For  Hubble  constant  H0  =  100  h  km  s-1  Mpc-1.  The  Sun’s  mass  M0  =  1.989  x  1030  kg 
and  the  Sun’s  luminosity  L0  =  3.828  x  1026  J  s-1 

which  is  oriented  perpendicular  to  the  Milky  Way  and  extends  all  the  way  to  the 
rich  Virgo  cluster  of  galaxies. 

Bahcall  (1977)  has  reviewed  clusters  of  galaxies,  and  Bahcall  (1988,  1993)  has 
reviewed  the  large-scale  structure  in  the  universe  indicated  by  galaxy  clusters. 
Oort  (1983)  reviewed  superclusters  of  galaxies,  and  Rood  (1981,  1988)  has  dis¬ 
cussed  clusters  of  galaxies  and  voids. 


14.4.2  Dark  Matter  in  Clusters  of  Galaxies 

Clusters  of  galaxies  are  bound  together  by  gravity,  even  though  the  expansion  of 
the  universe  is  pulling  the  galaxies  away  from  one  another.  We  could  think  that  the 
combined  gravitational  pull  of  the  numerous  galaxies  might  be  sufficient  to  hold 
them  together  but,  in  1937,  Fritz  Zwicky  (1898-1974)  showed  that  there  must  be 
substantial  amounts  of  unseen  material  that  is  keeping  the  clusters  of  galaxies  from 
dispersing  (Zwicky  1937).  In  his  extraordinarily  prescient  paper,  titled  On  the 
Masses  of  Nebulae  and  of  Clusters  of  Nebulae ,  he  concluded  that  there  must  be 
noticeable  quantities  of  invisible  intergalactic  matter  in  clusters  of  galaxies  or  else 
they  would  be  unstable  dynamically.  In  a  German  language  article  discussing 
many  of  the  same  topics  years  earlier,  Zwicky  introduced  the  term  dunkle  materie, 
or  “dark  matter”  for  the  invisible  stuff,  and  he  concluded  that  it  might  be  present 
with  a  greater  density  than  luminous  matter  (Van  den  Bergh  1999). 

Zwicky  measured  the  amount  of  mass  required  to  keep  the  Coma  cluster  of 
galaxies  stable,  assuming  that  the  motions  of  its  constituent  visible  galaxies  are 
balanced  by  the  gravitational  pull  of  their  combined  mass.  He  found  that  the  total 
mass  of  the  Coma  cluster  must  be  about  10  times  the  sum  of  the  masses  of  the 
individual  galaxies  it  contains.  That  is,  he  inferred  the  total  cluster  mass,  Mc,  from 
the  velocity  dispersion,  ov ,  of  the  galaxy  motions,  above  that  due  to  the  expansion 
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of  the  universe.  For  a  cluster  of  radius,  Rc ,  and  angular  diameter  6 ,  the  binding 
mass  required  to  hold  the  cluster  together  is: 


*c4  =  0cz4 

G  2  GH0 


(14.45) 


_  11  Q  _  1  _ O 

where  the  gravitational  constant  G  =  6.674xl0  mkg  s  ,  the  speed  of 

o  _ 1 

light  c  =  2.9979  x  10  m  s  ,  the  mean  redshift  of  the  cluster  due  to  the 
expansion  of  the  universe  is  z,  and  the  Hubble  constant  H0  =  100  h  km  s-1 
Mpc-1  with  h  «  0.75.  Zwicky  concluded  that  this  binding  mass  Mc  is  about  20 
times  the  mass  of  the  visible  galaxies  in  the  Coma  cluster,  and  that  if  this  dark 
matter  were  not  present  the  Coma  cluster  would  be  flying  apart. 


Example:  Binding  mass  of  the  Coma  cluster  of  galaxies 

The  Coma  cluster  of  galaxies  has  a  redshift  z  =  0.023 1 ,  which  corresponds  to  a 
distance  of  D  =  cz/H0  «  92  Mpc,  using  the  speed  of  light 
c  =  2.9979  x  105  km  s-1  and  the  Hubble  constant  H0  =  75  km  s-1  Mpc-1. 
The  galaxies  within  this  cluster  have  a  velocity  dispersion,  ov ,  of 

_ i  r  _ i 

gv  ~  1,000  km  s  =  10  m  s  .  There  are  about  800  identified  galaxies 
within  an  area  of  1 00'  x  1 00'  centered  on  this  cluster,  where '  denotes  a  minute 
of  arc.  For  an  angular  diameter  0  =  100'  =  6,000"  =  0.029  radians,  where 
1  radian  =  2.06265  x  105  "  and  "  denotes  a  second  of  arc,  the  cluster 
radius  Rc  is  Rc  =  OD  ~  2.7  Mpc,  and  the  binding  mass  required  to  hold  the 
cluster  together  is  Mc  =  RCGy/G  «  1045  kg  «  5  x  10 14  M0  where  1 
Mpc  =  3.0857  x  1022  m,  the  gravitational  constant  G  =  6.674  x 
10-11  m3  kg-1  s-2,  and  the  Sun’s  mass  M0  =  1.989  x  1030  kg.  If  each 
galaxy  contains  about  1011  visible  stars  like  the  Sun,  then  the  total 
visible  stellar  mass  of  the  800  galaxies  in  this  part  of  the  cluster  is  about 
0.8  x  1014  M0.  This  means  that  roughly  4.2  x  1014  or  about  85  %  of  the  total 
mass  of  the  cluster  has  to  be  in  unseen  dark  matter  to  bind  the  Coma  cluster  of 
galaxies  together. 


Hot,  x-ray  emitting  gas  has  also  been  found  permeating  the  space  between 
galaxies  in  massive  clusters.  Any  hydrogen  atoms  immersed  in  these  clusters  and 
moving  at  a  similar  speed  to  the  galaxies  would  have  to  be  very  hot,  with  a 
temperature  of  about  100  million  K.  At  this  temperature,  the  gas  is  an  intense 
emitter  of  x-rays. 


Example:  How  hot  is  the  intergalactic  gas  in  a  cluster  of  galaxies? 

The  thermal  velocity,  Vthermah  of  a  hydrogen  atom  of  mass  mH  at  temperature  T 
is  (Sect.  5.2),Vthermal  =  (3  kT /  m  z/)1^2,  where  the  mass  of  the  hydrogen  atoms  is 
mH  =  1.673  x  10  kg  and  the  Boltzmann  constant  k  =  1.38065  x 

_ o  o  _ 1 

10  ~  J  K  .If  the  thermal  velocity  is  comparable  to  the  galaxy  velocity 
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_ i  /:  _ i 

dispersion  av  =  1,000  km  s  =  10  m  s  in  the  Coma  cluster  of  galaxies, 
then  the  temperature  T  =  mH°v/(?k)  «  4.04  x  107  K.  Such  a  hot  gas  emits 
intense  x-rays.  To  put  it  another  way,  the  hot  gas  with  a  temperature  of  about 

o 

10  K  that  is  observed  through  the  x-ray  emission  of  clusters  of  galaxies 
indicates  that  their  intergalactic  hydrogen  atoms  are  moving  at  a  velocity 
comparable  to  the  cluster  galaxies. 


Although  most  of  the  observable  mass  of  the  clusters  of  galaxies  is  in  the  form 
of  hot,  x-ray  emitting  gas,  outweighing  all  the  visible-light  stars  within  all  the 
galaxies  by  a  factor  of  about  seven,  both  the  stars  and  the  intergalactic  x-ray 
emitting  gas  constitute  only  about  15  %  of  the  total  mass  of  the  galaxy  clusters. 
The  remaining  85  %  of  the  gravitating  material  is  some  kind  of  mysterious  dark 
matter,  emitting  no  detectable  radio,  visible-light  or  x-ray  radiation,  and  no  one 
knows  what  it  is. 

In  his  1937  paper,  Zwicky  also  proposed  that  the  formidable  gravity  of  dark 
matter  in  clusters  of  galaxies  would  act  as  a  powerful  lens,  diverting,  and  focusing 
the  light  of  more  distant  galaxies.  Arthur  Stanley  Eddington  (1822-1944)  had 
previously  noted  that  the  gravitational  effect  of  a  star  would  produce  multiple 
images  and  magnification  of  images  of  a  more  remote  background  star  located 
behind  it  (Eddington  1920).  Later,  Albert  Einstein  (1879-1955)  discussed  the 
effect  for  stars,  but  stated  that  there  was  no  hope  of  observing  such  a  stellar  lens 
(Einstein  1936;  Renn  et  al.  1977).  However,  it  was  not  until  the  early  1980s  that 
astronomers  began  to  observe  that  effect  for  the  much  more  massive  clusters  of 
galaxies.  They  spread  the  light  of  a  distant  galaxy  that  lies  directly  behind  them 
into  an  array  of  faint,  tangentially  stretched  arcs. 

Hubble  Space  Telescope  ( HST)  images  of  rich  clusters  of  galaxies  reveal  the 
highly  stretched,  distorted,  and  magnified  images  of  faint  galaxies  lying  far  behind 
them  (Figs.  14.12  and  14.13).  As  Zwicky  proposed,  the  gravitational-lens  effect 
provides  information  on  both  the  visible  and  unseen  matter.  Moreover,  the  dark 
matter  can  act  like  a  zoom  lens,  magnifying  distant  galaxies  too  faint  to  be  seen 
and  bringing  them  into  view. 

The  presence  of  two  galaxies  along  the  same  line  of  sight,  one  more  distant  than 
the  other,  was  suggested  from  the  spectroscopic  database  of  the  Sloan  Digital  Sky 
Survey,  and  these  gravitational-lens  candidates  were  confirmed  using  the  high- 
resolution  imaging  capability  of  the  HST.  Hundreds  of  these  cosmic  gravitational 
lenses  have  been  found.  Some  exhibit  partial  or  complete  “Einstein”  rings,  which 
indicate  near-perfect  alignment  of  the  foreground  and  background  galaxies 
(Figs.  14.14  and  14.15).  In  1924,  Orest  Chwolson  (1852-1934)  described  the 
production  of  such  a  ring  by  a  gravitational  lens  (Chwolson  1924);  the  first 
observed  Einstein  ring  was  a  radio  source.  Even  a  double  ring  arising  from  the 
light  of  three  aligned  visible-light  galaxies  has  been  found. 

Treu  (2010)  has  reviewed  strong  lensing  by  galaxies,  and  Blandford  and 
Narayan  (1992)  reviewed  cosmological  applications  of  gravitational  lensing. 
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Fig.  14.12  Galaxy  cluster  lens  Very  distant  and  faint  galaxies  can  be  investigated  by  observing 
them  through  a  cluster  of  galaxies.  The  powerful  gravitation  of  the  cluster  acts  like  a  lens, 
bending,  focusing,  and  magnifying  the  light  of  more  distant  galaxies  that  lie  behind  it  (see 
Fig.  14.14).  The  gravitational  lens  action  can  distort  the  light  from  the  background  galaxies  into 
faint  arcs  or  produce  magnified  images  of  individual  galaxies  that  otherwise  would  remain 
invisible.  (Courtesy  of  NASA/JPL-Caltech.) 


Fig.  14.13  Cluster  of  galaxies  and  gravitational  lens  A  Hubble  Space  Telescope  image  of  a 
rich  cluster  of  galaxies  designated  Abell  2218.  It  is  about  1,000  Mpc  or  3  billion  light-years  away 
from  the  Earth.  A  typical  rich  cluster  contains  hundreds  and  even  thousands  of  galaxies,  each 
composed  of  hundreds  of  billions  of  stars  and  possibly  up  to  10  times  more  mass  within  invisible 
dark  matter.  The  galaxy  cluster  Abell  2218  is  so  massive  and  so  compact  that  its  gravity  bends 
and  focuses  the  light  from  galaxies  that  lie  behind  it.  Multiple  images  of  these  background 
galaxies  are  distorted  into  long  faint  arcs.  Magnified  or  ring  images  of  individual  background 
galaxies  also  can  be  observed.  (Courtesy  of  NAS A/STScI/ Andrew  Fruchter/the  ERO  team, 
Sylvia  Baggett/STScI,  Richard  Hook/ST-ECF/Zoltan  Levay,  STScI.) 
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Fig.  14.14  Gravity  lens  geometry  A  remote  galaxy,  labeled  S,  and  an  intervening  lens  galaxy, 
denoted  by  L  and  represented  by  the  dark  filled  circle,  lie  in  a  straight  line  along  the  observer’s 
line  of  sight  at  distances  denoted  by  D  and  its  subscripts.  Only  rays  passing  by  the  lens  at  the 
distance  b  will  reach  the  observer,  at  O,  forming  a  ring  image  with  an  angular  radius  denoted  by 
9e  (see  Fig.  14.15) 


Fig.  14.15  Einstein  ring  When  a  background  and  foreground  galaxy  are  aligned  perfectly,  the 
closer  galaxy  acts  as  a  gravitational  lens,  bending  and  magnifying  the  light  of  the  more  distant 
galaxy  and  forming  a  glowing  “Einstein”  ring.  A  double  ring  is  captured  in  this  Hubble  Space 
Telescope  image,  indicating  an  exceptionally  rare  alignment  of  a  massive  foreground  galaxy  with 
two  background  galaxies;  the  distances  of  the  three  galaxies  are  estimated  at  3  billion,  6  billion, 
and  11  billion  light-years  from  the  Earth.  (Courtesy  of  NASA/ESA/Raphael  Gavazzi  and 
Tommaso  Trea,  University  of  California  at  Santa  Barbara  and  the  SLACS  team.) 
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Example:  When  a  galaxy  acts  as  a  gravitational  lens 

When  a  light  ray  from  a  distant  source  passes  within  a  distance,  b ,  of  an 
object  of  mass  M,  then  the  bending  angle  ot  (see  Fig.  14.4),  is  given  by 


a  = 


4GM 

c2b 


radians 


(14.46) 


_  1  1  O  _ 1  _ 

where  the  Newtonian  gravitational  constant  G  =  6.614  x  10  m  kg  s  " 
and  the  speed  of  light  c  =  2.9979  x  108  m  s-1. 

When  a  light  ray  passes  the  limb,  or  edge,  of  the  Sun’s  visible  disk,  the 
mass  is  M  =  M0  =  1.989  x  1030  kg,  the  Sun’s  mass,  and  b  =  R.  = 

o 

6.955  x  10  m,  the  radius  of  the  Sun.  In  this  case: 


a  =  0.849  x  10"5  radians  =  1.75",  (14.47) 

where  1  radian  =  2.06265  x  105 "  and  the  symbol  "  denotes  seconds  of  arc. 

When  a  lens  object,  denoted  by  subscript  /  and  with  mass,  M,  is  perfectly 
aligned  along  the  line  of  sight  to  a  more  distant  source,  denoted  by  the 
subscript  s,  the  light  from  the  distant  source  will  be  deformed  into  a  ring 
(Chwolson  1924)  called  the  Einstein  ring.  The  angular  diameter,  0E ,  of  the 
ring  is  given  by  (Chwolson  1924;  Einstein  1936;  Lang  1999): 


4  GM\  1/2 
^D) 


(4  GMDls\ 

\c2DolDos ) 


1/2 

radians, 


(14.48) 


where  the  effective  distance  D  =  DotDos/ Dts ,  the  distance  from  the  observer  to 
the  lens  mass  is  Doh  the  distance  from  the  observer  to  the  background  source  is 
Dos ,  and  the  distance  from  the  lens  to  that  source  is  Dis  (also  see  Fig.  14.4). 

If  the  lens  mass  is  a  galaxy  containing  100  billion  stars  like  the  Sun,  with 
a  mass  M  —  1011  M0,  that  is  located  halfway  between  the  observer  and  the 
source,  with  Dis  =  Doi  =  Dos/ 2,  then: 


2  GM\ 

c^i) 


1/2 

radians  «  3.55  x  10  12dJ'2". 


(14.49) 


For  a  lens  galaxy  of  redshift  z  =  0.5,  the  radial  velocity  Vr  =  cz  and  the 
distance  Doi  =  cz/Hq  «  2.0  x  103  Mpc  «  6.17  x  1025  m,  where  the 
speed  of  light  c  =  2.9989  x  105  km  s  ',  the  Hubble  constant 
H0  =  100  h  km  s-1  Mpc-1,  the  h  &  0.75,  and  1  Mpc  =  3.0856  x  1022  m. 
For  this  value  of  Doi  we  obtain  0E  =  0.45",  and  such  rings  have  been  imaged 
from  the  Hubble  Space  Telescope  (see  Fig.  14.15). 
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14.4.3  Cosmic  Streams 

The  galaxies  are  not  simply  flying  outward  with  the  expansion  of  the  universe  in  a 
smooth  and  regular  manner.  Entire  groups  are  streaming  together  over  vast  dis¬ 
tances  in  various  directions.  These  so-called  peculiar  motions  are  caused  by  the 
gravitational  pull  of  huge  assemblages  of  galaxies,  and  they  are  not  related  to  the 
uniform  expansion  of  the  galaxies,  known  as  the  Hubble  flow. 

Davis  and  Peebles  (1983)  reviewed  evidence  for  local  anisotropy  in  the  Hubble 
flow,  and  Burstein  (1990)  reviewed  large-scale  motions  in  the  universe. 

For  instance,  the  nearest  large  galaxy,  Andromeda  (M  31),  is  moving  toward  the 
Milky  Way  at  a  velocity  of  about  300  km  s-1.  A  distance  of  about  778  kpc  cur¬ 
rently  separates  the  two  galaxies,  but  they  are  set  on  an  irrevocable  collision 
course.  They  will  meet  in  a  possibly  destructive  encounter  in  a  few  billion  years. 


Example:  When  will  Andromeda  enter  the  Milky  Way? 

The  distance  of  the  Andromeda  nebula  from  the  Earth  is 
D  =  778  kpc  =  2.40  x  1022  m,  where  1  kpc  =  3.0857  x  1019  m.  The 
observed  radial  velocity  of  Andromeda  is  Vr  —  —  301  km  s  \  with  the 
negative  sign  indicating  approaching  motion.  Andromeda  will  collide  with 
the  Milky  Way  in  a  time  T  =  D/Vr  «  8.0  x  1016  s  «  2.5  x  109 

'■7 

years  «  2.5  billion  years,  where  1  year  =  3.1557  x  10  s. 


In  addition,  an  entire  swarm  of  galaxies  can  set  off  on  a  trajectory  that  is 
independent  of  the  expansion.  Because  the  galaxies  are  moving  together  over  vast 
distances  of  hundreds  of  millions  of  light-years,  their  collective  behavior  is  known 
as  a  large-scale  streaming  motion,  which  is  large  in  space  but  not  so  big  in 
velocity  -  generally  no  more  than  1,000  km  s-1. 

All  of  the  galaxies  in  our  region  of  space,  within  a  volume  of  30  Mpc  across, 
are  rushing  en  masse  toward  the  same  remote  point  in  space.  All  of  these  galaxies 
are  being  pulled  through  space,  forced  into  mass  migration  by  the  gravitational 
pull  of  “The  Great  Attractor,”  located  at  a  distance  of  about  5  Mpc  away.  Its  mass 
is  equivalent  to  about  50  million  billion  (5  x  10  )  stars  like  the  Sun  and  at  least 
500,000  galaxies  like  the  Milky  Way.  The  Great  Attractor  most  likely  is  a  rich  and 
massive  cluster  of  galaxies,  part  of  an  even  larger  super-cluster. 

Occasionally  we  observe  other  galaxies  colliding,  merging,  or  passing  through 
each  other  (Fig.  14.16)  when  they  should  be  moving  farther  apart.  This  mingling 
also  is  due  to  the  gravitational  attraction  of  neighboring  galaxies,  drawing  them 
together  and  producing  local  eddies  within  an  outward  Hubble  flow. 

Barnes  and  Hernquist  (1992)  have  reviewed  the  dynamics  of  interacting 
galaxies. 

It  is  the  gravitational  interaction  of  galaxies  with  one  another  that  distorts  the 
smooth  cosmic  expansion,  producing  the  peculiar  motions  superposed  on  the 
expanding  universe.  However,  the  uniform  Hubble  flow  gathers  speed  with 
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Fig.  14.16  Colliding  galaxies  Gravitational  interaction  of  the  Antennae  galaxies,  catalogued  as 
NGC  4038  and  NGC  4039,  produces  long  arms  of  young  stars  in  their  wake.  The  colliding 
galaxies  are  located  about  62  million  light-years  from  the  Earth  and  have  been  merging  for  the 
past  800  million  years.  As  the  two  galaxies  continue  to  churn  together,  clouds  of  interstellar  gas 
and  dust  are  shocked  and  compressed,  triggering  the  birth  of  new  stars.  This  composite  image  is 
from  the  Chandra  X-ray  Observatory  (blue),  the  Hubble  Space  Telescope  (gold  and  brown),  and 
the  Spitzer  Space  Telescope  (red).  The  blue  x-rays  show  huge  clouds  of  hot  interstellar  gas,  the 
red  data  show  infrared  radiation  from  warm  dust  clouds  that  have  been  heated  by  newborn  stars, 
and  the  gold  and  brown  data  reveal  both  star-forming  regions  and  older  stars.  (Courtesy  of 
NASA /ESA/S  AO /CXC  /  JPL-Caltech /STScI.) 

distance.  Because  the  localized  streaming  motions  are  limited  in  velocity,  they  are 
relatively  slow  when  compared  with  the  expansion  speed  of  remote  galaxies.  The 
very  existence  of  the  large-scale  streaming  motions  nevertheless  indicates  a 
decidedly  uneven  and  lumpy  distribution  of  galaxies,  which  eventually  was 
mapped  across  billions  of  light-years. 
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Fig.  14.17  Great  walls  and  voids  By  measuring  the  recession  velocity,  or  redshift,  of  galaxies, 
astronomers  determined  their  distance  and  combined  it  with  their  location  in  the  sky  to  obtain  the 
three-dimensional  distribution  of  galaxies.  The  map  shown  here  is  for  galaxies  within  1  billion 
light-years  (far  left  or  far  right )  from  the  Earth  (center).  Because  galaxies  started  to  form  about 
12  billion  years  ago,  this  is  a  relatively  nearby  part  of  the  universe.  It  includes  recession  velocities 
of  up  to  30,000  km  s-1,  at  a  redshift  of  about  z  =  0.1.  The  galaxies  are  concentrated  in  long, 
narrow  sheet-like  walls  encircling  large  empty  places  known  as  voids,  about  100  million  light- 
years  across.  The  Sloan  Great  Wall  (left)  spans  about  1.4  billion  light-years.  It  may  be 
gravitationally  unbound,  perhaps  beginning  to  fall  apart,  but  it  includes  superclusters  of  galaxies 
that  may  stay  bound  together  by  their  mutual  gravitational  pull.  The  Sloan  Great  Wall  was 
discovered  using  data  from  the  Sloan  Digital  Sky  Survey  in  2004.  Other  superclusters,  or  clusters 
of  galaxy  clusters,  are  labeled  in  the  diagram,  which  is  from  the  Two  Degree  Field  Galaxy 
Survey.  (Courtesy  of  Willem  Schaap,  Kapteyn  Institute,  University  of  Groningen  et  al.,  2dF 
Galaxy  Redshift  Survey.) 


14.4.4  Galaxy  Walls  and  Voids 

By  determining  the  concentrations  of  galaxies  in  different  directions  and  at  various 
redshifts,  or  depths,  astronomers  located  places  where  the  collective  force  of 
gravity  pulled  galaxies  together  and  locally  reversed  the  uniform  expanding 
motion.  The  three-dimensional  maps  reveal  fascinating  lace-like  patterns  that 
connect  the  galaxies  and  curving  filaments  that  enclose  dark,  seemingly  vacant 
places  (Lapparent  et  al.  1986).  The  galaxies  apparently  are  distributed  along  the 
peripheries  of  gigantic  hollow  bubbles. 

The  early  redshift  surveys  also  delineated  an  enormous  sheet  of  galaxies, 
dubbed  “The  Great  Wall,”  at  distances  ranging  from  350  million  to  500  million 
light-years  away  (Geller  and  Huchra  1989).  Giovanelli  and  Haynes  (1991)  pro¬ 
vided  a  review  of  the  redshift  surveys  of  galaxies  available  at  that  time. 
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Subsequent  three-dimensional  maps  were  obtained  using  electronic  technology 
that  permits  the  simultaneous  measurement  of  hundreds  of  galaxy  redshifts  in  a 
single  exposure  at  a  large  telescope.  The  Sloan  Digital  Sky  Survey,  for  example, 
revealed  the  longest  sheet  of  galaxies  yet  seen  (Gott  et  al.  2005;  Einasto  et  al.  201 1). 
Dubbed  the  “Sloan  Great  Wall,”  it  measures  1.37  billion  light-years  across 
(Fig.  14.17);  and  it  is  the  largest  observed  structure  in  the  universe  -  at  least  so  far. 

Thus,  everywhere  they  look,  in  whatever  direction  and  near  or  far,  modern 
telescopes  are  finding  a  complex  and  richly  textured  universe,  filled  with  luminous 
concentrations  of  matter.  There  is  no  perceptible  end  to  the  lumps  and  clumps  and 
vacant  places,  and  no  one  knows  where  the  unevenness  will  end.  Even  when 
looking  across  10  %  of  the  observable  universe,  astronomers  continue  to  find 
galaxy  structures  crossing  their  maps  from  edge  to  edge,  as  well  as  smaller  bub¬ 
bles,  walls,  and  voids  that  are  nestled  together. 

All  parts  of  the  observable  universe  are  bound  within  this  all-encompassing 
fabric,  glued  together  by  the  invisible  forces  of  gravity,  suspended  in  space  by 
motion,  and  linked  by  radiation.  This  all-embracing  cosmic  web  extends 
throughout  the  observable  universe  (Fig.  14.18). 


Fig.  14.18  Cosmic  web  One  moment  in  the  ever-changing  distribution  of  galaxies  studied  using 
a  supercomputer  to  trace  out  their  formation,  evolution,  and  clustering.  The  width  of  this  image  is 
about  10  million  light-years.  (Courtesy  of  Volker  Springel,  the  Millennium  Simulation  Project/ 
Max  Planck  Institute  for  Astrophysics,  Garching,  Germany.) 


512 


14  A  Larger,  Expanding  Universe 


14.5  Looking  Back  into  Time 


Because  light  travels  at  a  finite  speed,  to  look  far  into  space  is  to  look  back  into 
time.  When  we  look  farther  out  into  space,  we  travel  back  more  in  time.  Large 
telescopes  that  detect  the  faint  light  of  distant  objects  therefore  can  be  used  as  time 
machines  to  see  objects  as  they  were  in  the  past,  when  their  light  was  emitted,  as 
they  were  then  and  not  as  they  are  now.  In  effect,  astronomers  watch  cosmic 
history  race  toward  us  at  the  speed  of  light,  299,792  km  s-1.  The  look-back  time  is 
simply  the  amount  of  time  it  takes  for  light  to  travel  from  the  object  to  us  at  that 
speed. 

Moving  at  the  speed  of  light,  it  takes  2.3  million  years  for  light  to  travel  from 
the  nearest  spiral  galaxy,  Andromeda,  to  the  Earth.  Astronomers  have  observed 
radiation  from  distant  galaxies  whose  light  was  emitted  1 3  billion  years  ago,  long 
before  the  Sun  was  formed  about  4.6  billion  years  ago.  Therefore,  the  look-back 
times  for  galaxies  range  from  millions  to  billions  of  years,  spanning  an  enormous 
period  in  which  we  can  watch  them  evolve. 

Some  of  the  most  distant  galaxies  may  no  longer  exist,  but  they  were  embryonic 
galaxies  when  the  light  now  reaching  the  Earth  began  its  journey.  These  galaxies 
may  have  perished  over  time,  but  their  light  can  survive  unchanged,  helping  us 
trace  out  the  history  of  the  observable  universe  from  the  big  bang  -  about  13.7 
billion  years  ago  -  to  now.  As  long  as  a  ray  of  light  passes  through  empty  space 
and  encounters  no  atoms  or  electrons  it  will  persist  forever. 

For  an  object  at  redshift  z,  we  can  specify  the  look-back  time,  tL,  at  which  the 
radiation  was  emitted.  Looking  at  objects  at  larger  and  larger  redshift  is  looking 
further  and  further  into  the  past.  The  look-back  time  can  be  expressed  in  terms  of 
the  Hubble  time,  tH ,  or  the  reciprocal  of  the  Hubble  constant,  H0.  For  small 
redshifts  z  <C  1,  we  have 


tL  =  ztH  =  —  =  9.778  x  109  ^ years  (14.50) 

H0  h 

where  the  Hubble  constant  H0  =  100  /z  km  s-1  Mpc-1,  1  Mpc  =  3.0857  x 
1019  km,  and  1  year  =  3.1557  x  107  s.  The  Hubble  time  is 

t„  =  —  «  3.0857  x  101  -s  «  4.1  x  1017s«13  x  109  years.  (14.51) 

Ho  h  y  \  j 

At  large  redshift,  the  look-back  time  is  given  by: 


2 

3HqQq2(1  +  z)3/2 ' 


(14.52) 


where  the  density  parameter  Q0  =  p0/ Pc,  the  ratio  of  the  present  mass  density  of 
the  universe,  p0>  to  the  critical  mass  density,  pc  =  3Hq/(8tzG)  =  1.879  x 
10-26  h 2  kg  m-3,  for  a  Hubble  constant  H0  =  100  h  km  s-1  Mpc-1,  needed  to 
close  the  universe  in  the  future. 
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Example:  Viewing  a  distant  galaxy 

Suppose  a  galaxy  is  located  at  a  distance  of  D  =  100  Mpc.  Assuming  a 
Hubble  constant  of  H0  =  75  km  s-1  Mpc-1,  the  radial  velocity  of  the  gal¬ 
axy  is  Vr  =  H0  x  D  =  7,500  km  s-1  and  its  redshift  is  z  =  Vr/c  =  0.025. 

i  >-j 

The  Hubble  time  tH  =  l///0  =  4.1 1  x  10  s  ^  13  billion  years,  where  we 
use  1  Mpc  =  3.0857  x  1019  km  and  1  year  =  3.1557  x  107  s.  The  look 
back  time  tL  =  ztH  =  0.325  billion  years. 

If  the  redshift  of  a  galaxy  is  z  =  3.0,  then  the  radial  velocity  is  obtained 

from  —  =  or  Vr  =  0.88  c  ~  2.64  x  105  km  s-1,  where  the  speed  of 

C  (z+l)2+l  r  ’  F 

light  c  =  2.9979  x  105  km  s  '.  Assuming  that  the  density  parameter 

_ o 

Qq  k,  10  ,  then  the  look-back  time  for  this  galaxy  is  tL  =  0.83  tH  =  10.8 

billion  years,  and  about  3  billion  years  after  the  big  bang  that  occurred  about 
13.7  billion  years  ago. 


The  Hubble  time  is  the  approximate  time  when  the  expansion  of  the  universe 
began.  Gravity  can  slow  the  expansion  to  a  lesser  age,  while  dark  energy  can 
accelerate  it,  giving  a  greater  age.  But  the  Hubble  time  provides  a  pretty  good 
estimate  of  the  age  of  the  expanding  universe.  Its  beginning  was  not  always  known 
with  such  precision.  For  a  while,  it  looked  as  if  the  Earth  was  older  than  the 
universe  itself,  leading  to  a  new  steady  state  theory  for  the  universe  (Focus  14.3). 
Kragh  (1997)  discusses  the  historical  development  of  the  steady  state  and 
expanding  universe  theories. 


Focus  14.4  How  old  is  the  observable  universe? 

If  the  observable  universe  had  a  beginning,  it  must  have  a  finite  age,  and 
estimates  of  that  age  have  lengthened  as  our  astronomical  knowledge 
improved.  When  the  known  universe  was  confined  largely  to  the  Earth,  an 
age  of  10  million  to  100  million  years  was  inferred  from  the  time  it  would 
take  for  the  planet  to  cool  from  an  initially  molten  state  (Kelvin  1862,  1899; 
Burchfield  1975).  The  discovery  of  radioactivity  then  provided  a  new  source 
of  energy  to  keep  the  Earth  hot  inside  (Rutherford  1905),  and  radioactive 
elements  were  used  to  determine  an  age  of  the  oldest  rocks  on  the  Earth,  first 
at  2  to  3  billion  years  (Rayleigh  1905;  Boltwood  1907;  Rutherford  1929; 
Holmes  1949)  and  eventually  at  4.6  billion  years  old  (Patterson  1956). 

When  it  was  realized  that  subatomic  nuclear  energy  keeps  the  stars  shining, 
an  evaluation  of  the  their  nuclear-reaction  rates  indicated  stellar  lifetimes  of 
up  to  10  billion  years  (Schonberg  and  Chandrasekhar  1942).  Ages  between 
10  billion  and  20  billion  years  were  inferred  from  studies  of  the  chemical 
evolution  of  our  Galaxy  (Fowler  and  Hoyle  1960;  Tinsley  1975)  as  well  as  the 
thermonuclear  evolution  of  stars  in  globular  star  clusters  (Sandage  1970; 
Hesser  et  al.  1987;  Chaboyer  1995). 
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Another  way  of  estimating  the  age  of  the  known  universe  is  to  watch  the 
expanding  galaxies.  They  are  moving  away  from  a  beginning,  a  cosmic 
horizon,  which  is  as  far  away  as  we  ever  can  see.  The  distance  between  there 
and  now,  as  well  as  the  age  of  the  observable  universe,  is  estimated  by 
calculating  how  long  it  has  taken  for  the  galaxies  to  move  this  far  at  their 
observed  speeds. 

If  the  expanding  universe  has  been  moving  along  at  a  steady  pace,  then  its 
age  can  be  determined  from  the  present  rate  of  expansion.  That  rate  is 
quantified  by  the  current  value  of  the  Hubble  constant,  the  reciprocal  of 
which  provides  an  expansion  age.  The  currently  accepted  value  of 
75  km  s'1  Mpc-1  corresponds  to  an  expansion  age  of  13.0  billion  years. 
A  lower  value  of  the  Hubble  constant  would  imply  that  the  universe  is 
expanding  now  at  a  slower  rate  and  has  a  greater  age.  A  higher  value  of  the 
constant  implies  a  more  rapid  expansion  and  that  less  time  has  elapsed  since 
the  expansion  started. 

The  universe  may  not  have  been  moving  apart  with  an  unchanging  speed. 
Its  expansion  would  be  slowed  by  the  mutual  gravitational  pull  of  all  of  the 
dark  matter  in  the  universe  or  accelerated  by  anti-gravitational  forces. 
Moreover,  today’s  value  of  the  Hubble  constant  has  always  been  imprecise 
due  to  the  streaming  motions  of  nearby  galaxies,  which  are  not  part  of  their 
expansion,  and  by  uncertain  distances  for  remote  galaxies. 

Hubble’s  measurements  of  the  constant  that  now  bears  his  name,  pegged 
at  530  in  the  typical  units,  caused  quite  a  problem.  It  corresponds  to  an 
expansion  age  of  only  1.8  billion  years.  At  about  the  same  time  that  Hubble 
was  making  his  observations,  Ernest  Rutherford  (1871-1937)  and  his  col¬ 
leagues  at  the  Cavendish  Laboratory  in  England  used  radioactive  dating  to 
estimate  an  age  of  at  least  3.4  billion  years  for  the  oldest  terrestrial  rocks 
(Rutherford  1929).  How  could  the  Earth  be  older  than  the  expanding 
universe? 

Arthur  Stanley  Eddington  (1882-1944)  was  not  at  all  troubled  by  the 
discrepancy.  He  simply  extended  the  beginning  of  the  universe  farther  back  in 
time  than  the  start  of  its  expansion  (Eddington  1933).  In  this  interpretation,  the 
world  once  existed  in  an  unmoving  state  described  by  the  cosmological 
constant  that  Albert  Einstein  (1879-1955)  previously  introduced  to  stop  the 
eventual  collapse  of  a  static,  nonexpanding  universe  (Einstein  1917a,  b). 
An  initial  nonmoving  universe  of  an  indeterminate  age  could  be  sent  into 
unrestrained  expansion  whenever  we  choose  by  the  slightest  disturbance, 
which  upsets  the  balance  between  gravitational  attraction  and  cosmological 
repulsion.  For  Eddington,  a  disrupting  disturbance  about  2  billion  years  ago 
caused  a  slight  expansion  that  thinned  out  the  universe,  making  it  less  able  to 
resist  the  cosmic  repulsion,  and  the  runaway  expansion  of  the  galaxies  began. 

Three  young  scientists  at  Cambridge  University  -  Hermann  Bondi 
(1919-2005),  Thomas  Gold  (1920-2004),  and  Fred  Hoyle  (1915-2001)  - 
noticed  that  we  can  adjust  the  cosmological  constant  to  accommodate  almost 
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any  related  observation  and  the  theory  thereby  lost  its  simplicity  and 
uniqueness.  So  the  trio  proposed  a  universe  that  had  no  beginning  (Bondi 
and  Gold  1948;  Hoyle  1948).  The  cosmos,  they  proclaimed,  may  have 
always  existed,  presenting  an  unchanging  steady  state  on  the  largest  scales  of 
space  and  time.  We  would  no  longer  have  to  attribute  the  observable  uni¬ 
verse  to  a  past  creation  that  was  inaccessible  to  scientific  scrutiny  or 
understanding. 

They  acknowledged  the  inescapable  fact  that  the  galaxies  are  moving 
apart  but  supposed  that  they  have  always  been  doing  so,  while  new  matter  is 
being  created  continuously  at  an  average  rate  of  just  one  hydrogen  atom  per 
cubic  meter  of  space  per  year.  That  is  just  sufficient  to  counteract  the  dis¬ 
persal  and  thinning  out  of  the  expanding  universe  and  keep  the  overall 
universe  unchanged  with  time. 

The  age  problem  that  led  to  serious  consideration  of  the  Steady  State 
Theory  was  partially  resolved  by  Walter  Baade  (1893-1960)  during  World 
War  II  (1939-1945).  As  luck  would  have  it,  Eastman  Kodak  had  just 
developed  a  red-sensitive  emulsion  for  wartime  reconnaissance.  Pushing  the 
2.5  m  (100  inch)  telescope  to  its  very  limits,  Baade  used  the  red-sensitive 
plates  to  resolve  the  nucleus  of  the  nearby  Andromeda  galaxy,  distinguishing 
individual  red  giant  stars  in  the  crowded  center.  His  measurements  indicated 
that  these  red-colored  stars  were  similar  to  those  found  in  the  globular 
clusters  of  our  own  Galaxy,  but  different  from  the  highly  luminous  blue- 
colored  stars  found  in  the  outer  arms  of  both  the  Milky  Way  and 
Andromeda. 

When  the  200  inch  telescope  on  nearby  Palomar  Mountain  began  oper¬ 
ation,  in  1948,  Baade  continued  his  investigations  of  the  red  and  blue  kinds 
of  stars,  and  concluded  that  they  obey  different  period-luminosity  relation¬ 
ships.  This  meant  that  Hubble  had  been  confused  when  applying  the  distance 
calibration  of  Cepheid  variable  stars  in  globular  clusters  to  the  other  types  of 
variables  in  the  arms  of  nearby  spiral  nebulae.  So  Baade  made  the  necessary 
corrections,  obtaining  a  distance  of  about  2  million  light-years  for 
Andromeda,  which  reduced  the  Hubble  constant  by  about  half  and  enlarged 
both  the  scale  and  age  of  the  expanding  universe  by  a  factor  of  about  two 
(Baade  1952). 

Although  a  consummate  observer,  Baade  tended  to  avoid  detailed 
analysis  and  written  accounts  of  his  discoveries.  So  it  was  fortunate  that 
Henrietta  H.  Swope  (1902-1980),  the  daughter  of  the  wealthy  president  of 
the  General  Electric  Company,  joined  Baade  to  assist  with  the  analysis  of  his 
excellent  photographs.  They  used  the  results  to  propose  a  downward  revision 
of  the  Hubble  constant  to  a  value  of  100,  in  the  usual  units  (Baade  and 
Swope  1955).  Allan  Sandage  (1926-2010)  then  continued  to  correct  for 
Hubble’s  mistaken  identification  of  the  brightest  stars  in  Andromeda;  in 
1958,  he  announced  that  the  elusive  constant  had  a  value  of  75  (Sandage 
1958).  Using  Sandage’s  measurements,  the  expansion  age  is  about  13  billion 
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years.  Therefore,  the  universe  was  considerably  older  than  the  oldest  ter¬ 
restrial  rocks,  and  scientists  were  reassured  that  the  observable  universe  had 
a  definite  beginning. 


Estimates  of  the  age  and  size  of  the  observable  universe  haven’t  changed  all 
that  much  over  the  past  decades,  but  the  uncertainties  have  become  smaller.  The 
Hubble  Space  Telescope  and  the  Spitzer  Space  Telescope  have  been  used  to  refine 
estimates  of  Hubble’s  constant  by  observing  Cepheid  variable  stars,  obtaining 
values  of  72  ±  8  (Freedman  et  al.  2001),  H0  =  73.8  d=  2.4  km  s-1  Mpc-1  (Reiss 
et  al.  2011)  and  H0  =  74.3  ±2.1  km  s-1  Mpc-1  (Freedman  et  al.  2012). 

Recent  estimates  indicate  that  everything  we  know  about,  the  whole  observable 
universe,  is  not  older  than  13.7  billion  years.  All  that  we  can  observe  has  a  history 
that  can  be  traced  back  to  that  beginning,  but  most  of  what  we  see  is  significantly 
younger.  Perhaps  the  most  interesting  consequences  are  that  the  observable  uni¬ 
verse  is  about  three  times  older  than  the  Earth,  and  that  the  light  we  see  from  the 
most  distant  galaxies  was  emitted  before  our  solar  system  existed. 

The  exact  mathematical  equations  for  the  look-back  time  can  be  complicated, 
and  the  uncertainties  increase  at  larger  redshifts.  Nevertheless,  the  uncertainties  in 
the  distances  and  look-back  times  are  no  larger  than  those  caused  by  our  imprecise 
knowledge  of  the  Hubble  constant.  Even  at  a  redshift  z  =  5.0,  amongst  the  largest 
ones  observed,  the  look-back  time  ranges  from  10  to  15  billion  years  depending  on 
the  choice  of  the  Hubble  constant  and  the  mass  density  of  the  universe.  So  we  can 
ignore  the  effects  of  space  curvature  in  most  practical  computations. 

Throughout  most  of  the  past  decades,  it  has  been  assumed  that  it  is  the  mass  of 
the  universe  that  curves  its  shape,  establishes  its  geometry  and  determines  its  fate. 
Under  this  assumption,  which  ignores  the  more  recent  discovery  of  dark  energy, 
the  mass  density  of  galaxies,  pG ,  determines  the  ultimate  destiny  of  the  universe. 
If  this  mass  density  exceeds  a  certain  critical  value,  pc,  then  gravity  will  eventually 
overcome  expansion.  Imagine  the  most  distant  galaxy  with  mass,  mG,  distance  DG , 
and  velocity  VG  =  H0DG.  Gravity  will  just  balance  the  expansion  of  this  galaxy  if 
its  kinetic  energy  of  expansion  is  equal  to  the  gravitational  potential  energy  of  all 
of  the  rest  of  the  universe,  or  if: 


mGVc  _  mGH%D2G  _  GmGMG 
2  2  Dg 


(14.53) 


where  Mv  is  the  total  mass  of  all  the  rest  of  the  universe  inside  distance  DG  and  the 

_  11  O  _  1  _ o 

gravitational  constant  G  =  6.674xl0  mkg  s  A  In  other  words,  the 
velocity,  VG,  of  the  most  distant  galaxy  is  just  equal  to  the  escape  velocity,  Vesc,  of 

the  entire  universe,  Vesc  =  (2 GMV /DG)1^2 . 

Collecting  terms  we  obtain  a  critical  mass  density  of: 
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10“26/i2kgm“J  «  1.0  x  10“2bkgm“J  (14.54) 


26 


where  the  Hubble  constant  H0  =  100  h  km  s  1  Mpc  ',  one  parsec  =  1  pc  = 
3.0857  x  10  m,  1  Mpc  =10  pc,  and  the  Newtonian  gravitational  constant 
G  =  6.673  x  10“n  N  m2  kg-2.  For  h  =  0.75,  we  have  pc  «  10-26  kg  m-3. 


14.6  Using  Einstein’s  General  Theory  of  Relativity 
to  Explain  the  Expansion 

Edwin  Hubble  (1889-1953)  relied  solely  on  the  power  of  observation,  preferring 
to  avoid  what  he  called  “the  dreamy  realms  of  speculation”  (Hubble  1936),  most 
likely  referring  to  theoretical  physicists.  Although  an  expanding  universe  was  a 
possible  consequence  of  Albert  Einstein’s  (1879-1955)  General  Theory  of  Rela¬ 
tivity  (Einstein  1917a,  b),  Hubble  thought  that  such  models  were  a  forced  inter¬ 
pretation  of  the  observational  results.  Even  as  late  as  1953,  shortly  before  his 
death,  Hubble  insisted  that  his  law  should  be  formulated  as  an  empirical  relation 
between  observed  data  (Hubble  1953).  The  Belgian  astrophysicist  and  Catholic 
priest,  Georges  Lemaitre  (1894-1966)  had  nevertheless  already  interpreted  the 
radial  velocities  of  spiral  nebulae  in  terms  of  Einstein’s  theory  in  1927. 

After  being  ordained  a  priest  in  1923,  Abbe  Lemaitre  spent  a  year  at  Cambridge 
University  as  a  student  of  Arthur  Stanley  Eddington  (1882-1944),  reviewing 
Einstein’s  General  Theory  of  Relativity,  and  during  the  next  2  years,  Lemaitre 
studied  at  the  Massachusetts  Institute  of  Technology  and  worked  with  Harlow 
Shapley  at  the  nearby  Harvard  College  Observatory. 

The  Belgian  cleric  also  toured  the  country,  meeting  Vesto  Slipher  (1875-1969) 
at  the  Lowell  Observatory  in  Flagstaff,  Arizona,  and  Edwin  Hubble  at  the  Mount 
Wilson  Observatory,  California.  As  a  result,  he  learned  all  about  the  latest  mea¬ 
surements  of  the  redshifts,  or  radial  velocities,  of  spiral  nebulae,  interpreting  them 
as  a  cosmic  effect  of  the  expansion  of  the  universe,  all  in  accordance  with  the 
relativity  theory,  and  additionally  derived  a  theoretical  expression  for  the  linear 
increase  of  their  velocities  with  distance  (Lemaitre  1927). 

But  we  now  see  this  in  hindsight,  and  in  1927  most  scientists  were  not  even 
aware  of  the  observational  support  for  an  expanding  universe.  Lemaitre  also 
published  his  interpretation  in  a  fairly  obscure  journal,  the  Annales  de  la  Societe 
Scientifique  de  Bruxelles.  So  practically  no  one  was  aware  of  his  findings,  which 
either  went  unnoticed  or  were  ignored.  At  this  time,  no  other  astronomer  or 
physicist  had  used  the  General  Theory  of  Relativity  to  explain  the  observed  uni¬ 
verse,  and  it  wasn’t  until  a  few  years  later  that  Eddington  sponsored  an  English 
translation  of  Lemaitre’ s  paper  in  the  Monthly  Notices  of  the  Royal  Astronomical 
Society  (Lemaitre  1931,  b,  c).  In  English,  the  title  read  A  Homogeneous  Universe 
of  Constant  Mass  and  Increasing  Radius  Accounting  for  the  Radial  Velocity  of 
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Time,  t 

Fig.  14.19  Size  of  the  expanding  universe  Schematic  representation  showing  the  size,  or  scale 
factor  R(t),  of  the  expanding  universe  as  a  function  of  time,  t.  The  approximate  age  since  the 
expansion  began  is  given  by  1  /H0  where  H0  is  the  Hubble  constant.  Three  models  describe  future 
possibilities  for  the  universe  with  no  dark  energy  or  cosmological  constant.  It  can  become  closed 
with  positive  space  curvature,  denoted  by  space  curvature  constant  k  —  +1,  forever  open  with  no 
space  curvature  and  described  by  Euclidean  space  with  k  =  0,  or  always  open  with  negative 
space  curvature  constant  k  =  —  1 


Extra- Galactic  Nebulae .  Since  he  was  the  first  to  relate  the  observed  galaxy 
motions  to  the  theory,  Lemaitre’s  solution  was  widely  heralded  for  its  novelty  and 
it  led  to  greater  appreciation  of  the  significance  of  Hubble’s  discovery. 

We  cannot  tell  exactly  how  far  away  the  distant  galaxies  are,  or  precisely  how 
long  ago  their  radiation  began  its  journey,  until  we  understand  the  curvature  of 
space.  If  space  is  curved,  the  path  through  space  might  be  noticeably  longer  than 
that  expected  in  flat,  un-curved  space,  with  larger  distances  and  greater  look-back 
times.  The  detailed  expressions  for  the  distance  and  look-back  time  therefore 
depend  upon  the  amount  of  space  curvature,  or  the  mass  density  of  the  universe,  as 
well  as  the  redshift. 

Schneider  (2006)  provides  a  nice  textbook  of  extragalactic  astronomy  and 
cosmology,  whereas  Sandage  (1988)  and  Lang  (1999)  have  reviewed  observa¬ 
tional  tests  of  world  models. 

The  line  element  or  metric,  ds ,  for  a  homogenous,  isotropic  expanding  universe 
is  the  Robertson- Walker  metric  (Robertson  1935,  1936;  Walker  1936): 


ds 2  =  c2dt 2 


dr 2 

1  —  kr 2 


+  r2  (dO2  +  sin2 6d(j)2)  , 


(14.55) 


where  the  space  curvature  constant  k  =  —1,0  and  +1,  and  R(t )  is  called  the  radius 
of  curvature  or  the  scale  factor  of  the  universe  (Fig.  14.19).  The  r  coordinate  has 
zero  value  for  some  arbitrary  fundamental  observer,  the  surface  r  =  constant  has 
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the  geometry  of  the  surface  of  the  sphere,  6,  cj)  are  polar  coordinates,  t  is  the  time 
coordinate  and  c  is  the  speed  of  light. 

If  light  was  emitted  from  an  extragalactic  object  at  time  te,  then  its  redshift  z  is 
given  by: 


1  +z  = 


R(te )  ’ 


(14.56) 


where  the  subscript  zero  is  used  to  denote  the  present  epoch,  and  R(t0 )  is  the 
present  value  of  R(t )  at  time  f0.  As  previously  mentioned,  the  observed  wave¬ 
length,  20,  of  a  spectral  line  emitted  by  an  extragalactic  object  is  longer  than  the 
emitted  wavelength,  Ae,  with  a  redshift  z  defined  by: 


z 


^0  —  ^ e 


(14.57) 


At  any  cosmic  time  t ,  we  can  define  a  Hubble  expansion  parameter,  H{t),  by: 


R(t)  dR{t)/dt 

W)=  m 


(14.58) 


where  the  •  denotes  differentiation  with  respect  to  time,  a  deceleration  parameter, 
q(t )■ 


{  mm  =  _±m 

R2(t)  HoSM 

and  a  density  parameter,  Q(t),  by: 


$>nGp(t) 

3H2(t) 


(14.59) 


(14.60) 


for  mass  density  p{t). 

For  a  homogeneous,  isotropic  expanding  universe  of  mass-energy  density  p  and 
zero  cosmological  constant  A  =  0  (the  cosmological  constant  is  discussed  in  Sect. 
15.6.2),  we  obtain: 


871 G  kc 2 

~p~W{t ) 


for  A  =  0, 


(14.61) 


where  G  is  the  gravitational  constant,  and: 


m 

R(t) 


for  A  =  0, 


(14.62) 


where  P  is  the  pressure. 

These  two  equations  are  sometimes  called  the  Friedmann  equations,  since  the 
Russian  mathematician  Aleksandr  Friedmann  (1922-1924)  first  derived  them,  but 
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he  had  no  idea  that  they  might  apply  to  the  observable  universe  (Friedmann  1922, 
1924).  They  can  be  rearranged  to  imply  a  conservation  of  energy  relation: 


(pc2+P). 


(14.63) 


At  the  present  time,  denoted  by  t0,  the  expansion  parameter  is  the  Hubble 
constant  H(t0 )  =  H0  =  100  h  km  s-1  Mpc  -1,  with  a  value  of  h  «  0.75.  More¬ 
over,  in  the  present  matter-dominated  era  the  pressure  P0  =  0,  the  radiation  energy 
density  pr(t0)  can  be  omitted  when  compared  to  the  mass  density  pm(t0)  =  p0,  and 
the  deceleration  parameter  is  given  by: 


qo  —  q(t  o) 


4nG  p0 

3HlPo  =  2l Vc 


for  A  =  0, 


the  density  parameter  £20  is  given  by: 


(14.64) 


Q0  =  @(to)  =  —  for  A  =  0,  (14.65) 

Pc 

and  pc ,  the  critical  mass  density  needed  to  close  the  universe  is  given  by: 


Pc 


M 

8tiG 


=  1.879  x  10“26  h2  kgm“J  w  1.0  x  10“zokgm 


26 


(14.66) 


where  the  Hubble  constant  H0  =  100  h  km  s  1  Mpc  1  =  3.24xl0  18  /z  s  ', 

\  f\  6 

one  parsec  =  1  pc  =  3.0857  x  10  m,  1  Mpc  =10  pc,  and  the  Newtonian 
gravitational  constant  G  =  6.693  x  10”11  N  m2  kg-2. 

We  also  have  the  relation: 


kc 2 


For  model  universes  with  zero  cosmological  constant,  or  A  =  0,  we  have  three 
possibilities  with  three  different  values  of  the  space  curvature  constant  k, 
describing  a  closed,  Euclidean  or  open  universe  (Fig.  14.19). 

If  k  =  1  then  q0  >  0.5,  p0  >  pc  and  >  1  for  elliptical  closed  space  and  an 
oscillating  universe  in  which  R(t )  reaches  a  maximum  in  the  future.  The  universe 
would  then  eventually  turn  back  upon  itself  and  reform  the  dense  fireball  of  its 
youth. 

If  k  =  0  then  q0  =  0.5 ,  p0  —  Pc  and  Q0  =  1  for  a  flat,  Euclidean  space  without 
curvature  and  an  ever-expanding  Einstein-De  Sitter  universe  (Einstein  and  De 
Sitter  1932),  in  which  R(t)  continues  to  forever  increase  with  time,  t.  The  universe 
is  then  poised  between  open  and  closed,  right  on  the  dividing  line. 


Hl(2q0  -  1)  for  A  =  0. 


(14.67) 
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If  k  =  —  1  then  0.0  <  q0  <  0.5,  p0  <  pc ,  and  Q0  <  1  for  a  hyperbolic  open 
space  and  an  ever-expanding  Milne  universe  (Milne  1935)  in  which  R(t )  also 
continues  to  forever  increase  with  time,  t.  The  inward  pull  of  gravity  is  too  weak  to 
ever  quell  the  outward  expansion  of  the  universe. 

The  cosmological  constant  has  been  revived  with  the  discovery  of  mysterious 
dark  energy  that  is  now  accelerating  the  expansion  of  the  universe,  and  the 
appropriate  adjustments  to  our  equations  are  given  in  Sect.  15.6.2. 


Chapter  15 

Origin,  Evolution,  and  Destiny 
of  the  Observable  Universe 


15.1  Hotter  Than  Anything  Else 

Regardless  of  the  direction  in  which  we  look  out  into  space,  almost  all  of  the 
distant  galaxies  are  flying  apart,  dispersing  and  moving  away  at  speeds  that 
increase  with  their  distance,  as  if  they  had  been  ejected  by  a  cosmic  bomb. 
Astronomers  call  it  the  “big  bang.” 

We  can  envision  this  early  state  by  putting  the  observed  expansion  of  the 
galaxies  in  reverse  and  pushing  the  galaxies  back  closer  together  until  a  time  about 
13.7  billion  years  ago,  when  the  universe  was  incredibly  small  and  all  of  its  mass 
was  compressed  to  a  very  high  density.  It  marks  the  beginning  of  the  observable 
universe,  but  no  one  knows  what  happened  before  the  big  bang  propelled  the 
expanding  universe  into  existence  (Focus  15.1). 


Focus  15.1  Before  the  Big  Bang 

How  did  it  all  begin?  The  existing  theory  falls  apart  at  the  first  crucial 
instant,  at  the  beginning  of  the  big  bang,  and  can’t  be  extended  to  anything 
that  occurred  before  that.  Moreover,  there  is  no  observational  evidence  for 
prior  events. 

Einstein’s  General  Theory  of  Relativity  cannot  be  used  back  then  because 
the  equations  contain  a  singularity  at  the  beginning  of  the  big  bang,  when  a 
non-zero  parameter  is  divided  by  zero  and  conditions  cannot  be  defined. 
So  we’ve  just  pushed  the  mystery  of  the  ultimate  origin  of  the  universe  back 
about  13.7  billion  years,  to  a  point  that  science  cannot  penetrate. 

An  inflation  theory  does  describe  what  could  have  happened  in  the  first 

_ o  c 

fraction  of  a  second  of  the  big  bang,  when  the  universe  was  just  10  s  old. 
Guth  (1981)  and  Linde  (1982)  describe  inflation;  Narlikar  and  Padmanabhan 
(1991)  have  described  inflation  for  astronomers. 

During  inflation  the  universe  was  driven  by  a  repulsive  gravity,  unlike  the 
attracting  kind  we  are  used  to,  and  operated  on  a  very  small  scale  in  both 
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space  and  time,  blowing  the  universe  up,  enlarging  it  by  an  enormous  factor. 
Owing  to  its  inherent  instability,  the  burst  of  inflation  soon  decayed  away 
and  came  to  an  end,  in  a  time  far  less  than  one  second,  releasing  its 
remaining  energy  into  material  particles  and  creating  the  heat  of  the  big 
bang,  the  primeval  fireball. 

This  accelerated  expansion  in  the  first  miniscule  moments  of  the  big  bang, 
this  inflation,  supposedly  obliterated  evidence  of  previous  space,  time, 
energy  and  matter,  erasing  previous  history.  That  cosmic  forgetfulness 
closes  the  door  to  the  very  beginning,  conveniently  avoiding  the  question  of 
ultimate  origins,  the  original  genesis,  and  removing  it  from  any  observa¬ 
tional  consequences. 

In  other  words,  according  to  this  theory  the  big  bang  or  its  immediate 
consequences  destroyed  all  evidence  of  what  came  before.  Or  the  big  bang 
might  have  initiated  time,  on  a  day  without  a  yesterday.  So  there  is  no 
before.  Or  perhaps  the  explanation  lies  outside  space  and  time. 

In  any  event,  the  existing  equations  and  theories  fail  to  explain  how  the 
observed  universe  began.  So  we  still  don’t  know  how  the  universe  came  into 
being,  and  it  remains  a  captivating  mystery. 


Because  gases  become  hotter  when  they  are  compressed  and  cool  when  they 
expand,  the  observed  universe  must  have  been  incredibly  hot  in  its  earliest,  most 
compact  state.  As  we  look  back  in  time,  at  the  most  distant  regions,  the  universe 
becomes  increasingly  hot,  eventually  becoming  so  exceptionally  hot  that  radiation 
was  the  most  powerful  force,  dominating  the  expansion  of  the  universe. 

In  the  earliest  moments  of  the  big  bang,  there  were  no  stars  or  galaxies,  only 
intense  radiation  and  subatomic  particles  from  which  the  material  universe  sub¬ 
sequently  grew.  During  this  hot  beginning,  matter  was  then  being  created  by 
radiation  and  vice  versa.  Some  of  the  incredibly  energetic  radiation  was  being 
transformed  into  electrons  and  their  anti-matter  counterparts,  the  positrons  or 
positive  electrons,  and  just  as  often  an  electron  would  collide  with  a  positron  to 
make  radiation  again. 

If  we  let  y  denote  a  photon  of  the  energetic  radiation,  or  a  gamma  ray,  then  the 
equilibrium  condition  can  be  written  in  short  hand  notation  as: 

y  +  y  <=*  e~  +  e+,  (15.1) 

where  e~  denotes  an  electron  and  e+  denotes  a  positron,  the  anti-matter  particle  of 
the  electron.  The  double  arrow  means  that  the  reaction  goes  in  both  directions  at 
the  same  rate.  The  forward  process,  from  left  to  right,  is  known  as  electron- 
positron  pair  creation ,  and  the  reverse  one  is  called  pair  annihilation. 

Neutrons  and  protons  were  also  around,  and  these  subatomic  particles  would 
also  turn  back  and  forth  into  each  other,  through  reactions  that  included  electron 
neutrinos  and  electron  antineutrinos,  as  well  as  electrons  and  positrons.  The 
equilibrium  between  radiation  and  these  subatomic  particles  continued  as  long  as  it 
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was  hot  enough  to  create  positrons;  however,  the  radiation  quickly  cooled  as  the 
result  of  the  expansion  of  the  universe  into  a  greater  volume.  Electron-positron 
pair  creation  continued  only  as  long  as  the  thermal  energy  of  the  radiation,  kTr(t ), 
with  radiation  temperature  Tr{t)  at  time  t,  was  greater  than  the  rest  mass  energy  of 
the  electron,  or  for  kTr(t)  >  rnec  .  Here  the  Boltzmann  constant  k  =  1.38065  x 

_ O  Q  _  1  _ Q  1 

10  J  K  ,  the  mass  of  the  electron  is  me  =  9.1094  x  10  kg,  and  the  speed 

o  _ 1 

of  light  c  =  2.9979  x  10  m  s  .  Using  these  constants,  the  equilibrium  stopped 
when  the  radiation  cooled  to  a  temperature  of  Tr(t)  =  5.93  x  109  K.  After  that,  no 
more  positrons  were  made  and  the  leftover  positrons  were  then  consumed  by 
interactions  with  electrons.  The  neutrons,  protons,  and  electrons  that  remained 
eventually  gathered  together  to  create  the  material  universe  we  have  today. 

In  the  early  stages,  the  radiation  controlled  the  expansion,  with  an  effective 
energy -mass  density,  pr(t ),  at  time,  t ,  given  by: 

pr(t)  =  «  0.842  x  1(T32  TA(t)  kgm-3,  (15.2) 

cz 

where  the  radiation  constant  a  =  7.5657  x  10  16  J  m  3  K  4,  and  we  have  divi- 

o  _ 1 

ded  by  the  square  of  the  speed  of  light,  c  =  2.9979  x  10  m  s  ,  to  convert  the 
energy  density  into  an  equivalent  mass  density. 

If  we  denote  the  effective  mass  of  the  radiation  as  Mr(t),  assume  the  universe  is 
expanding  at  a  constant  velocity  V  =  R(t)  / 1  for  radius  R(t)  at  time  t,  and  equate  the 
kinetic  energy  of  expansion  to  the  gravitational  potential  energy  of  the  radiation,  then 


1 

2 


Mr{t) 


m 


(15.3) 


or 


Mr(t) 


R3(t) 

2  Gt2  ’ 


(15.4) 
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where  the  gravitational  constant  G  =  6.674  x  10  m  kg  s  and 


Mr{t)=^R3(t)p,,{t)=^R3{t) 


(15.5) 


Collecting  terms,  we  obtain  an  approximate  equation  for  the  radiation  tem¬ 
perature  Tr(t)  at  time,  t ,  as  long  as  the  radiation  is  controlling  the  expansion. 


Tr(t) 


'3  c2Mr{t) 
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where  the  time  t  is  in  seconds  for  the  numerical  approximation,  or  equivalently 

1 
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The  radiation  temperature  has,  for  example,  dropped  to  5.93  x  109  K  in  just 
about  13  s  after  the  big  bang. 

In  the  early  stages,  the  radiation  controlled  the  expansion  of  the  universe 
because  it  was  incredibly  hot.  Just  1  s  after  the  big  bang,  the  radiation  had  a 
temperature  of  about  20  billion,  or  2  x  1010  K.  As  the  universe  continued  to 
expand,  the  radiation  steadily  cooled,  and  the  matter  eventually  took  over  the 
expansion.  However,  the  big  bang  was  so  intense  and  so  hot  that  we  are  still 
immersed  within  the  radiation. 


15.2  Three  Degrees  Above  Absolute  Zero 
15.2.1  An  Unexpected  Source  of  Noise 

The  discovery  of  the  faint  afterglow  of  the  big  bang  was  a  serendipitous  event, 
involving  a  horn-reflector  antenna  that  had  been  used  at  the  Bell  Telephone 
Laboratories  in  the  first  tests  of  a  communication  satellite.  Arno  Penzias  (1933-  ) 
and  Robert  Wilson  (1936-  )  were  measuring  the  temperatures  of  noises  in  the 
horn-antenna  system  so  they  could  make  accurate  measurements  of  the  intensity  of 
several  extragalactic  radio  sources.  A  persistent,  ubiquitous,  and  unvarying  noise 
source  was  detected  at  a  signal  frequency  of  4,080  MHz  =  4.08  x  109  Hz,  or  a 
wavelength  of  7.35  cm,  contributing  an  antenna  temperature  of  only  3  degrees 
above  zero,  or  about  3  K.  It  was  equally  strong  in  all  directions,  wherever  the 
antenna  was  pointed,  independent  of  the  time  of  day  and  year  and  with  no 
dependence  on  the  location  of  any  known  cosmic  radio  source. 

Penzias  and  Wilson  did  not  know  what  they  had  found  and  avoided  any 
mention  of  the  cosmological  implications  in  their  publication  (Penzias  and  Wilson 
1965),  which  had  the  modest  title,  A  Measurement  of  Excess  Antenna  Temperature 
at  4,090  MHz .  However,  a  group  at  Princeton  University,  which  was  attempting  to 
make  a  similar  measurement  at  the  time,  drew  attention  to  the  implications  in  a 
companion  paper  (Dicke  et  al.  1965).  The  unexpected  source  of  noise  was  the 
faint,  cooled  relic  of  the  hot  big  bang,  now  known  as  the  three-degree  cosmic 
microwave  background  radiation ,  because  it  has  a  temperature  of  about  3  K  and  it 
originated  before  the  stars  and  galaxies  were  formed,  lying  behind  them. 

This  particular  discovery  was  not  entirely  unanticipated.  In  the  late  1940s  and 
early  1950s,  George  Gamow  (1904-1968),  Ralph  A.  Alpher  (1921-2007),  James 
W.  Follin  (1919-2007),  and  Robert  C.  Herman  (1914-1997)  had  speculated  that 
the  1 -billion-degree,  or  109  K,  radiation  of  the  early  universe  would  have  cooled  to 
about  5  K  during  the  past  billions  of  years  of  expansion,  but  nobody  had  attempted 
to  observe  the  relic  radiation  (Gamow  1948,  1956;  Alpher  and  Herman  1948). 
Penzias  and  Wilson  were  also  unaware  of  this  previous  calculation  until  after  their 
discovery.  They  received  the  1976  Nobel  Prize  in  Physics  for  their  discovery  of  the 
cosmic  microwave  background  radiation. 
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15.2.2  Blackbody  Spectrum 

At  the  high  temperatures  during  the  early  history  of  the  expanding  universe,  the 
radiation  and  subatomic  particles  frequently  interacted,  achieving  thermal  equi¬ 
librium  characterized  by  single  temperature.  Later,  when  the  universe  thinned  out 
and  cooled  by  expanding  into  a  greater  volume,  the  matter  and  radiation  quit 
interacting,  going  their  separate  ways.  However,  the  radiation  would  have  retained 
its  thermal  nature  as  it  cooled  and  the  temperature  slowly  decreased. 

A  perfect  thermal  radiator  is  known  as  a  blackbody ,  which  absorbs  all  thermal 
radiation  falling  on  it  and  reflects  none  -  hence,  the  term  black.  The  distribution  of 
the  radiation  emitted  by  the  blackbody,  its  spectrum,  peaks  at  a  wavelength  that  is 
inversely  proportional  to  the  temperature,  dropping  precipitously  at  shorter 
wavelengths  and  falling  off  gradually  at  longer  ones  (Sect.  2.4). 

The  expansion  of  the  universe  preserves  the  blackbody  spectrum  of  the  radia¬ 
tion  for  all  time.  No  process  can  destroy  its  shape,  but  the  location  of  maximum 
intensity  will  stretch  to  increasingly  longer  wavelengths  as  time  goes  on  and  the 
radiation  gets  colder.  The  wavelength  of  peak  intensity,  kmax,  is  inversely 
proportional  to  the  radiation  temperature,  T,  and  the  Wien  displacement  law 
(Sect.  2.4)  specifies  that  wavelength  as  Xmax  =  0.0029 /T  m.  In  the  present  epoch, 
with  a  temperature  of  only  about  3°  above  absolute  zero,  or  at  3  K,  the  blackbody 
radiation  intensity  peaks  at  a  wavelength  of  about  0.001  m  or  0.1  cm.  Unfortu¬ 
nately,  the  Earth’s  atmosphere  absorbs  cosmic  radiation  at  this  short  wavelength 
where  the  most  intense  radiation  occurs. 

The  definitive  spectral  measurements  therefore  had  to  be  made  from  above  the 
atmosphere  using  NASA’s  COsmic  Background  Explorer  ( COBE ),  launched  on 
18  November  1989.  Less  than  two  months  after  COBE  went  into  orbit,  but  a 
quarter-century  after  the  discovery  of  the  cosmic  radiation,  John  C.  Mather 
(1946-  )  reported  the  combined  results  of  millions  of  COBE  spectral  measure¬ 
ments  at  an  American  Astronomical  Society  meeting  near  Washington,  DC.  The 
spectrum  fit  the  Planck  blackbody  curve  with  a  precision  of  1  part  in  10,000 
(Fig.  15.1),  establishing  a  temperature  of  precisely  2.725  K,  with  an  uncertainty  of 
0.002  K  (Mather  et  al.  1994). 

Such  a  thermal  spectrum  could  not  have  happened  in  the  universe  as  it  is  now. 
Matter  currently  has  a  very  different  temperature  than  the  background  radiation. 
In  other  words,  the  observed  spectrum  is  proof  that  the  observable  universe  must 
have  expanded  from  a  very  hot,  dense  state  in  the  past,  when  matter  and  radiation 
were  in  thermal  equilibrium  and  at  the  same  temperature. 

Every  part  of  space  now  is  filled  with  background  radiation.  From  its  temperature, 
of  T  =  2.725  K  and  we  can  specify  the  frequency  vmax  from  the  Wien  displacement 
law  in  frequency  (Sect.  2.4)  as  vmax  «  2.%kT/h  «  5.88  x  1010  T  «  1.60  x  1011  Hz, 

_ o  o  _ 1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  ~  JK  and  the  Planck  constant 
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Fig.  15.1  Spectrum  of  the  cosmic  microwave  background  radiation  The  intensity  of  the 
cosmic  microwave  background  radiation  plotted  as  a  function  of  wavelength.  This  thermal 
radiation  was  formed  about  390,000  years  after  the  big  bang,  which  occurred  about  14  billion 
years  ago.  The  observed  radiation  has  a  nearly  perfect  blackbody  spectrum.  Pioneering 
measurements  by  Arno  A.  Penzias  (1933-  )  and  Robert  W.  Wilson  (1936-  )  in  1965  and  Peter  G. 
Roll  (1935-  )  and  David  T.  Wilkinson  (1935-2002)  in  1966,  at  7.35  and  3.0  cm  wavelength, 
respectively,  are  compared  to  the  expected  spectrum  of  a  three-degree  blackbody  and  radiation 
from  our  Galaxy  (bottom).  The  full  spectrum  at  millimeter  wavelengths  (top)  was  obtained  from 
instruments  aboard  the  COsmic  Background  Explorer  (COBE)  in  late  1989.  These  data  are  so 
accurate  that  the  error  bars  of  the  individual  points  all  lie  within  the  width  of  the  plot  curve.  This 
solid  line,  which  matches  the  shape  and  peak  location  of  the  observed  data,  corresponds  to  a 
thermal  radiator,  or  blackbody,  with  a  temperature  of  2.725  K 


^A 

h  =  6.6261  x  10  J  s.  The  number  density  NCmb  of  photons  in  the  background 
radiation  can  be  determined  from: 
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aTa 

Ncmb  =  7 - ~  4  x  108  m  3,  (15.8) 

ft  ^  max 

where  the  radiation  energy  density  is  aT4,  the  radiation  constant 
a  =  7.5656  x  10  16  J  m  3  K  4,  and  hvmax  is  the  photon  energy.  Thus,  every 
cubic  meter  of  space  in  the  observable  universe  contains  about  a  half  billion 
photons  of  the  cosmic  microwave  radiation.  These  are  tiny  bundles  of  radiation 
energy  that  originated  about  14  billion  years  ago. 


15.2.3  As  Smooth  as  Silk 

What  alerted  astronomers  to  the  importance  of  the  background  radiation  was  its 
equal  brightness  wherever  one  looked,  indicating  that  it  uniformly  fills  all  of  space 
(Wilson  and  Penzias  1967).  This  spatial  isotropy  satisfied  one  of  the  basic  tenets  of 
modern  cosmology,  the  cosmological  principle ,  which  asserts  that  except  for  local 
irregularities,  the  universe  presents  the  same  aspect  from  every  point. 

But  the  radiation  seemed  too  uniform.  The  COBE  instruments  could  detect  no 
regions  brighter  than  others  to  0.0003  K,  or  1  part  in  10,000,  on  angular  scales 
from  minutes  of  arc  to  180°.  Yet,  the  background  radiation  ought  to  have  con¬ 
centrations  in  it,  which  acted  as  seeds  for  the  subsequent  formation  of  the  material 
universe.  They  must  have  acted  as  a  template  or  blueprint,  encoding  the  infor¬ 
mation  required  to  explain  the  subsequent  formation  of  stars  and  galaxies. 


15.2.4  Cosmic  Ripples 

In  1992,  George  Smoot  (1945- )  and  his  colleagues  announced  measurements  of  the 
temperature  fluctuations  in  the  cosmic  microwave  background  radiation  using  four 
years  of  data  gathered  by  COBE  (Smoot  et  al.  1992;  Bennett  et  al.  1993,  1996). 
Aftersubtracting  the  known  microwave  emission  of  the  Milky  Way  and  using 
mathematical  averaging  techniques  on  about  100  million  observations,  the  COBE 
team  found  that  the  temperature  varies  ever  so  slightly  over  large  angular  sizes.  The 
sensitive  instrument  detected  minute  temperature  differences  no  larger  than  a  hun¬ 
dred-thousandth,  or  10-5  K.  Mather  and  Smoot  were  awarded  the  2006  Nobel  Prize 
in  Physics  for  their  discovery  of  the  blackbody  form  and  anisotropy  of  the  cosmic 
microwave  background  radiation. 

COBE  was  pushed  to  the  limits  of  its  sensitivity,  with  evidence  that  wasn’t  quite 
definitive,  mainly  because  it  mapped  the  cosmic  radiation  with  coarse  angular 
resolution  greater  than  7°.  In  the  subsequent  decade,  more  than  20  experiments 
were  therefore  carried  out  from  the  ground  and  balloon  platforms,  bringing  the 
temperature  fluctuations  into  shaper  focus  with  angular  resolutions  as  fine  as  a  few 
minutes  of  arc.  However,  the  ground-based  and  balloon  experiments  only 
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Fig.  15.2  Maps  of  the  infant  universe  An  all-sky  view  of  the  three-degree  cosmic  microwave 
background  radiation  emitted  from  the  universe  in  its  infancy,  just  390,000  years  after  the  big 
bang  that  occurred  13.7  billion  years  ago.  The  data,  taken  in  2003  from  the  Wilkinson  Microwave 
Anisotropy  Probe  (WMAP)  are  shown  here  after  seven  years  of  data  analysis.  The  temperature 
fluctuations  range  up  to  0.0002K  above  and  below  the  average  value.  Darker  regions  are  cooler 
and  lighter  regions  are  hotter.  These  temperature  fluctuations  provided  the  seeds  from  which 
galaxies  subsequently  grew.  (Courtesy  of  the  NASA/ COBE  and  NASA  /WMAP  science  teams.) 


glimpsed  small  portions  of  the  sky  for  a  limited  time,  so  there  was  a  possibility  that 
they  might  not  be  truly  representative  of  all  the  background  radiation. 

It  was  therefore  time  for  another  satellite  experiment  that  would  scan  the  entire 
sky  without  the  confusion  of  microwave  radiation  from  the  atmosphere  and 
ground.  This  time,  the  spacecraft  would  not  only  detect  the  cosmic  ripples;  it 
instead  would  determine  their  distribution  and  characteristic  sizes,  filling  in  the 
gaps  between  the  large  features  seen  with  COBE  and  the  smaller  features  detected 
by  other  instruments. 

David  T.  Wilkinson  (1935-2002),  of  Princeton  University,  joined  Charles  L. 
“Chuck”  Bennett  (1956-  )  of  the  Goddard  Space  Flight  Center  to  create  a  small 
team  of  experts  and  design  a  spacecraft  that  could  accomplish  the  goal  within  the 
modest,  for  NASA,  budget  cap  of  $70  million  in  1994  dollars.  The  resultant 
Microwave  Anisotropy  Probe  (MAP)  was  approved  in  mid- 1996  and  launched  on 
30  June  2001.  The  name  was  changed  in  early  2003  to  Wilkinson  Microwave 
Anisotropy  Probe  (WMAP),  to  honor  Wilkinson  after  his  death.  Instruments  aboard 
WMAP  provided  definitive  measurements  of  the  rippling  departures  from  unifor¬ 
mity  (Fig.  15.2),  with  temperature  fluctuations  of  1  part  in  100,000,  or  at  about 
0.00003  K  (Bennett  et  al.  2003).  This  anisotropy,  at  the  level  of  A T/T  = 
(1.1  ±0.1)  x  10  5,  is  given  with  other  physical  properties  of  the  background 
radiation  in  Table  15.1.  Hu  and  Dodelson  (2002)  have  reviewed  cosmic  micro- 
wave  background  anisotropies. 

When  combined  with  previous  measurements,  the  WMAP  instruments  showed 
that  temperature  variations  are  concentrated  within  certain  angular  sizes  that  are 
displayed  in  an  angular  power  spectrum  -  a  plot  of  the  relative  strength  of  the  hot 
and  cold  spots  against  their  angular  sizes  (Fig.  15.3).  This  spectrum  is  not  flat  -  it 
is  rippled.  Gravity  explains  the  ripples,  the  relative  amplitudes  of  which  can  be 
used  to  infer  the  gravitational  pull  that  caused  them. 
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Table  15.1 

Physical  properties  of  the  cosmic  microwave 

background  radiation 

Parameter 

Name 

Value 

Tq  =  P CM  R 

Temperature 

2.725  ±  0.002  K 

El CMB 

Photon  density 

(410.4  ±  0.9)  x  106  m-3 

PCMB 

Mass-energy  density  of  photons 

4.648  x  10-31  kg  m~3 

T\ 

Dipole  anisotropy 

0.003346  ±  0.000017  K 

Ti/To 

Dipole  anisotropy / temperature 

0.001228 

AT /T0 

Anisotropy,  dipole  removed 

(1.1  ±  0.1)  X  10-5 

Tz  Qrms 

Quadrupole  moment 

(8  ±  2)  x  10~s  K 

Multi  pole  moment  l 

10  100  500  1000 


Fig.  15.3  Ripple  data  The  angular  fluctuation  strength,  or  power,  of  the  cosmic  microwave 
background  radiation  in  which  temperature  fluctuations,  in  units  of  square  micro  kelvin  (10-6  K 
and  designated  pK),  are  displayed  as  a  function  of  their  angular  extent  in  degrees,  denoted  0.  This 
plot  shows  the  relative  brightness  for  the  all-sky  map  observed  from  the  Wilkinson  Microwave 
Anisotropy  Probe  ( WMAP )  (see  Fig.  15.2)  at  various  sizes.  The  solid  line  is  the  model  that  best 
fits  the  observed  data  (solid  dots);  the  gray  band  represents  uncertainties  in  the  model.  Anisotropy 
data  obtained  by  previous  experiments  are  denoted  by  dots  with  error  bars.  The  observed  power 
spectrum  has  been  compared  to  other  astronomical  observations  and  different  theoretical  models, 
providing  estimates  for  the  amount  of  dark  matter  and  dark  energy  in  the  universe  (see 
Fig.  15.10).  (Courtesy  of  the  NASA/ WMAP  Science  Team.) 


The  ratio  of  the  heights  of  the  first  and  second  peak  of  the  angular  power 
spectrum  was  used  to  determine  the  amount  of  “ordinary”  matter  with  which  we 
are  familiar,  the  baryonic  type  that  comprises  atoms.  The  neutrons  and  protons 
found  in  the  nuclei  of  all  atoms  are  baryons. 

When  the  height  of  the  third  peak  was  compared  to  the  other  two,  scientists 
estimated  the  amount  of  dark,  nonbaryonic  matter.  The  comparison  indicated  that 
dark  matter  is  five  times  more  abundant  than  ordinary  baryonic  matter,  and  that  the 
combined  gravitational  pull  of  both  kinds  of  types  of  matter  is  not  enough  to  stop 
the  future  expansion  of  the  universe  (Table  15.2). 

The  COBE  and  WMAP  results  have  carried  cosmology  beyond  the  esoteric 
realms  of  theoretical  speculation  and  into  precise  scientific  tests.  Definitive  new 
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Table  15.2  Cosmological  parameters  inferred  from  WMAP  observations" 


Parameter 

Name 

Value 

H0 

Hubble  constant" 

71.0  ±  2.5  km  s-1  Mpc-1 

*0 

Age  of  expanding  universe 

(13.75  ±  0.13)  x  109  years  (after  the  big 
bang) 

teq 

Equality  of  matter  and  radiation 
(redshift  zeq  =  3196  db  133) 

76,000  d=  5,000  years 

t dec 

Decoupling  (recombination) 

(3.79  ±  0.05)  x  105  years  (after  the  big 

(redshift  idee  —  1090.89  db  0.69) 

bang) 

t  reion 

Reionization  time  (redshift 

(3.5  d=  1.5)  x  108  years  (after  the  big 

Zreion  —  10.5  zb  1.2) 

bang) 

Qbh2 

Baryonic  matter 

(Qb  =  0.0449  db  0.0028) 

0.02258  ±  0.00057 

Qch 2 

Dark  matter  density 
(Qc  =  0.222  db  0.026) 

0.1109  ±  0.0056 

Qmh2 

Total  matter  density 

(Qm  =  0.267  db  0.026) 

0.1335  db  0.0056 

qa 

Dark  energy  density 

0.734  ±  0.029 

k^tot 

Total  density:  matter  +  Energy 

1.08  ±  0.09 

a  Parameter  values  are  from  WMAP  only,  adapted  from  Jarosik  et  al.  (2011).  The  Hubble 
constant  Ho  =  100  /ikms-1  Mpc-1,  or  0.71  for  these  estimates.  A  more  accurate  value 

_  1  _ Q 

indicates  that  H0  =  15  km  s  Mpc  .  The  parameter  Q  is  the  ratio  of  the  specified  quantity  to 
the  critical  amount  required  to  keep  the  expansion  of  the  universe  on  the  brink  of  closure.  The 
matter  density  parameter,  for  example,  is  Qm  =  pm{to)/ Pc-  Where  pm{to)  is  the  total  mass 
density,  in  visible  and  invisible  form,  at  the  present  time  t0,  and  pc  =  3Hq/(StzG)  w  1.0  x 
10-26  kgm-3  for  Ho  =  75  kms-1  Mpc-1  is  the  critical  mass  density  required  to  stop  the 
expansion  of  the  universe  in  the  future.  The  total  density  parameter  Qtot  is  the  sum  of  the 
contributions  from  visible  matter,  dark  matter,  and  dark  energy,  and  kltot  =  1.00  is  consistent 
with  inflation  and  a  universe  that  is  described  by  Euclidean  geometry  without  space  curvature. 

observational  descriptions  of  the  background  radiation,  with  refined  cosmological 
consequences,  are  expected  from  the  Planck  mission,  launched  in  May  2009.  The 
initial  Planck  results,  announced  on  March  21,  2013,  indicate  a  Hubble  constant  of 
H0  =  67.15  zt  1.2  km  s-1  Mpc-1,  and  a  universe  with  a  baryonic  (normal)  matter 
content  of  4.9  percent  (Q/^  =  0.049),  a  dark  matter  content  of  26.8  percent 
(Qc  =  0.268),  a  total  matter  density  of  31.7  percent  (Qm  =  0.317)  and  a  dark 
energy  content  of  68.3  percent  (Q^  =  0.683).  The  values  of  these  parameters 
inferred  from  the  WMAP  results  are  given  in  Table  15.2. 


15.3  The  Beginning  of  the  Material  Universe 
15.3.1  The  First  Three  Minutes 

George  Gamow  (1904-1968)  and  his  colleagues  proposed  that  the  first  elements 
were  formed  during  the  big  bang  that  propelled  the  universe  into  expansion 
(Alpher  et  al.1948,  Sect.  10.5).  As  they  supposed,  the  lightest  atomic  nuclei  were 
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synthesized  before  any  atoms  were  created,  during  the  first  three  minutes  fol¬ 
lowing  the  big  bang  explosion;  the  nuclei  of  less  abundant,  heavier  atoms  were 
manufactured  at  a  later  time,  within  the  cores  of  stars. 

The  Japanese  astrophysicist  Chushiro  Hayashi  (1920-2010)  improved  their 
scenario  by  noting  that  the  temperature  was  hot  enough  in  the  immediate  aftermath 
of  the  big  bang  to  create  neutrinos  and  positrons,  the  antimatter  particles  of  the 
electron,  and  equilibrium  was  established  between  electrons,  positrons,  neutrons, 
protons  and  radiation  (Hayashi  1950). 

The  positrons,  denoted  £+,  interacted  with  the  neutrons,  denoted  n ,  to  form 
protons,  p ,  by  the  reaction: 


e+  +  n  +  ve,  (15.9) 

where  ve  denotes  an  antielectron  neutrino.  The  double  arrow  in  these  equations 
indicates  that  the  reverse  reactions  happen  just  as  often  as  the  forward  reactions, 
so  positrons  and  neutrons  were  being  produced  just  as  often  as  they  were  being 
consumed.  The  neutrons  also  interacted  with  the  electron  neutrinos,  ve  to  form 
protons  by: 


ve-\-n^p-\-e  ,  (15.10) 

where  e~  denotes  an  electron. 

And  in  the  meantime,  the  radiation  was  producing  electron-positron  pairs, 
which  turned  back  into  radiation  just  as  rapidly,  through  the  reactions: 

y  +  y^e~  +  e+ ,  (15.11) 

where  y  denotes  a  photon  of  gamma  ray  radiation. 

During  the  first  few  seconds  of  the  expanding  universe,  when  the  radiation 
temperature  was  greater  than  109  K,  there  were  no  atoms,  just  radiation  and 
subatomic  particles,  like  neutrons,  protons,  electrons,  positrons,  and  neutrinos.  The 
equilibrium  number  density  ratio  of  neutrons,  Nn,  and  protons,  NP,  just  depended 
on  the  mass  difference  between  neutrons  and  protons,  denoted  mn  —  mp,  and  the 
temperature  T,  according  to  the  relation  (Hayashi  1950): 


N, 


n 


Nr 


=  exp 


~{mn  -  mp)c: 
~kT 


(15.12) 


_ Q  _  1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  ,  the  neutron  mass 
mn  =  1.6749274  x  10-27  kg,  the  proton  mass  mp  =  1.6726218  x  10-27  kg,  and 
the  speed  of  light  c  =  2.9979  x  108  m  s”1.  The  mass  difference  Am  =  mn-  mp  = 
2.3056  x  10_3°  kg,  and  the  energy  A  E  =  ( mn  -  mp )  c 2  =  A  me2  «  2.07  x 
10-13  J,  which  corresponds  to  a  temperature  T  =  A E/k  ^  1.5  x  1010  K. 

As  the  temperature  decreased,  the  number  density  ratio  Nn/Np  also  decreased  and 
the  protons  outnumbered  the  heavier  neutrons.  Eventually,  the  reactions  were  no 
longer  in  thermodynamic  equilibrium,  neutrons  could  no  longer  be  created,  and  the 
neutron-to-proton  ratio  became  “frozen-in”  at  an  amount  that  determined  the 
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abundance  of  helium  synthesized.  Modern  computations  by  David  Schramm 
(1945-1977)  and  others  have  conclusively  demonstrated  that  all  of  the  hydrogen  and 
most  of  the  helium  nuclei,  which  now  are  found  in  the  universe,  were  indeed  syn¬ 
thesized  in  the  immediate  aftermath  of  the  big  bang  (Sect.  10.5,  Peebles  et  al.  1991). 


Example:  Big-Bang  nucleosynthesis  of  helium 

In  the  first  moments  of  the  big  bang,  the  neutrons  and  protons  were  in 
thermodynamic  equilibrium  with  a  number  density  ratio  Nn/Np  = 
exp[— A  me2 /(kT)\,  where  Am  is  the  difference  between  the  mass  of  the 
neutron  and  the  mass  of  the  proton,  c  is  the  speed  of  light,  A  me2  ~  2.07  x 
10~13  J,  the  Boltzmann  constant  k  =  1.38065  x  10-23  J  K-1  and  T  is  the 
temperature.  Just  a  few  seconds  after  the  big  bang,  the  temperature  had 
cooled  to  just  above  109  K,  and  the  production  of  both  positrons  and  neu¬ 
trons  stopped.  The  leftover  positrons  then  were  consumed  by  interactions 
with  electrons,  and  an  equilibrium  was  established  in  which  the  relative 
amounts  of  neutrons  and  protons  were  governed  by  their  mass  difference  and 
the  temperature.  Hayashi  (1950)  estimated  that  this  frozen-in  abundance 
ratio  was  Nn/Np  ^0.25.  Alpher  et  al.  (1953)  obtained  lower  amounts  of 
between  0.17  and  0.22  using  detailed  calculations  that  depended  on  the  time 
it  takes  a  free  neutron  to  decay  into  a  proton;  outside  a  nucleus  free  neutrons 
are  unstable  and  have  a  mean  lifetime  of  881.5  zb  1.3  s  or  about  14  min 
42  s.  A  lower  limit  for  the  “frozen-in”  neutron- proton  ratio  is  obtained  when 
the  thermal  energy  kT  equals  mec  ,  the  rest-mass  energy  of  the  electron  or 
Nn/Np  =  exp  —(mn  —  mp)/me ]  «  exp  (—2.53)  «  0.08,  where  the  electron 
mass  me  =  9.10938  x  10-31  kg. 

If  Nn  and  Np  respectively  denote  the  number  densities  of  neutrons  and 
protons  before  helium  nuclei  were  synthesized,  then  Nn/2  helium  nuclei  will 
be  formed,  since  each  helium  nucleus  contains  two  neutrons  and  two  pro¬ 
tons,  and  the  number  of  protons  left  over  is  Np  —  Nn.  The  relative  number 
densities  of  helium,  N(4He ),  nuclei  to  hydrogen,  N(H ),  nuclei  is: 


N(4He) 
N(H ) 


Nn/2 
Np  Nn 


1 

«  —  «  0.08. 
12 


The  helium  mass  fraction,  T,  is: 


Y  = 


4  N(4He) 


N(H)  +  4  (4He) 


0.25. 


(15.13) 


(15.14) 


About  one  quarter  of  the  mass  of  the  material  universe,  in  baryons,  was 
synthesized  into  helium  in  the  first  few  minutes  of  the  expanding  universe, 
and  this  is  consistent  with  the  amount  of  helium  that  is  now  observed  in  the 
universe. 
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Once  neutron  production  stopped,  the  neutrons  and  protons  could  begin  com¬ 
bining  to  form  the  nuclei  of  deuterium  and  helium  atoms  in  amounts  governed  by 
the  frozen-in  abundance  ratio  of  neutrons  and  protons.  All  of  the  protons  that  did 
not  participate  in  forming  these  deuterium  and  helium  nuclei  eventually  became 
the  nuclei  of  hydrogen  atoms.  The  production  of  light  atomic  nuclei  was  over,  with 
vastly  more  hydrogen  left  behind  than  anything  else. 


15.3.2  Formation  of  the  First  Atoms,  and  the  Amount 
of  Invisible  Dark  Matter 

Whole  atoms  were  not  formed  until  the  expanding  universe  cooled  enough  for 
electrons  to  combine  with  protons  and  helium  nuclei  to  form  long-lived  hydrogen 
and  helium  atoms.  This  recombination  occurred  about  400,000  years  after  the  big 
bang,  when  the  temperature  had  fallen  to  about  3,000  K.  The  rate  of  recombination 
was  then  higher  than  the  rate  of  ionization  by  the  intense  radiation.  By  the  end  of 
recombination,  all  of  the  nuclei  and  electrons  had  been  bound  up  in  atoms,  and  the 
universe  became  transparent  to  the  radiation  that  then  could  travel  through  space 
without  scattering  off  free,  unattached  electrons.  The  cosmic  microwave  back¬ 
ground  radiation  that  we  observe  in  the  present  was  released  back  then,  almost 
14  billion  years  ago. 

Because  there  is  no  stable  nucleus  of  atomic  mass  5  or  8,  elements  heavier  than 
helium  (of  mass  4)  could  not  be  synthesized  by  successive  collisions  with  protons 
(of  mass  1).  Big-bang  nucleosynthesis  therefore  stopped  at  helium  4.  Heavier 
elements  needed  to  be  synthesized  inside  stars  where  the  densities  are  high  enough 
for  triple  collisions  of  helium  to  form  carbon,  rather  than  the  big  bang  in  which  the 
density  had  become  too  low  by  the  time  helium  nuclei  were  formed  for  triple 
collisions  to  become  significant  at  the  prevailing  temperature. 

The  nuclei  of  the  hydrogen  and  deuterium  atoms  and  most  of  the  nuclei  of 
helium  atoms  that  now  are  present  in  the  universe  were  synthesized  in  the  first 
3  min  of  the  expansion,  in  the  immediate  aftermath  of  the  big  bang  and  about 
14  billion  years  ago.  All  of  the  hydrogen  found  in  stars  and  interstellar  space,  or  in 
the  Earth’s  water,  and  in  our  body,  was  produced  by  this  big-bang.  And  every  time 
you  buy  a  floating  party  balloon,  which  has  been  inflated  by  helium,  you  are 
getting  atoms  made  about  14  billion  years  ago.  Deuterium  is  destroyed  inside  stars 
and,  although  helium  is  synthesized  in  main- sequence  stars,  the  amount  of  helium 
formed  inside  stars  over  the  lifetime  of  the  expanding  universe  is  no  more  than 
10  %  of  what  is  now  observed  in  cosmic  objects  (Hoyle  and  Tayler  1964). 

The  cosmological  implications  of  big-bang  nucleosynthesis  are  profound!  The 
agreement  of  light-element  abundances  and  predictions  from  the  primordial 
nuclear  reactions  works  only  if  the  density  of  ordinary  matter  in  the  universe  -  in 
both  visible  and  invisible  forms  -  is  less  than  10  %  of  the  critical  mass  density,  pc, 
required  to  eventually  stop  the  expansion  of  the  universe. 
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Fig.  15.4  Key  events  in  the  expanding  universe  Shortly  after  the  big  bang  (left)  the 
temperature  was  about  109  K  for  both  radiation  and  matter,  and  with  its  greater  energy  density 
radiation  dominated  the  expansion  of  the  universe.  With  increasing  time,  the  radiation  energy 
density  dropped  faster  than  the  matter  energy  density,  until  the  two  became  equal  about  76,000 
years  after  the  big  bang  when  the  temperature  had  dropped  to  about  104  K.  Thereafter  the 
radiation  went  its  separate  way,  cooling  to  the  2.725  K  we  now  detect  in  the  cosmic  microwave 
background  radiation  13.7  x  109  years  after  the  big  bang  {right).  The  matter  became  cool  enough 
for  protons  and  electrons  to  recombine  and  make  hydrogen  atoms  about  400,000  years  after  the 

o 

big  bang,  at  a  temperature  of  about  3,000  K,  and  the  first  stars  and  galaxies  formed  between  10 
and  109  years  after  the  big  bang 


Observations  of  cosmic  objects  with  low  heavy-element  (metals)  content 
indicate  that  the  primordial  abundance  of  helium,  or  the  amount  created  before  the 
first  stars  formed  and  synthesized  any  helium  or  heavy  elements,  is  roughly 
Y  =  0.25,  or  25  %  by  mass.  WMAP  observations  of  patterns  in  the  cosmic 
microwave  radiation  provide  evidence  for  the  presence  of  helium  long  before  the 
first  stars  formed  with  Y  =  0.326  db  0.075  (Jarosik  et  al.  2011).  The  abundance  of 
deuterium,  D ,  relative  to  hydrogen,  //,  which  is  D/H  >  10-5,  sets  an  upper  limit  to 
the  baryon  density  parameter  of  QB  =  Pso/Pc^O-025  hr2  ^0.05,  where  the 
Hubble  constant  Hq  =  100  h  ~  75  km  s-1  Mpc-1  and  the  critical  mass  density 
required  to  close  the  expanding  universe  in  the  future  is  pc  =  3Hq/(&tzG)  ~ 
10_26kgm~3  for  Hq  =  75  km  s^1  Mpc-1.  The  number  of  baryons  is  equal  to  the 
number  of  neutrons  and  protons  in  matter  in  either  seen  or  unseen  forms. 

When  combined,  the  big-bang  nucleosynthesis  constraints  of  all  four  of  the 

A  O  n 

light  nuclei,  He ,  D,  He ,  and  Li ,  provide  limits  to  the  baryon  to  photon  number 
density  ratio,  tj  =  NB/Ny  ^3  x  10-10,  which  has  remained  unchanged  since  the 
epoch  of  electron-positron  annihilation  a  few  seconds  after  the  big  bang.  Here  NB 
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is  the  number  density  of  baryons  and  Ny  is  the  number  density  of  photons,  so  the 
universe  contains  less  than  one  baryon  (neutrons  and  protons)  per  billion  photons. 
During  the  time  of  big-bang  nucleosynthesis ,  the  universe  was  a  dilute  gas  of 
radiation  photons,  contaminated  by  only  trace  amounts  of  baryons.  At  present  the 
number  density  of  photons  in  the  cosmic  microwave  background  radiation  is 
Ny  o  «  4  x  108  m-3.  The  baryon  density  now,  pB0  =  mpNyorj  ^2x  10-28  kgm-3, 

_ 0^7 

where  the  proton  mass  mp  =  1.673  x  10  ~  kg,  and  the  baryon  density  parameter 
Q#  =  Pbo/ Pc  ~  0-05  (Copi  et  al.  1995). 

Thus,  when  measurements  of  the  Hubble  constant  are  considered,  the  results  of 
big-bang  nucleosynthesis  and  the  observations  of  the  abundance  of  light  elements 
indicate  that  the  baryon  density  is  now  0.05  or  5  %,  of  the  critical  mass  density 
required  to  ever  halt  the  current  expansion  of  the  universe  in  the  future. 

A  completely  independent  estimate  of  the  baryon  density  is  provided  by  the 
power  spectrum  of  the  anisotropies  in  the  cosmic  microwave  background  radia¬ 
tion.  The  relative  amplitudes  of  the  peaks  in  the  spectrum  constrain  it  to 
Q b  =  0.044  =b  0.004,  in  excellent  agreement  with  the  conclusions  based  on  big- 
bang  nucleosynthesis. 


15.3.3  History  of  the  Expanding  Universe 

As  the  universe  grew  larger,  the  radiation  energy  density,  pr{t),  decreased  more 
rapidly  than  the  mass  density,  pm{t).  Eventually,  at  the  time  teq  «  76,000  years 
after  the  big  bang,  the  mass-energy  density  of  the  radiation  had  become  equal  to 
that  of  the  matter.  Thereafter  it  was  mass  that  dominated  the  expansion  of  the 
universe.  This  and  other  critical  times  in  the  expansion  are  illustrated  in  Fig.  15.4. 
Loeb  and  Barkana  (2001)  have  reviewed  the  re-ionization  of  the  universe  by  the 
first  stars  and  quasars. 

If  R(t)  denotes  the  radius,  or  scale  factor,  of  the  universe  at  cosmic  time,  t,  the 
radiation  temperature,  Tr(t),  falls  off  as  \/R(t).  This  means  that  the  radiation 
energy  density  (Tolman  1934): 

pr(t)=^Nl  *  J_  (15.15) 

where  the  radiation  constant  a  =  1 .5657  x  10  16  J  m  3  K  4,  the  speed  of  light 

o  1 

c  =  2.9979  x  10  m  s  ,  and  the  symbol  oc  denotes  proportional  to.  The  mass 
density,  pm(t ),  doesn’t  decrease  as  rapidly  with  increasing  time  and  radius,  since  it 
thins  out  with  increasing  volume,  or  with 


p„,(t)  oc 


(15.16) 
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Since 

(15. 


M2p  =  7V(,„)  (15.18) 

is  a  constant  for  all  times  t.  Here  teq  denotes  the  time  when  the  radiation  energy 
density  is  equal  to  the  matter  density,  or  when  pr{teq)  =  pm(teq) 


Tr(t)  oc 


1 


RlfV 


the  ratio 


Example:  Radiation  energy  density  and  mass  energy  density 

As  the  universe  expands,  the  radiation  energy  density  pr{t)  at  time  t  will  fall 
off  as  the  inverse  fourth  power  of  the  radius  of  the  universe,  R(t ),  since  the 
radiation  temperature  Tr{t)  falls  off  as  1  /R(t).  The  mass  energy  density  pm(t ) 
falls  off  as  the  inverse  cube  of  the  radius,  since  the  volume  increases  as  the 
radius  cubed.  This  means  that  pm{t)Tr(t) / pr(t)  =  constant  for  all  time  t. 
At  the  present  time  t0  we  have  Tr(to)  =  2.725  K,  the  pr(to)  =  a  T^(to)/c2  ~ 
0.842  x  10~32  T?(t0)  ~  4.64  x  10-31kgm-3,  where  the  radiation  constant 
a  =  7.5657  x  10“ 16  J  m-3  K-4  and  the  speed  of  light  c  =  2.9979  x 

O  _  1 

10  m  s  .  WMAP  observations  indicate  that  the  density  parameter  of 
normal  baryonic  matter,  like  protons  and  neutrons  that  are  found  in  atoms,  is 
Qfc  =  pm(to)/pc  ~  0.04  (Table  15.2),  where  pm(to )  is  the  present  mass 
density  in  both  visible  and  invisible  baryons  and  the  critical  mass  density 
required  to  stop  the  expansion  of  the  universe  in  the  future  is  pc  =  3tf07 
(87iG)  «  1.0  x  10^26  kgm-3  (Sect.  14.6),  the  Hubble  constant  H0  = 
100  h  km  s'1  Mpc-1  =  3.24  x  10-18  h  s-1,  with  h  «  0.75,  1  Mpc  = 
3.0857  x  10 19  km,  and  the  Newtonian  gravitational  constant 
G  =  6.674  x  10-11  m3  kg-1  s-2.  This  means  that  pm{to)  ~  0.04  pc  « 
4  x  10^28  kgm-3,  about  a  thousand  times  greater  than  the  present  radiation 
energy  density,  and  that  pm(to)Tr(to)  /  pr(to)  ^2x  103. 

The  radiation  energy  density  and  mass  energy  density  were  equal  at  a  time 
teq  «  76,000  years  «  2.40  x  10 12  s,  where  one  year  =  3.1557  x  107  s, 
after  the  big  bang  (Table  15.2).  At  this  time,  the  radiation  temperature  has 
dropped  to  Tr(teq)  =  2.15  x  1010  t~q2  «  104  K,  and  pm(teq)Tr(teq) / 
Pr(teq)  =  Tr(teq )  ~  104,  close  enough  to  the  value  of  this  ratio  now  consid¬ 
ering  the  uncertainties. 


The  radiation  dominated  era,  when  the  radiation  energy  density  is  greater  than 
the  matter  density,  has  a  radius  that  scales  as  the  square  root  of  time,  t,  or 
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R(t)  (x  t1^2  for pr(t)  >  pm(t)  or  for  t <teq,  (15.19) 

and  when  the  matter  dominates  the  expansion  of  the  universe,  its  radius  is: 

R(t)  oc  t 2^3  for  pm(t)  >  pr(t )  or  for  t  >  teq.  (15.20) 

If  radiation  was  emitted  at  time  t ,  then  its  redshift,  z,  is  given  by 


1  +z  = 


*(*<>) 


and  the  radiation  temperature,  Tr{t)  at  that  time  is  given  by 


Tr{f)  =  (1  +  z)  Tr(t 0)  =  2.725  (1  +  z), 


(15.21) 


(15.22) 


where  the  temperature  of  the  background  radiation  Tr(to)  =  2.725  db  0.002  K  right 
now,  at  time  t0. 

WMAP  observations  indicate  that  the  radiation  energy  density  became  equal  to 
the  matter  density  at  a  redshift  of  zeq  =  3196  zb  133  (Jarosik  et  al.  2011).  Since 
that  time,  denoted  as  teq  for  the  equality  of  radiation  and  matter,  the  matter 
dominated  the  expansion  and  R(t)  grew  as  f2^3,  so  the  transition  from  the  radiation 
dominated  era  to  the  matter  dominated  era  occurred  at  time  teq  where: 

=  (1  +zf2,  (15.23) 


>eq  —  3'2  76,000  years  (15.24) 

(1  +Zeq)  7 

after  the  big  bang.  This  indicates  that  matter  has  dominated  the  expansion  of  the 
universe  for  all  but  a  relatively  small  fraction  of  the  age  t0  «  13.7  x  109  years  of 
the  expanding  universe. 

Another  crucial  time  is  the  decoupling  time ,  or  recombination  time ,  tdec,  when 
the  temperature  had  fallen  to  about  3,000  K,  which  was  cool  enough  for  electrons 
to  begin  recombining  with  protons  and  helium  nuclei  to  form  long-lived  hydrogen 
and  helium  atoms.  That  is,  the  rate  of  recombination  to  form  atoms  was  then 
higher  than  the  rate  of  atomic  ionization  by  the  intense  radiation.  By  the  end  of 
recombination,  all  the  nuclei  had  been  bound  up  in  atoms,  and  the  universe  became 
transparent  to  the  radiation;  so  cosmic  microwave  background  radiation  that  we 
observe  at  the  present  time  was  released  back  then,  about  13.3  billion  years  ago. 

The  decoupling  redshift  corresponding  to  this  temperature  is  zdec  ^ 
3,000/2.725  «  1100,  and  tdec  =  to/ (11 00)3^2  «  377,000  years.  The  WMAP  esti¬ 
mates  give  zdec  =  1090.89  ±  0.69.  This  also  marks  the  beginning  of  the  dark  ages  of 
the  expanding  universe,  for  there  were  no  sources  of  radiation  other  than  the  grad¬ 
ually  cooling  and  darkening  cosmic  background  radiation  until  stars  and  galaxies 


and  that 


* eq 

to 


R(tp) 

R(teq ) 


to 
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Table  15.3  Crucial  times  during  the  expansion  of  the  universe 


Time  (after  the  big 
bang) 

Redshift,  z 

Temperature 

(K) 

Key  events 

icr14  s 

1027 

1027 

Inflation  ends,  Qm  +  QA  =  1- 

10  s 

4  x  109 

10‘° 

Neutron  and  positron  production 
stops. 

3  min 

4  x  108 

109 

Big-bang  nuclosynthesis  ends,  light 
elements  H,  D,  He  formed 

teq  «  76,000  year 

3,196 

104 

Radiation  domination  equals  matter 

domination,  transfer  from  R(t)  1//2  to 
R(t)  oc  f2/3 

tdec  ~  377,000  year 

1,100 

3,000 

Decoupling  (recombination)  time, 
radiation  decouples  from  matter, 
hydrogen  atoms  recombine,  universe 
becomes  transparent  to  background 
fluctuations,  dark  ages  begin 

treio n  ^3.5  x  108  year 

10.5  ±  1.2 

31 

Reionization  time,  first  stars  and  galaxies 
form,  universe  re-ionized  by  their 
radiation,  and  dark  ages  end 

6  x  109  year 

1 

5 

Dark  energy  begins  acceleration  of 
universe  expansion 

t0  =  13.7  x  109  year 

0 

2.725 

Age  of  expanding  universe,  today,  present 
epoch 

formed,  about  350  million  years  after  the  big  bang,  they  provided  beacons  of  bright 
light  that  could  ionize  surrounding  matter.  However,  by  then  the  universe  had  thinned 
out  enough  that  the  low-density  ionized  hydrogen  remained  transparent,  ending  the 
dark  ages. 

A  summary  of  these  milestones  in  the  history  of  the  expanding  universe  is  given 
in  Table  15.3. 


Example:  When  did  the  first  stars  and  galaxies  form? 

When  atoms  coalesced  to  form  stars  and  galaxies,  these  shining  beacons  of 
intense  radiation  illuminated  the  former  darkness  and  re-ionized  nearby 
matter.  The  WMAP  observations  suggest  that  this  began  at  a  redshift 
z  re  ion  =  10-5  =b  1.2.  This  would  happen  at  a  background  radiation  temper¬ 
ature  T  =  (1  +  z)  2.725  =  31  K.  The  corresponding  time,  treion,  is  treion  = 
W(1  +  Zreion)3^2  ~  3.5  x  108  years  or  about  350  million  years  after  the  big 
bang.  Since  matter  decoupled  from  radiation,  and  the  dark  ages  began,  about 
76,000  years  after  the  big  bang,  the  dark  ages  lasted  about  350  million  years 
and  stars  and  galaxies  have  been  around  for  about  t0  -  treion  «  13.4  billion 
years,  where  the  big  bang  occurred  about  t0  =  13.75  billion  years  ago.  These 
stars  and  galaxies  have  redshifts  of  10.5  or  less. 
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15.4  The  First  Stars  and  Galaxies 
15.4.1  Pulling  Primordial  Material  Together 

Immediately  after  the  big  bang,  the  radiation  and  matter  were  distributed 
smoothly,  with  almost  no  structure  at  all;  because  the  subatomic  neutrons  and 
protons  were  then  in  thermal  equilibrium  with  the  radiation,  the  material  universe 
must  have  had  a  smooth  beginning.  Departures  from  complete  uniformity  in  the 
cosmic  microwave  background  radiation  were  about  one  part  in  one  hundred 
thousand,  or  with  fluctuations  AT  in  the  temperature  T  of  A T/T  «  10-5.  Yet,  the 
material  universe  we  see  today  has  gathered  together  into  stars,  galaxies,  and 
clusters  of  galaxies,  so  something  must  have  given  them  shape  and  form.  They  are 
composed  of  baryonic  matter,  which  includes  neutrons,  protons,  and  atoms  of  any 
sort,  and  thus  consist  of  ordinary  matter  that  we  encounter  or  experience  in 
everyday  life. 

When  left  alone,  a  spread  out  distribution  of  matter  will  coalesce  about  small, 
initial  concentrations,  as  a  result  of  their  gravitational  pull  on  surrounding  mate¬ 
rial,  so  any  slight  perturbations  in  otherwise  uniform  matter  will  grow  and  even¬ 
tually  contract.  Given  enough  time,  gravity  might  magnify  the  extremely  slight 
irregularities  in  the  initial  mass  distribution,  eventually  providing  the  concentrated 
structures,  the  observed  stars  and  galaxies,  by  gravitational  collapse. 

The  problem  with  this  scenario,  first  recognized  by  Georges  Lemaitre 
(1894-1966)  and  rigorously  derived  by  Evgeny  Lifshitz  (1915-1985),  is  that  the 
initial  irregularities  could  not  have  grown  fast  enough  to  account  for  the  observed 
stars  and  galaxies  (Lemaitre  1934;  Lifshitz  1946).  Chance  density  fluctuations  in 
this  matter  would  have  grown  too  slowly  to  overcome  and  resist  the  overall 
expansion  of  the  universe,  which  pushes  and  tears  the  material  apart  as  soon  as  it 
starts  to  gather  together.  Even  a  billion  years  is  not  enough  time  for  the  fluctuations 
in  ordinary  matter  density  to  gravitationally  pull  the  primeval  matter  together  into 
luminous  stars  and  galaxies. 

We  can  follow  the  basic  argument  by  supposing  that  an  initial  density  pertur¬ 
bation  A pm  in  the  mass  density  pm  was  of  order  Apm/ pm  =  10-5,  since  the  sub¬ 
atomic  matter  was  in  thermal  equilibrium  with  the  radiation  with  temperature 
fluctuations  of  A  T/T  =  10-5.  In  the  matter-dominated  era  the  perturbations  will 
slowly  grow  with  increasing  time,  t ,  and  the  scale  factor  R(t)  as 

A  Pm!  Pm  ^2//3  oc  R(t),  but  even  over  the  past  13  billion  years  R(t )  has  only 

o 

increased  by  a  factor  of  a  thousand,  or  10  ,  so  the  initial  perturbations  would  have 
grown  to  A pm/pm  =  10-2,  much  smaller  than  the  amount  of  Apm/pm  =  1  now 
observed  in  galaxies. 

However,  we  have  considered  only  ordinary  matter,  and  the  paradox  could  be 
resolved  if  the  extra  gravitational  pull  of  much  greater  amounts  of  invisible  dark 
matter  helped  clump  and  shape  the  expanding  universe,  pulling  together 
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primordial  fluctuations  in  perfectly  ordinary  matter  and  forming  the  seeds  from 
which  the  first  stars  and  galaxies  grew. 

According  to  the  cold  dark  matter  scenario,  galaxies  formed  first,  than  gravi¬ 
tationally  merged  and  consolidated  into  clusters  and  super-clusters  of  galaxies  as 
the  universe  expanded  and  evolved;  for  pioneering  papers  on  the  cold  dark  matter 
hypothesis  see  Lang  (1999). 

Understanding  how  gravity  causes  the  perturbations  in  ordinary  matter  to  grow 
in  an  expanding  universe  and  eventually  become  galaxies  requires  studying  the 
interaction  between  ordinary  matter  and  dark  matter.  That  interaction  causes  a 
region  of  space  with  more  ordinary  matter  than  average  to  oscillate,  sending  out 
waves  known  as  baryonic  acoustic  oscillations.  These  sound  waves  were  first 
predicted  in  1970,  suggested  by  WMAP  fluctuations  of  the  cosmic  microwave 
background  in  1999,  and  measured  as  rippling  imprints  in  the  distribution  of 
galaxies  using  Sloan  Digital  Sky  Survey  data  in  2005  (Peebles  and  Yu  1970; 
Eisenstein  et  al.  2005). 


15.4.2  When  Stars  Began  to  Shine 

With  the  help  of  cold  dark  matter,  the  first  stars  and  galaxies  appeared  more  than 
10  billion  years  ago.  We  can  observe  these  embryonic  galaxies  when  they  were 
cosmic  infants;  the  light  now  reaching  us  began  its  journey  long  before  the  Sun 
came  into  existence. 

Bromm  and  Larson  (2004)  have  discussed  the  first  stars;  Bromm  and  Yoshida 
(2011)  provided  a  review  of  the  first  galaxies;  Brodie  and  Stader  (2006)  discussed 
extragalactic  globular  clusters  and  galaxy  formation,  and  Kravtsov  and  Borgani 
(2012)  has  reviewed  the  formation  of  galaxy  clusters. 

Kennicutt  and  Evans  (2012)  have  reviewed  star  formation  near  and  far.  Shapley 
(2011)  has  reviewed  the  physical  properties  of  galaxies  from  redshifts  z  =  2-4, 
and  Giavalisco  (2002)  has  discussed  Lyman-break  galaxies.  Sanders  and  Mirabel 
(1996)  have  reviewed  luminous  infrared  galaxies. 

Each  galaxy  may  have  formed  through  the  gravitational  collapse  of  a  larger, 
protogalactic  cloud,  which  would  become  a  rotating  disk  like  the  Milky  Way 
(Eggen  et  al.  1962).  These  flattened,  spinning  galaxies  often  show  spiral  structure, 
with  arms  of  gas  and  dust  in  which  new  stars  are  forming. 

Not  all  galaxies  have  a  disk  or  spiral  shape,  and  the  most  massive  are  the  giant, 
rounded,  featureless  elliptical  galaxies.  They  may  result  from  the  collision  and 
subsequent  merger  of  two  spiral  galaxies.  During  the  encounter,  the  ordered 
rotational  motions  of  the  stars  in  the  spiral  galaxies  are  transformed  by  tidal  forces, 
which  tear  their  disks  and  arms  apart  and  randomize  the  orbits  of  their  stars.  When 
the  merger  is  complete,  a  single  elliptical  galaxy  remains,  composed  of  old  stars 
with  little  or  no  gas  and  dust  left  to  form  new  stars.  Many  of  the  giant  elliptical 
galaxies  are  found  in  the  cores  of  dense  clusters  of  galaxies  where  collisions 
should  be  frequent  on  a  cosmic  time  scale. 
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Time  Since  Big  Bang 
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Fig.  15.5  Star-formation  rates  The  star-formation  rate,  in  solar  masses  per  year  per  cubic 

_ i  _ o 

Megaparsec,  or  M0  year  Mpc  ,  plotted  as  a  function  of  redshift,  z  ( bottom  axis)  and  time 
since  the  beginning  of  the  expanding  universe  ( top  axis),  in  units  of  109  years,  or  1  billion  years 
and  a  G  year.  The  rate  of  star  formation  peaked  at  a  redshift  of  about  3,  or  roughly  2  billion  years 
after  the  expansion  began,  and  this  rate  subsequently  has  decreased  as  gravitation  pulls  more 
material  into  stars.  (From  “The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by 
Cambridge  University  Press,  2013.  Reprinted  with  permission.) 


When  astronomers  use  infrared  telescopes,  aboard  the  Herschel  and  Spitzer 
spacecraft,  they  can  peer  behind  veils  of  local  interstellar  dust  to  see  infant  stars  in 
distant  galaxies.  Some  starburst  galaxies  are  very  powerful  infrared  emitters,  with 
an  infrared  luminosity  greater  than  a  million  million,  or  10  ,  times  that  of  the  Sun, 
and  an  infrared  output  that  is  100  times  their  visible-light  emission.  An  exceptional 
amount  of  interstellar  dust  in  these  galaxies  absorbs  the  intense  ultraviolet  radia¬ 
tion  produced  by  enhanced  star  formation,  and  the  dust  reradiates  in  the  infrared 
part  of  the  electromagnetic  spectrum.  Their  intense  infrared  emission  and  implied 
dust  suggests  that  these  galaxies  are  forming  stars  more  vigorously  than  our 
present-day  Milky  Way  (Fig.  15.5). 

Starburst  galaxies  at  large  redshift  of  about  z  =  3  are  found  by  a  clever  com¬ 
parison  of  the  ultraviolet  and  visible  radiation  of  galaxies.  The  technique  is  related 
to  the  main  ultraviolet  spectral  transitions  of  hydrogen,  the  Lyman  alpha  line  at 
121.7  nm,  which  occurs  between  electron  orbits  with  quantum  numbers  m  =  2  and 
n  =  1,  and  all  the  other  Lyman  transitions  at  larger  m  and  the  same  n ,  culminating 
at  the  Lyman  limit  at  91.2  nm  at  very  large  m.  The  observed  Lyman  limit  at  large 
redshifts  is  Doppler  shifted  into  visible  wavelengths  at  2  =  91. 2(1  T-z),  which  is 
at  360  nm  for  z  =  3.  Radiation  at  wavelengths  lower  than  the  Lyman  limit  is 
almost  completely  absorbed  by  the  alpha  transition  of  neutral  hydrogen  in  the  star¬ 
forming  regions.  At  large  redshifts  the  sharp  decrease,  or  break,  of  the  emitted 
spectrum  has  been  Doppler  shifted  into  the  visible  region.  The  starburst  galaxies 
are  therefore  also  known  as  Lyman-break  galaxies. 

About  2  billion  years  after  the  big  bang  and  roughly  12  billion  years  ago,  some 
starburst  galaxies  had  an  exceptionally  high  rate  of  star  formation,  which  exceeded 
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100  stars  per  year  for  hundreds  of  millions  of  years  -  much  greater  than  the  rate  in 
most  galaxies  and  currently  in  the  Milky  Way,  which  is  about  one  new  star  every 
year.  Scarcely  any  stars  were  being  formed  during  the  first  half  billion  years  of  the 
expanding  universe,  perhaps  because  the  gravitational  forces  were  still  pulling  the 
galaxies  together,  and  after  the  bursts  of  star  formation  there  may  have  been  less 
material  available  for  forming  new  stars,  since  some  of  it  already  had  been  used  up 
in  creating  other  stars. 

The  fast  pace  of  star  formation  in  galaxies  in  the  early  universe  could  not  be 
continued  for  long  times.  It  would  use  up  the  interstellar  gas  from  which  stars  are 
formed  in  much  less  time  than  the  age  of  the  universe.  So  the  bursts  of  star 
formation  could  be  associated  with  rare  circumstances,  perhaps  feeding  off  gas 
stirred  up  as  a  result  of  collisions  or  close  encounters  between  galaxies. 

The  mergers  of  galaxies  may  not  be  the  dominant  method  of  high  star  growth. 
It  could  be  associated  with  a  voracious  consumption  of  hydrogen  gas,  which  has 
been  observed  in  greater  abundance  back  then  when  compared  to  more  recent 
times.  A  steady  supply  of  gas  may  have  streamed  in  from  filaments  of  dark  matter. 

When  the  first  stars  formed  out  of  collapsing  clouds  of  gas  and  ignited  the 
nuclear  reactions  that  make  them  shine,  the  early  universe  consisted  of  nothing 
more  than  the  light  elements,  hydrogen  and  helium.  These  young  stars  must  have 
been  uncontaminated  by  heavier  elements.  Some  of  these  “infant”  stars  most 
likely  were  very  massive,  perhaps  with  about  100  times  as  much  mass  as  the  Sun; 
therefore,  they  would  have  a  relatively  short  lifetime  on  the  cosmic  time-scale.  The 
first  massive  stars  would  have  exploded  as  supernovae,  spewing  out  ashes  of  dust 
made  of  heavy  elements  synthesized  within  them  and  spawning  the  next  genera¬ 
tion  of  stars. 

The  interstellar  medium  would  have  become  steadily  enriched  in  heavy  ele¬ 
ments  as  subsequent  generations  of  massive  stars  were  formed,  lived,  and  expired 
explosively.  They  would  have  seeded  their  surroundings  with  elements  such  as 
carbon,  oxygen,  and  iron,  which  were  needed  for  the  formation  of  Earth-like 
planets  and  life. 

Whether  a  galaxy  is  young  or  old,  there  will  always  be  many  more  stars  of  low 
mass  than  there  are  massive  stars  (see  Sect.  10.1).  The  stellar  mass  distribution  will 
also  depend  on  the  evolution  of  stars,  which  varies  with  mass.  The  initial  mass 
distribution  can  be  inferred  from  the  observed  stellar  luminosity  function,  or  the 
number  of  stars  of  different  absolute  luminosities,  by  using  the  stellar  mass- 
luminosity  relation  together  with  a  model  of  how  the  star  formation  rate  varies 
with  time. 

The  initial  mass  distribution  for  stars  more  massive  than  the  Sun  was  quantified 
by  the  Cornell  astronomer  Edwin  E.  Salpeter  (1924-2008),  who  showed  that  the 
number  of  stars  with  masses  in  the  range  M  to  M  +  dM  within  a  specified  volume 

_ o  o  c 

of  space,  is  proportional  to  M  '  (Salpeter  1955).  In  other  words,  the  number  of 
stars  in  each  mass  range  decreases  rapidly  with  increasing  mass. 

The  oldest  stars  in  our  Milky  Way  Galaxy,  which  were  formed  when  the 
universe  was  only  about  1  billion  years  old,  are  deficient  in  heavy  elements  when 
compared  to  stars  that  are  now  forming  in  the  Milky  Way.  No  one  has  yet  found  a 
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completely  pure  star  that  formed  out  of  uncontaminated  hydrogen.  Perhaps  such 
stars  now  are  inaccessible  to  direct  observation,  awaiting  the  next  space  telescope 
that  can  peer  deeper  into  the  remote  past. 


15.5  The  Evolution  of  Galaxies 
15.5.1  Active  Galactic  Nuclei 

By  peering  out  at  galaxies  that  are  located  at  vastly  different  distances  or  redshifts, 
astronomers  have  shown  that  the  entire  observable  universe  evolves  and  has  a 
history,  and  that  the  properties  of  galaxies  change  over  vast  time  scales.  The  light 
we  detect  from  a  remote  galaxy  has  traveled  for  a  very  long  time,  and  was  emitted 
in  the  galaxy’s  infancy  many  billions  of  years  ago.  And  when  our  telescopes 
observe  a  nearby  galaxy,  its  light  may  have  been  generated  a  few  million  years 
ago,  after  the  galaxy  has  aged  for  billions  of  years. 

Because  no  significant  change  in  the  equilibrium  of  galaxies  can  be  produced 
without  a  substantial  change  in  the  distribution  of  mass  and  angular  momentum,  it 
was  long  believed  that  no  significant  departures  from  a  stable  equilibrium  in  their 
shape,  form,  mass,  or  luminosity  would  be  produced  during  most  of  their  lifetimes. 
Nevertheless,  it  now  is  known  that  the  centers  of  galaxies  are  locations  of  pro¬ 
nounced  activity  that  disrupts  the  expected  equilibrium  and  that  galaxies  tend  to  be 
more  active  in  their  youth. 

The  American  astronomer  Carl  K.  Seyfert  (1911-1960)  provided  early  obser¬ 
vational  evidence  that  the  central  regions  of  some  galaxies  are  not  in  equilibrium 
when  he  examined  the  intense  blue  centers  of  certain  spiral  nebulae  -  a  type 
subsequently  named  Seyfert  galaxies  (Seyfert  1943).  Although  most  spirals  exhibit 
spectral  lines  in  absorption,  similar  to  the  absorption  spectra  of  stars,  the  central 
regions  of  Seyfert  galaxies  exhibit  intense  emission  lines  of  the  type  produced  by 
ionized  emission  nebulae  (also  see  Sect.  11.1).  They  are  the  emission  lines  of 
oxygen  [O  II],  [O  III],  nitrogen  [N  II],  neon  [Ne  III]  and  sulphur  [S  II],  as  well  as 
the  permitted  emission  transitions  of  unionized  hydrogen,  H,  and  ionized  helium, 
He  II  (Table  15.4). 

The  emission  lines  of  Seyfert  galaxies  are  unexpectedly  wide.  High-speed 
motions  of  the  ions  and  hydrogen  atoms  have  widened  the  emission  lines,  and  the 
velocities  implied  from  their  widths,  when  interpreted  by  the  Doppler  effect,  are  up 
to  8,500  km  s_1.  Because  the  central  masses  derived  from  the  rotation  curves  of 
spiral  nebulae  are  no  more  than  1011  solar  masses,  the  escape  velocities  of  the 
central  regions  are  only  a  few  hundred  km  s-1.  The  observed  motions  at  the 
centers  of  Seyfert  galaxies  are  therefore  far  in  excess  of  the  expected  escape 
velocities,  and  they  provide  the  first  evidence  for  violent  explosive  events  in  the 
nuclei  of  galaxies.  Their  matter  could  be  flowing  out  into  intergalactic  space;  some 
of  the  Seyfert  galaxies  exhibit  bright  filaments  that  suggest  the  ejection  of  gas. 
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Table  15.4 

Intense  emission  lines  found  in 

Seyfert  galaxiesa 

Element 

Wavelength  (nm) 

Element 

Wavelength  (nm) 

[O II] 

372.62 

He  I 

447.25 

[O II] 

372.89 

He  II 

468.57 

He  II 

376.89 

HjS 

486.1332 

H  9 

379.86 

[O  III] 

495.891 

He  II  +  H rj 

383.56 

[O  III] 

500.684 

[Ne  III] 

386.875 

He  I 

587.56 

Hg  +  He  I 

388.89 

He  II 

597.7 

[Ne  III] 

396.746 

[O I] 

630.0304 

He 

397.01 

[O I] 

636.3776 

[S II] 

406.85 

[N  II] 

644.803 

[S  II] 

407.65 

Ha 

656.281 

US 

410.17 

[N  II] 

658.341 

He  II  +  Uy 

433.86 

[S  II] 

671.647 

+  434.047 

[S  II] 

673.085 

[O  III] 

436.32 

a  Adapted  from  Seyfert  (1943) 


Weedman  (1977)  has  summarized  our  then  current  knowledge  of  Seyfert  gal¬ 
axies.  Early  considerations  of  violent  activity  in  the  nuclei  of  galaxies  can  be  found 
in  Ambartsumian  (1958),  Burbidge  et  al.  (1963)  and  Lynden-Bell  (1969). 

Powerful  cosmic  radio  sources  provide  additional  evidence  for  intense  activity 
in  the  central  regions  of  young  galaxies.  But  since  there  was  no  radio  technique  for 
establishing  distances,  the  optical  wavelength  counterparts  of  the  radio  sources  had 
to  be  used  to  determine  how  far  away  they  were.  Research  groups  led  by  Joseph  L. 
Pawsey  (1908-1962)  in  Australia  and  by  Martin  Ryle  (1918-1984)  in  Cambridge, 
England  built  interferometers  that  were  used  to  obtain  accurate  positions  and 
identify  the  optical  counterparts  of  the  brightest  radio  sources,  named  by  the 
constellation  they  appeared  in.  The  Australian  group  identified  the  source  named 
Virgo  A  -  also  numbered  3C  274  in  the  third  Cambridge  catalogue  of  bright  radio 
sources  -  with  giant  elliptical  galaxy,  M  87  (Bolton  et  al.  1949),  and  an  accurate 
position  established  with  the  Cambridge  radio  interferometer  was  used  to  identify 
the  radio  source  Cygnus  A,  numbered  3C  405,  with  another  elliptical  galaxy 
(Baade  and  Minkowski  1954).  Like  the  nuclei  of  Seyfert  galaxies,  the  optical 
counterpart  of  Cygnus  A  emits  strong  “forbidden”  emission  lines  of  [O  II],  [O  III], 
[N  II]  and  [Ne  III]  with  widths  corresponding  to  velocities  of  a  least  1,000  km  s-1. 

Moreover,  the  Doppler  shift  of  the  central  wavelengths  of  these  spectral  lines 
exhibited  a  redshift  of  0.0561,  and  a  recession  velocity  of  16,820  km  s-1.  Using 
the  Hubble  law  with  this  redshift,  the  radio  galaxy  Cygnus  A  lies  at  a  distance  of 
about  224  Mpc,  or  731  million  light-years,  for  a  Hubble  constant 
H0  =  75  km  s'1  Mpc-1,  and  the  apparent  radio  luminosity  and  distance  can  be 
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combined  to  infer  an  enormous  absolute  radio  luminosity  of  about  10  J  s  .  It  is 
emitting  as  much  power  at  radio  wavelengths  as  the  visible  luminosity  of  a  million 
million  stars  like  the  Sun,  of  absolute  luminosity  L0  =  3.828  x  1026  Js-1,  whose 
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Fig.  15.6  Radio  galaxy  Cygnus  A  The  radio  galaxy  Cygnus  A,  listed  as  3C  405  in  the  third 
Cambridge  catalogue  of  bright  radio  sources,  which  has  a  radio  output  1  million  times  more 
powerful  than  the  radio  emission  of  a  normal  galaxy  like  the  Milky  Way.  This  radio  image,  taken 
with  the  Very  Large  Array  at  a  wavelength  of  6  cm  with  a  held  of  view  of  0.038  x  0.022  degrees, 
shows  two  narrow,  straight  radio-emitting  jets  of  particles  that  protrude  in  opposite  directions 
from  a  giant  elliptical  galaxy  at  the  center.  The  redshift  of  the  optically  visible  elliptical  is 
z  =  0.056075,  indicating  a  distance  of  about  224  Mpc  or  780  million  light-years,  and  a  linear 
extent  for  the  radio  galaxy  of  about  1  million  light-years  from  end  to  end.  The  radio  jets  probably 
were  ejected  along  the  rotation  axis  of  a  super-massive  black  hole  located  within  a  central 
elliptical  galaxy.  It  had  to  be  active  for  tens  of  millions  of  years  to  produce  the  two  radio  lobes. 
(Courtesy  of  NRAO/AUI/NSF.) 

most  intense  radiation  is  also  at  the  visible,  optical  wavelengths  with  relatively  dim 
radio  emission. 

When  the  optical  image  of  the  elliptical  galaxy  associated  with  Cygnus  A  is 
combined  with  maps  of  the  radio  signals,  it  is  found  that  the  radio  emission  is  not 
confined  to  its  visible  counterpart;  instead  it  is  concentrated  in  two  radio  lobes  that 
are  separated  from  the  central  visible  galaxy  by  hundreds  of  thousands  of  light-years. 
It  is  as  if  the  radio-emitting  clouds  were  expelled  from  the  central  elliptical  galaxy, 
which  is  detectable  only  at  optically  visible  wavelengths.  The  astonishing  radio 
power  is  attributed  to  the  nonthermal  synchrotron  radiation  of  high-speed  electrons 
supplied  from  the  visible  center  along  two  oppositely  directed  jets  that  feed  the  radio 
lobes  (Fig.  15.6).  These  dual  jets  remain  extraordinarily  straight  and  surprisingly 
stable,  energizing  the  radio  lobes  and  pushing  them  farther  and  farther  apart. 

Harris  and  Krawcynski  (2006)  discussed  x-ray  emission  from  extragalactic  jets; 
Bridle  and  Perley  (1984)  have  reviewed  extragalactic  radio  jets;  Kellermann  and 
Pauliny-Toth  (1981)  have  reviewed  our  knowledge  of  compact  radio  sources,  and 
Miley  (1980)  has  discussed  the  structure  of  extended  radio  sources. 

If  the  radio  galaxy  has  been  sending  out  radio  power  at  the  present  rate  at  an 
estimated  million-year  lifetime,  then  it  has  emitted  radio  energy  equivalent  to  the 
complete  annihilation  of  about  a  100,000  stars. 


Example:  Feeding  the  radio  lobes  of  Cygnus  A 

The  radio  galaxy  Cygnus  A  has  a  redshift  of  z  =  0.056,  which  from  the 
Hubble  law  cz  =  H0D  provides  a  distance,  D,  of  224  Mpc  =  6.91  x  1024  m 
and  about  730  million  light-years  for  a  Hubble  constant  of  H0  = 
75  km  s'1  Mpc-1  where  the  speed  of  light  is  c  =  2.9979  x  105  km  s-1  and 
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1  Mpc  =  3.0857  x  10^  m.  Its  two  lobes  are  separated  by  an  angle  of 
6  «  0.038  degrees  «  137"  and  6.63  x  10  4  radians,  using  the  conversion 
factor  of  1  radian  =  2.06265  x  105  s  of  arc.  The  minimum  amount  of  time, 
t ,  that  the  central  elliptical  galaxy  has  been  feeding  the  radio  lobes  is 
t  =  9D/(2c),  where  the  distance  between  the  visible  galaxy  and  one  radio 
lobe  is  OD/2.  That  time  is  t  —  7.6  x  10 12  s  =  0.24  x  106  years,  where 
1  year  =  3.1557  x  107  s. 

Q  O 

The  absolute  radio  luminosity  of  Cygnus  A  is  about  LR  «  1.3  x  10 
J  s'1  «  3.4  x  1011  L0,  where  L0  =  3.828  x  1026  J  s”1  is  the  visible 
absolute  luminosity  of  the  Sun.  Thus,  the  radio  power  emitted  by  Cygnus  A 
is  comparable  to  the  visible-light  power  of  340  billion  stars  like  the  Sun, 
exceeding  that  of  our  Milky  Way  Galaxy  of  about  100  billion  stars.  Over  its 
lifetime  of  a  quarter  of  a  million  years,  Cygnus  A  emits  about  1051  J  of  radio 
energy,  which  is  comparable  to  the  rest  mass  energy  of  5,594  solar  masses, 
or  5,594  M0c 2  for  a  solar  mass  M0  =  1.989  x  1030  kg  and  the  speed  of 
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light  c  =  2.9979  x  10  m  s  .  Finding  a  radiation  mechanism  comparable 
to  the  complete  annihilation  of  5,594  stars  was  initially  problematic,  until 
super-massive  black  holes  provided  the  answer. 


Even  more  dramatic  sources  of  energy  were  found  still  deeper  in  space  and 
generated  a  longer  time  ago.  As  with  radio  galaxies,  the  discovery  of  the  first 
quasar  resulted  from  the  accurate  location  of  a  bright  radio  source,  which  was 
determined  when  the  Moon  happened  to  pass  in  front  of  it.  As  the  radio  astronomer 
Cyril  Hazard  (1928-  ),  then  at  the  University  of  Sydney  in  Australia,  realized,  a 
careful  timing  of  the  disappearance  and  reappearance  of  the  occulted  source  would 
establish  a  precise  position,  since  the  location  of  the  Moon’s  edge  is  known 
accurately  for  any  time. 

In  1962  Hazard  and  his  colleagues  used  the  occultation  method  to  show  that  3C 
273  is  a  double  radio  source,  one  component  of  which  apparently  coincided  with  a 
blue  stellar  object.  This  coincidence  prompted  Maarten  Schmidt  (1929-  )  to  obtain 
an  optical  spectrum  of  the  blue  object  using  the  5  m  (200  inch)  Palomar  telescope, 
which  indicated  an  exceptionally  high  recession  velocity  of  0.16  %  of  the  speed  of 
light.  When  he  told  his  colleague  Jesse  Greenstein  (1909-2002)  about  the  dis¬ 
covery,  Greenstein  produced  a  list  of  emission  line  wavelengths  for  the  optically 
visible  counterpart  of  another  radio  source  3C  48,  and  within  minutes  they  had 
found  that  it  is  rushing  away  with  an  even  faster  motion  at  37  %  of  the  speed  of 
light. 

When  these  velocities  are  used  to  infer  distances  using  the  Hubble  law,  it  is 
found  that  3C  48  and  3C  273  are  located  at  distances  of  billions  of  light-years.  And 
when  their  observed  luminosities  are  combined  with  these  distances,  it  was  found 
that  they  are  shining  with  the  visible  blue  light  of  10  million  million,  or  10  , 
Sun-like  stars. 
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The  sequence  leading  to  the  discovery  of  these  hitherto  unknown  objects  of 
tremendous  velocity,  distance,  and  luminosity  happened  so  quickly  that  the  article 
reporting  the  major  discoveries  appeared  together  in  a  six-page  sequence  in  the 
journal  Nature  (Hazard  et  al.  1963;  Schmidt  1963;  Greenstein  and  Matthews  1963). 

Because  the  bright  objects  appeared  star-like  in  visible  light,  they  became 
known  as  quasistellar  radio  sources ,  a  term  that  soon  was  shortened  to  quasars. 
The  quasars  had,  in  fact,  been  ignored  as  stars  on  optical  photographs  for  years. 
Once  quasars  were  known,  astronomers  located  others  by  obtaining  optical  spectra 
of  bright,  blue-colored,  star-like  objects  that  are  located  well  outside  the  plane  of 
the  Milky  Way,  where  stars  are  not  supposed  to  be,  and  measuring  the  large 
redshifts  characteristic  of  remote  quasars.  Thousands  of  quasars  have  now  been 
discovered  in  this  way,  some  of  them  emitting  intense  radio  signals  and  many 
more  silent  ones  with  their  radios  turned  off. 

Astronomers  gradually  came  to  realize  that  quasars  are  brilliant,  tiny  cores, 
sometimes  smaller  than  the  solar  system,  embedded  in  much  larger,  extremely 
active  galaxies,  whose  outer  parts  are  difficult  to  detect  in  the  intense  quasar  glare. 
From  its  vantage  point  in  space,  the  Hubble  Space  Telescope  resolved  the  core 
quasar  light  and  removed  it  from  the  computerized  images  to  detect  the  faint, 
fuzzy  halo  of  a  host  galaxy  that  is  as  large  as  the  elliptical  galaxies  found  at  the 
centers  of  many  intense  radio  sources. 

Quasars  are  believed  to  be  very  luminous  versions  of  the  same  blue  nuclei  that 
Seyfert  observed  in  the  center  of  nearby  spiral  galaxies.  The  visible-light  emission 
of  quasars  exhibits  the  same  emission  lines  as  both  the  Seyfert  galaxies  and  the 
central  elliptical  galaxies  of  radio  galaxies  (Lynden-Bell  1969).  Lang  et  al.  (1975) 
present  a  composite  Hubble  diagram  that  includes  normal  galaxies,  radio  galaxies 
and  quasi-stellar  objects  in  the  context  of  the  evolution  of  the  universe. 

Seyfert  galaxies,  radio  galaxies,  and  quasars  all  belong  to  a  common  class, 
known  collectively  as  active  galactic  nuclei.  Modern  astronomers  are  now 
investigating  active  galactic  nuclei  using  the  Hubble  Space  Telescope ,  the  Spitzer 
Space  Telescope  and  large  ground  based  telescopes  operating  at  visible,  infrared, 
millimeter  and  radio  wavelengths.  Fabian  (2012)  has  discussed  observational 
evidence  for  active  galactic  nuclei  feedback.  Ho  (2008)  reviewed  nuclear  activity 
in  nearby  galaxies;  Crenshaw  et  al.  (2003)  have  reviewed  evidence  for  mass  loss 
from  the  nuclei  of  active  galaxies;  Osterbrock  (1991)  provided  a  review  of  active 
galactic  nuclei;  Sulentic  et  al.  (2000)  have  described  broad  emission  lines  in  active 
galactic  nuclei;  Osterbrock  and  Mathews  (1986)  reviewed  emission-line  regions  of 
active  galaxies  and  QSOs;  Weymann  et  al.  (1981)  have  reviewed  absorption  lines 
in  the  spectra  of  quasi-stellar  objects;  and  Ulrich  et  al.  (1997)  have  discussed  the 
variability  of  active  galactic  nuclei. 

The  active  galactic  nuclei  radiate  so  powerfully  over  the  entire  range  of  the 
electromagnetic  spectrum  that  they  cannot  possibly  consist  of  ordinary  stars,  which 
emit  most  of  their  luminous  output  in  a  narrow  band  of  wavelengths  grouped 
around  visible  light.  However,  super-massive  black  holes  can  account  for  the 
prodigious  energy  output,  violent  activity,  and  rapid  variations  of  active  galactic 
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nuclei,  as  well  as  jets  of  material  that  moves  out  of  them  at  extremely  high 
relativistic  speeds  that  approach  the  speed  of  light. 


15.5.2  Super-Massive  Black  Holes 

As  independently  proposed  by  the  astrophysicists  Edwin  E.  Salpeter  (1924-  ),  at 
Cornell  University,  and  Yakov  B.  Zeldovich  (1914-1987)  in  Moscow,  the  tre¬ 
mendous  luminosity  of  every  radio  galaxy  and  quasar  most  likely  is  supplied  by  a 
super-massive  black  hole ,  which  emits  luminous  radiation  as  its  powerful  gravity 
pulls  in  surrounding  stars  and  gas  (Salpeter  1964;  Zeldovich  1964).  The  gravita¬ 
tional  pull  of  a  mass  equivalent  to  100  million  Suns  is  needed  to  balance  the  visible 
quasar  luminosity;  otherwise  its  radiation  pressure  would  blow  away  the  quasar. 
Such  a  super-massive  black  hole  would  be  sufficiently  small  and  powerful  enough 
to  explain  the  tiny  sizes  and  the  colossal  brightness  of  quasars.  The  super-massive 
black  hole’s  rotational  energy  is  used  to  accelerate  charged  particles  and  spew 
them  out  in  diametrically  opposite  directions  along  its  rotation  axis  at  about  the 
speed  of  light,  continuously  feeding  the  two  radio  lobes  commonly  found  sym¬ 
metrically  placed  from  the  center  of  radio  galaxies  and  quasars. 

Rees  (1984)  has  reviewed  black  hole  models  for  active  galactic  nuclei,  and 
Begelman  et  al.  (1984)  and  Lang  (1999)  have  reviewed  the  theory  of  extragalactic 
radio  sources.  Longair  (2011)  provides  a  detailed  treatment  of  the  high-energy 
astrophysics  used  to  describe  cosmic  radio  and  x-ray  emission. 

Kormendy  and  Richstone  (1995)  have  reviewed  the  search  for  super-massive 
black  holes  in  galactic  nuclei.  The  classic  example  is  M  87,  a  giant  elliptical 
galaxy  whose  central  spinning  disk  of  hot  gas  indicates  that  a  super-massive  black 
hole  resides  at  its  center.  M  87  is  close  enough  to  measure  the  motions  of  stars,  and 
their  increasing  velocities  toward  the  center  indicate  that  billions  of  solar  masses 
must  be  crammed  within  a  very  small,  unseen  volume  to  keep  the  high-velocity 
stars  from  flying  into  space  (Sargent  et  al.  1978;  Macchetto  et  al.  1997;  Gebhardt 
and  Thomas  2009). 

A  one-sided  jet  of  gas  emerges  from  the  center  of  M  87  and  stretches  out  into 
one  of  the  two  lobes  of  the  radio  galaxy  Virgo  A  -  numbered  3C  274  in  the 
Cambridge  survey  (Fig.  15.7).  The  motions  of  bright  concentrations  in  the  jet 
indicate  that  they  are  traveling  outward  at  about  half  the  speed  of  light.  And  Very 
Long  Baseline  Interferometry  observations  with  widely  separated  radio  telescopes 
reveal  that  the  M  87  jet  emerges  from  a  region  at  most  6  light-years  across,  most 
likely  harboring  the  super-massive  black  hole  that  produces  the  jet. 

Monstrous,  super-massive  black  holes  seem  to  inhabit  the  centers  of  all  gal¬ 
axies.  They  are  massive,  scaled-up  versions  of  stellar  black  holes,  with  millions  if 
not  billions  of  times  the  mass  of  the  Sun  packed  into  a  region  only  a  few  light- 
years  across.  Like  their  stellar  counterparts,  the  super-massive  black  holes  cannot 
be  observed  directly.  Their  presence  is  inferred  from  the  orbital  motion  of  nearby 
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Fig.  15.7  Radio  jet  from  M  87  The  bright  radio  source  Virgo  A,  also  designated  3C  274, 
coincides  with  M  87,  a  giant  elliptical  galaxy  with  a  redshift  of  z  —  0.00436  and  a  distance  of 
about  17.4  Mpc  or  56.8  million  light-years.  M  87  is  the  largest  and  brightest  galaxy  within  the 
Virgo  cluster  of  galaxies.  The  core  of  M  87  contains  a  super-massive  black  hole  of  about 
6.3  billion  solar  masses,  or  6.3  x  109  Af0,  which  is  1,500  times  more  massive  than  the  black  hole 
at  the  center  of  our  Milky  Way  Galaxy.  This  radio  map,  made  with  the  Very  Large  Array,  shows 
two  elongated  lobes,  one  on  either  side  of  the  center  of  M  87,  apparently  fed  by  the  super-massive 
black  hole.  The  most  intense  radio  emission  comes  from  a  jet  that  emerges  from  the  core  of  the 
galaxy  and  extends  about  5,000  light-years  into  one  of  the  two  lobes.  The  observed  high-speed 
motion  of  bright  knots  in  the  jet  implies  that  its  radio-emitting  electrons  are  traveling  at  nearly  the 
speed  of  light.  (Courtesy  of  NRAO/AUI/NSF.) 


visible  stars  the  trajectories  of  which  are  guided  by  the  otherwise  invisible  black 
holes. 

The  faster  the  stars  are  moving,  the  more  gravity  -  and  therefore  mass  -  is 
needed  to  hold  the  stars  in  their  orbits.  By  measuring  the  sharp  rise  in  orbital 
velocity  at  close  distances  from  galaxy  centers,  astronomers  have  weighed  unseen 
super-massive  black  holes  in  nearby  giant  elliptical  galaxies,  which  are  the 
brightest  galaxies  in  clusters  of  galaxies.  The  central  black-hole  powerhouse  in 
relatively  nearby  galaxies,  designated  M  87,  NGC  3842,  and  NGC  4889,  tips  the 
scales  at  6.3,  9.7  and  21  billion  solar  masses,  respectively.  Without  a  gravitational 
pull  equivalent  to  about  10  billion  Sun-like  stars,  the  close,  fast-moving  stars 
would  fly  away  from  the  galaxies  (McConnell  et  al.  2011).  Such  central,  super- 
massive  black  holes  most  likely  reside  in  more  distant  galaxies  that  are  too  far 
away  to  resolve  central  stars  and  measure  their  motions. 

Quasars  and  active  galactic  nuclei  become  increasingly  numerous  as  we  look 
deeper  into  space,  at  larger  redshifts.  The  number  density  of  unobscured  quasars 
peaks  at  a  redshift  between  z  — 2  and  z  =  3,  which  indicates  that  the  hot,  lumi¬ 
nous  spurt  of  activity  happened  in  the  distant  past,  about  10  billion  years  ago  and 
shortly  after  the  first  galaxies  were  born.  At  smaller  redshifts  and  closer  distances, 
corresponding  to  an  old  age,  there  are  relatively  few  quasars.  The  spurt  of  activity 
apparently  became  worn  out  and  used  up  as  the  galaxies  grew  older. 

To  power  the  youthful  activity  of  a  quasar,  there  has  to  be  about  1  solar  mass 
per  year  of  gas  flowing  into  the  super-massive  black  hole.  Therefore,  billions  of 
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stars  or  the  equivalent  amount  of  gas  must  be  consumed  as  its  active  nucleus 
evolves  during  the  course  of  billions  of  years.  The  supply  dwindles  away  over  time 
and  the  activity  dies  down,  but  the  black  hole  does  not  disappear. 

Most  galaxies  probably  contain  super-massive  black  holes  at  their  center.  Those 
in  the  older,  nearby  galaxies  are  the  starving  remains  of  former  quasars,  with  a 
dwindling  supply  of  material  that  once  fed  a  higher  rate  of  activity.  They  are  found 
in  ordinary  nearby  galaxies,  such  as  Andromeda,  whose  cores  are  surviving  fossils 
of  former  quasars.  Our  Galaxy,  the  Milky  Way,  is  almost  as  old  as  the  observable 
universe,  and  it  contains  a  central  super-massive  black  hole.  However,  its  mass  is 
equivalent  to  only  about  1  million  stars  like  the  Sun  rather  than  the  billions  in 
some  super-massive  black  holes  (see  Sect.  14.1). 


15.5.3  Gamma-Ray  Bursts 

The  brightest  sources  found  in  the  universe,  at  least  so  far,  are  the  gamma-ray 
bursts  whose  duration  is  measured  in  seconds  or  less  and  which  never  reappear  in 
exactly  the  same  part  of  the  sky.  They  emit  energy  at  a  gamma-ray  wavelength 
shorter  than  10“ 11  m,  so  each  photon  of  a  gamma  ray  is  about  100,000  times  more 
energetic  than  a  photon  of  visible  light.  When  it  was  found  that  they  originate  in 
remote  galaxies,  it  was  realized  that  the  observed  gamma  ray  bursts  might  radiate, 
for  a  few  seconds,  gamma-ray  energy  far  in  excess  of  the  visible-light  energy 
emitted  by  galaxies.  These  gamma-ray  bursts  can  briefly  become  the  brightest 
electromagnetic  events  in  the  universe. 

The  initial  burst  is  usually  followed  by  a  longer-lived  “afterglow”  emitted  at 
longer  wavelengths,  from  x-rays  to  optical  and  radio  of  galaxies.  Gamma-ray 
bursts  are  attributed  to  intense  radiation  emitted  during  a  supernova  explosion 
when  a  rapidly  rotating,  high  mass  star  collapses  to  form  a  neutron  star  or  black 
hole. 

The  discovery  of  the  gamma  ray  bursts  was  the  unexpected  result  of  defense 
satellite  observations  designed  to  detect  clandestine  nuclear  bomb  explosions  in 
the  Earth’s  atmosphere,  on  the  Moon,  or  in  outer  space,  but  brief,  intense  gamma- 
ray  bursts  were  instead  found  to  be  coming  from  the  distant  Cosmos. 

The  secret  Vela  satellites  were  launched  and  operated  in  identical  pairs  on 
opposite  sides  of  a  circular  orbit  around  the  Sun,  eventually  with  sufficient  time 
resolution  to  determine  the  direction  of  the  source  from  the  difference  in  arrival 
time  of  its  radiation  at  two  or  more  satellites.  Although  a  nuclear  bomb  was  never 
detected,  unexpected  flashes  of  gamma  rays  were  discovered  coming  from  dif¬ 
ferent  places  in  deep  space,  with  an  existence  measured  in  seconds.  Moreover,  the 
lag  between  the  burst  arrival  times  at  the  two  defense  satellites  indicated  that  they 
originated  far  beyond  the  solar  system,  and  were  therefore  of  cosmic  origin. 

The  gamma-ray  bursts  were  kept  secret  for  about  five  years,  until  the  Los 
Alamos  scientists,  Ray  W.  Klebesadel  (1932-  ),  Ian  B.  Strong  (1930-  )  and 
Roy  A.  Olson  (1924-  ),  described  the  discovery  in  the  Astrophysical  Journal 
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Fig.  15.8  The  accelerating  expansion  of  the  universe  The  Hubble  diagram  plot  of  the  apparent 
magnitude  of  Type  la  supernovae  plotted  as  a  function  of  their  redshift.  At  a  redshift  below  about 
z  =  0.1,  there  is  a  linear  fit  to  the  data,  but  at  larger  redshifts,  the  observations  begin  to  diverge 
from  a  straight  line.  The  curved  departures  for  distant  supemovae  at  high  redshift  indicate  an 
acceleration  in  which  the  speed  of  expansion  is  increasing.  The  observed  data  can  be  compared  to 
cosmological  models  with  different  values  of  the  omega  parameter,  Q.  It  is  the  ratio  of  the 
inferred  density  to  the  critical  mass  density  needed  to  stop  the  expansion  of  the  universe  in  the 
future.  The  subscript  A  denotes  the  cosmological  constant,  a  possible  form  of  dark  energy,  and 
the  subscript  m  denotes  matter.  (Adapted  from  Saul  Perlmutter,  Physics  Today  April  2003.  From 
“The  Life  and  Death  of  Stars”  by  Kenneth  R.  Lang,  published  by  Cambridge  University  Press, 
2013.  Reprinted  with  permission.) 

(Klebesadel  et  al.  1973).  Civilian  astronomers  might  blame  the  military  for 
keeping  such  an  important  discovery  hidden  for  years,  but  it  is  also  likely  that  the 
gamma-ray  bursts  would  have  never  been  discovered  if  it  wasn’t  for  the  defense 
satellites.  No  other  government  agency  was  likely  to  fund  a  speculative  program 
designed  to  look  for  such  totally  unknown,  unsuspected,  and  unprecedented  events 
in  outer  space. 

The  long  lasting  afterglow  of  the  cosmic  gamma-ray  bursts  was  eventually 
detected  at  visible  wavelengths,  enabling  the  distances  to  be  determined  by 
spectroscopy  and  the  Hubble  law,  and  the  enormous  energies  realized  (Van 
Paradijs  et  al.  1997;  Frail  et  al.  1997;  Metzger  et  al.  1997;  Kulkarni  et  al.  1998). 
If  these  bursts  emit  radiation  in  all  directions,  then  the  total  gamma  ray  luminosity 
of  the  burst  can  briefly  exceed  the  combined  visible-light  luminosity  of  millions  of 
galaxies.  A  lower  gamma  ray  luminosity  is  inferred  if  the  bursts  are  beamed,  like  a 
pulsar’s  radio  emission.  At  least  some  of  the  gamma  ray  bursts  are  attributed  to 
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powerful  extragalactic  supernovae  explosions  of  very  massive  stars  whose  centers 
collapse  into  black  holes. 

Gehrels  et  al.  (2009)  have  reviewed  gamma-ray  bursts  in  the  Swift  era;  Fishman 
and  Meegan  (1995)  discussed  earlier  observations  of  gamma  ray  bursts.  Woosley 
and  Bloom  (2006)  have  presented  the  supernova-gamma-ray  burst  connection,  and 
Meszaros  (2002)  has  provided  a  review  of  theories  of  gamma-ray  bursts.  Paradijs 
et  al.  (2000)  have  reviewed  gamma-ray  burst  afterglows;  and  Weiler,  Panagia,  and 
Sramek  (2002)  have  summarized  our  knowledge  of  radio  emission  from  super¬ 
novae  and  gamma-ray  bursts.  Fender  and  Belloni  (2004)  have  discussed  GRS 
1915  +  105  and  the  disc-jet  coupling  in  accreting  black  hole  systems. 


15.6  Dark  Energy,  the  Cosmological  Constant,  and  How  it 
All  Ends 

15.6.1  Discovery  of  Dark  Energy 

The  redshift,  z,  of  a  galaxy  increases  with  its  distance,  D,  according  to  the  Hubble 
law  cz  =  H0D ,  where  c  is  the  speed  of  light  and  H0  is  the  Hubble  constant.  Since 
distances  can  only  be  independently  measured  for  nearby  galaxies,  the  law  is 
expressed  as  a  redshift  -  magnitude  relation  applicable  at  larger  redshifts.  This 
relation  is  given  by 


m  =  5  \og(cz/Ho)  +  M  +  25,  (15.25) 

where  the  apparent  magnitude  is  denoted  by  m  and  the  absolute  magnitude  by  M.  If 
all  galaxies  have  the  same  absolute  magnitude,  then  a  plot  of  m  against  log  z  will 
describe  a  straight  line  whose  slope  is  related  to  the  Hubble  constant. 

At  redshifts  greater  than  one,  observable  departures  from  linearity  in  the 
apparent  magnitude  -  log  redshift  diagram  could  occur  due  to  changes  in  the 
expansion  velocity.  If  the  expansion  of  the  universe  is  slowing  down  and  decel¬ 
erating,  for  example,  then  the  apparent  brightness  would  be  greater  than  expected 
from  uniform  expansion,  and  the  apparent  magnitude  less  than  expected  since 
greater  brightness  means  a  smaller  magnitude. 

Supernova  explosions  of  Type  la,  denoted  by  SNe  la,  provide  a  useful  method 
for  measuring  such  possible  effects  (Reiss  et  al.  1995,  1996;  Branch,  1988).  Since 
they  are  the  result  of  a  thermonuclear  explosion  of  a  white  dwarf  star  that  has 
grown  above  the  Chandrasekhar  limit  (Sect.  13.4),  there  is  little  variation  in  their 
absolute  magnitude  of  MB  =  19.6  zb  0.2  in  blue  light, and  they  are  so  luminous 
that  they  can  be  detected  at  relatively  large  redshifts  of  z  =  0.5- 1.5. 

In  the  late  1980s,  a  group  led  by  Saul  Perlmutter  (1959-  )  at  the  Lawrence 
Berkeley  National  Laboratory  began  a  dedicated  search  for  Type  la  supernovae  to 
measure  how  fast  the  expansion  of  the  universe  was  slowing  down,  due  to  the 
gravitational  pull  of  its  combined  matter.  By  1998,  his  Supernova  Cosmology 
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Project ,  and  a  rival  group,  dubbed  the  High-z  Supernova  Search ,  announced  that 
the  light  from  the  distant  supernovae  was  fainter  than  predicted,  which  meant  that 
the  galaxies  are  speeding  up,  expanding  at  a  quickening  pace,  and  accelerating 
instead  of  slowing  down  by  gravity  (Perlmutter  et  al.  1998,  1999;  Riess  et  al.  1988; 
Schmidt  et  al.  1988).  In  other  words,  the  distant  galaxies  were  not  where  they  were 
supposed  to  be,  and  the  space  they  are  in  seems  to  be  expanding  at  a  faster  rate  as 
time  goes  on. 

The  redshift  -  magnitude  diagram  of  SNe  la  has  a  straight-line,  linear  shape  at 
low  redshifts,  as  expected  from  the  Hubble  law,  but  unexpected  nonlinear  effects 
appear  at  large  redshifts  (Fig.  15.8).  The  line  describing  the  data  indicates  that  the 
galaxies  are  expanding  at  a  quickening  pace. 

Saul  Perlmutter,  Brian  P.  Schmidt  (1967-  ),  and  Adam  G.  Riess  (1969-  )  were 
awarded  the  2011  Nobel  Prize  in  Physics  for  the  discovery  of  the  accelerating 
expansion  of  the  universe  through  observations  of  distant  supernovae. 

Distant  galaxies  are  being  accelerated  by  the  anti-gravity  push  of  a  mysterious 
dark  energy.  So  the  fate  of  the  universe  is  no  longer  supposed  to  depend  on  its 
mass,  but  rather  on  its  energy.  If  dark  energy  retains  its  vigor,  the  universe  will  not 
stop  expanding. 

Dark  energy,  which  pushes  matter  apart,  is  not  the  same  as  dark  matter,  which 
encourages  attraction.  But  the  discovery  of  dark  energy  did  do  away  with  the  need 
for  overwhelming  amounts  of  dark  matter  to  keep  the  universe  poised  on  the  edge 
of  future  collapse,  but  never  quite  pulling  it  there.  No  more  than  one  quarter  of  the 
critical  mass  density  is  now  imagined  to  reside  in  mass  of  any  kind,  and  astron¬ 
omer’s  observations  of  such  a  low-density  universe  are  now  widely  accepted.  Dark 
energy  has  taken  over;  perhaps  keeping  the  universe  at  the  brink  of  closure  within 
“flat”  space,  which  is  described  by  Euclidean  geometry. 

Frieman  et  al.  (2008)  have  reviewed  dark  energy  and  the  acceleration  of  the 
expansion  of  the  universe.  Ratra  and  Vogeley  (2008)  have  reviewed  the  current 
standard  model  for  evolution  of  the  universe,  including  big-bang  cosmology, 
inflation,  dark  matter  and  dark  energy,  as  well  as  the  formation  and  observations  of 
galaxies  and  stars. 


15.6.2  Using  the  Cosmological  Constant  to  Describe  Dark 
Energy 

The  trouble  is,  nobody  understands  this  mysterious  something,  this  dark  energy 
that  permeates  space  and  eventually  overwhelms  the  gravitational  self-attraction  of 
the  entire  material  universe.  But  an  old  idea,  termed  the  cosmological  constant,  has 
been  revived  to  give  dark  energy  another  name  and  couch  it  in  mathematical 
terms.  That’s  the  anti-gravity  fudge  factor  that  Einstein  introduced  to  stabilize  a 
non-moving  universe  against  collapse. 
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When  the  expanding  universe  had  not  yet  been  discovered,  it  looked  as  if  the 
unrelenting,  universal  attraction  of  gravity  would  cause  the  eventual  collapse  of  an 
unmoving  universe,  so  Einstein  (1917a,  b)  inserted  a  mathematical  fix,  the  cos¬ 
mological  constant,  into  his  relativity  equations.  The  extra  term  represented  the 
repulsive  force  of  an  unknown  and  undetected  form  of  energy  that  permeated 
space  and  exerted  a  sort  of  outward  pressure  that  opposed  gravity  and  kept  the 
universe  from  collapsing. 

In  a  little  more  than  a  decade,  it  was  discovered  that  the  galaxies  are  moving 
away  from  us  in  a  cosmic  expansion,  so  the  universe  wasn’t  static,  or  non-moving, 
after  all.  Einstein  therefore  abandoned  the  cosmological  constant,  and  stated  that 
the  ad  hoc  term  was  greatly  detrimental  to  the  formal  beauty  of  his  theory. 

But  the  artifice  stubbornly  refused  to  die,  and  has  been  repeatedly  invoked 
whenever  cosmologists  have  had  trouble  reconciling  their  theories  with  observa¬ 
tions.  All  they  had  to  do  was  revive  the  term,  stick  it  in  the  relevant  equations,  and 
adjust  its  value.  As  an  example,  Georges  Lemaitre  (1894-1966)  constructed  a 
model  universe  with  two  periods  of  accelerated  expansion,  one  at  the  beginning 
and  one  later,  and  a  more  gentle  coasting  period  in  between  (Fig.  15.9),  pro¬ 
claiming  that  the  expansion  thus  took  place  in  three  phases:  a  first  period  of  rapid 
expansion  a  second  period  of  slowing-up,  followed  by  a  third  period  of  accelerated 
expansion  (Lemaitre  1931a,  b).  It  is  doubtless  in  the  third  period  we  find  ourselves 
today.  This  interpretation  is  somewhat  similar  to  some  modern  explanations  of 
dark  energy,  beginning  with  rapid  inflation  and  with  a  currently  accelerated 
expansion  that  might  invoke  the  cosmological  constant. 

Peebles  and  Ratra  (2003)  have  provided  an  extensive  review  of  the  cosmo¬ 
logical  constant  and  dark  energy,  whereas  Carroll  et  al.  (1992)  provided  a  review 
of  the  cosmological  constant  in  another  context. 

The  equations  that  describe  a  homogeneous,  isotropic  expanding  universe, 
given  in  Chap.  14,  Sect.  14.6  for  zero  cosmological  constant  A  =  0,  have  to  be 
rewritten  to  allow  for  a  non-zero  value  of  this  constant.  The  Hubble  expansion 
parameter  H(t)  is  then  given  by: 


%)\2_  fdR(t)/dt\ 

m)  ~  wur J 


kc 2  A 
R2(t)  +  V’ 


(15.26) 


_  11  O  _  1  _ O 

where  R(t)  is  the  scale  factor  of  the  universe,  G  =  6.674  x  1CT11  m  kg  s  is 
the  gravitational  constant,  p(t)  is  the  mass-energy  density,  which  is  equal  to  the 
radiation  energy  density  pr(t)  in  the  early  radiation-dominated  era  and  the  matter 
density  pm{t )  in  the  current  matter- dominated  era,  the  space  curvature  constant  is 
k  =  —  1,  Oor+1,  and  the  cosmological  constant  is  denoted  by  A. 

Contemporary  observations  are  consistent  with  k  =  0,  and  in  this  situation,  we 
have: 


H2{t)  _  SnGp(t)  A  _  p(t)  A 
HHio)~  3Hq  +3hI~^  +  3hI’ 


(15.27) 
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Fig.  15.9  Models  of  the  expanding  universe  Schematic  representation  of  various  cosmological 
models  showing  the  size,  or  scale  factor  R (f),  of  the  expanding  universe  as  a  function  of 
time,  t.  The  approximate  age  since  the  expansion  began  is  given  by  1  /H0  where  H0  is  the  Hubble 
constant.  Since  the  expansion  age  was  once  thought  to  be  smaller  than  the  age  of  the  oldest  rocks 
on  Earth,  Georges  Lemaitre  (1894-1966)  and  Arthur  Eddington  (1822-1944)  independently  used 
a  cosmological  repulsion  term  with  Einstein’s  General  Theory  of  Relativity  to  permit  an 
adjustable  age  for  the  universe.  The  adjustable  term  is  symbolized  by  a  non-zero  value  for  the 
cosmological  constant  A.  In  this  interpretation  ( thick  solid  line),  the  universe  began  with 
expansion  against  gravity,  followed  by  an  essentially  non-moving  stagnation  in  which  gravitation 
and  cosmological  repulsion  were  nearly  in  balance.  This  coasting  period  was  then  followed  by  an 
accelerated  expansion  driven  by  the  cosmological  repulsion.  Three  models  with  no  cosmological 
constant,  or  with  A  =  0,  describe  three  future  possibilities  for  the  universe  with  no  dark  energy 
(thin  solid  lines).  It  can  become  closed  with  positive  space  curvature,  forever  open  with  negative 
space  curvature,  and  always  open  and  no  curvature  of  space.  In  recent  times,  this  last  option,  of 
never  ending  expansion  in  space  without  curvature,  has  been  modified  by  a  non-zero 
cosmological  constant  to  give  a  boost  to  some  age  estimates  and  permit  an  accelerated  expansion 
by  a  mysterious  dark  energy  (dashed  lines) 


1.0  x  10-26  kg  m-3,  (15.28) 

s'1  Mpc-1,  h  «  0.75,  and  1 

Mpc  =  3.0857  x  10iV  km. 

Once  the  cosmological  term  has  grown  large  enough,  it  dominates  the  right  side 
of  the  expansion-parameter  equation  and  we  have 


3  Hi 


pc=-^-=  1.879  x  10-26  h2  kgm 
87 zG 
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with  the  Hubble  constant  H0  =  100  h  km 


R(t)  « 


A 


(15.29) 
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which  has  the  solution 


m  oc  exp 


exp  (Hct), 


(15.30) 


where  the  Hubble  expansion  parameter  H(t)  has  become  a  constant  Hc ,  and  the 
universe  enters  an  exponential  expansion. 

The  other  Friedmann  equation  that  involves  the  pressure  P  becomes: 


R(t)  _  d2R(t)/dt 

W)  ~  W) 


4nG 

~3cr 


{p  ( t)c 2  +  3 P)  +  —  ■ 


(15.31) 


This  equation  implies  that  with  large  positive  cosmological  constant  A  the  term 
d  R{t)/dt  becomes  positive  and  the  expansion  of  the  universe  accelerates,  as 
opposed  to  deceleration  that  could  occur  without  such  a  term. 

The  present  value  of  the  deceleration  parameter  q0  for  any  value  of  A  is  given  by 


qo 


R(t0) 


Ac2 


(15.32) 


where  the  pressure  is  now  negligible  and  H0  =  H(t0 )  =100  h  km  s  1  Mpc  1  is 
the  Hubble  constant  and  h  «  0.75. 

The  power  of  the  cosmological  constant  is  measured  from  its  density  parameter 
Qa  given  by 


Ac2 

QA  =  QA(t0)  = —J ,  (15.33) 

3/i0 

while  the  matter  density  parameter  is  given  in  its  usual  way 

Um  =  Um(r0)  (15.34) 

Pc 

More  than  a  decade  of  observations  of  type  SNe  la  has  confirmed  that  their 
redshift  -  magnitude  relation  goes  non-linear  at  large  redshifts,  permitting  astron¬ 
omers  to  measure  parameters  that  describe  the  unexpected  effect.  The  initial  results, 
obtained  in  1998,  were  interpreted  by  a  total  matter  density  parameter 
Qm  =  0.28  =b  0.085,  for  invisible  and  visible  matter  assuming  a  cosmological  con¬ 
stant  of  A  =  0  (Perlmutter  et  al.  1998),  but  it  was  soon  realized  that  the  universe  was 
expanding,  propelled  by  a  dark  energy  that  could  be  interpreted  in  terms  of  a  non¬ 
zero  cosmological  constant  (Riess  et  al.  1998).  Ten  years  later,  data  for  hundreds  of 
SNe  la  were  used  to  suggest  a  cosmological-constant  density  factor  of  Qa  = 
0.713  zb  0.027  for  a  flat  universe  with  a  space  curvature  constant  of  k  =  0  (Kowalski 
et  al.  2008).  When  combined  with  other  data,  tight  limits  of  Qm  =  0.237  zb  0.010 
were  also  realized  (Fig.  15.10)  and  the  previous  Table  15.2  of  WMAP  results). 
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Fig.  15.10  Dark  mass  and  dark  energy  constraints  Three  independent  sets  of  observations 
provide  constraints  to  the  mass  and  energy  content  of  the  universe.  Studies  of  high-redshift  Type 
la  supemovae,  designated  SNe,  constrain  the  difference  between  the  density  of  matter  and  the 
density  of  dark  energy  in  the  universe.  This  diagram  illustrates  the  results  from  the  Union  2.1 
compilation  of  833  SNe  drawn  from  19  datasets  from  2008  to  2011.  Anisotropies  in  the  cosmic 
microwave  background  radiation,  denoted  CMB,  constrain  their  sum.  Investigations  of  B  ary  on 
Acoustic  Oscillations,  abbreviated  BAO,  detected  in  data  from  the  Sloan  Digital  Sky  Survey 
provide  other  constraints.  Here  the  densities  are  given  in  terms  of  the  omega  parameter,  denoted 
by  Q.  It  is  the  ratio  of  the  inferred  density  to  the  critical  mass  density  needed  to  stop  the 
expansion  in  the  future.  The  subscript  A  denotes  the  cosmological  constant,  a  possible  form  of 
dark  energy,  while  the  subscript  m  denotes  matter.  Galaxy  observations  indicate  a  mass  density  of 
at  most  Qm  =  0.3.  The  theoretical  expectation  of  an  inflationary  universe  without  spatial 
curvature  requires  Qm  +  QA  =  1.0,  and  significant  dark  energy  with  QA  =  0.7.  The  observations 
are  consistent  with  such  a  flat  universe,  described  by  ordinary  Euclidean  geometry.  [Courtesy  of 
the  Supernova  Cosmology  Project,  whose  compilation  of  580  SNe  is  available  at  http:// 
supemova/lbl.gov/  -  also  see  N.  Suzuki  et  al,  “The  Hubble  Space  Telescope  cluster  supernova 
survey:  V.  Improving  the  dark  energy  constraints  above  z  >  1  and  building  an  early-type-hosted 
supernova  sample”,  Astrophysical  Journal  746,  85  (2012).] 
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As  also  previously  mentioned,  recent  Planck  mission  results  indicate  Qm  =  0.317 
and  klA  =  0.683  also  with  a  flat  universe  of  Qm  +  Q^  =  1.000. 

The  SNe  la  redshift-magnitude  diagram,  the  WMAP  fluctuations  in  the  cosmic 
microwave  background  radiation,  and  other  astronomical  observations,  have  been 
combined  to  show  that  Qa  =  0.728  d=  0.015,  and  Q mh2  =  0.1334  =b  0.0156  with 
h  «  0.75,  so  Qm  +  Qa  «  1  with  a  space  curvature  constant  k  =  0  and  non-curved 
Euclidean  space.  In  this  condition,  at  any  time,  t, 

Qm  +  Qa  =  Q  m(t)  +  QaW  =  1-  (15.35) 

Allen  et  al.  (2011)  have  reviewed  the  determination  of  cosmological  parameters 
from  observations  of  galaxy  clusters.  Leibundgut  (2001)  has  reviewed  the  cos¬ 
mological  implications  of  observations  of  Type  la  supernovae;  and  Branch  (1998) 
has  reviewed  Type  la  supernovae  and  the  Hubble  constant  . 


15.6.3  When  Stars  Cease  to  Shine 

Ever  since  the  discovery  of  the  expansion  of  the  universe,  we  have  known  that  the 
universe  is  slowly  and  inexorably  approaching  an  end.  There  has  never  been  any 
known  force  that  can  prevent  the  observable  universe  from  steadily  moving  into 
darkness.  As  Georges  Lemaitre  (1894-1966)  so  eloquently  stated  “The  evolution 
of  the  world  can  be  compared  to  a  display  of  fireworks  that  has  just  ended:  some 
few  red  wisps,  ashes,  and  smoke.  Standing  on  a  well-chilled  cinder,  we  see  the 
slow  fading  of  the  suns,  and  we  try  to  recall  the  vanished  brilliance  of  the  origin  of 
the  worlds  (Lemaitre  1931a,  1931,  1950).” 

Dark  energy  has  been  found,  but  that  discovery  may  not  help  matters.  It  gives 
an  extra  outward  push  to  the  expansion  and  further  reduces  the  power  of  mass  to 
stop  the  expanding  universe  in  the  future.  If  the  acceleration  caused  by  dark  energy 
continues  unabated  at  the  current  rate,  all  the  galaxies  will  be  moving  apart  so 
quickly  that  they  cannot  communicate  with  one  another  in  about  150  billion  years, 
disappearing  over  the  cosmic  horizon. 

Eventually,  in  about  100  trillion  years,  all  of  the  interstellar  gas  and  dust  from 
which  new  stars  condense  finally  will  be  used  up,  and  new  stars  will  cease  to  form 
in  any  galaxy.  But  if  dark  energy  weakens  as  time  goes  on,  expending  its  strength, 
then  gravity  and  mass  may  take  over  and  eventually  pull  back  on  the  outward 
moving  galaxies,  ultimately  reversing  the  expansion  and  dragging  the  universe 
back,  melting  it  down  and  remaking  the  big  bang.  So  no  one  knows  for  sure  just 
how  it  will  all  end. 
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Appendix  I 
Constants 


Constant 

Symbol 

Value 

Universal  constants 

Speed  of  light  in  vacuum 

c 

2.9979  x  108  m  s_1 

Universal  gravitational  constant 

G 

6.674  x  1CT11  N  m2  kg~2 

Planck  constant 

h 

6.674  x  ltT11  m3  kg-1  s~2 
6.6261  x  ltT34  J  s 

Electric  constant 

£o 

8.8542  x  1CT12  F  m_1 

Magnetic  constant 

ho 

1.2566  x  10”6  N  A~2 

Thermal  radiation 

Boltzmann  constant 

k 

1.38065  x  10~23  J  K'1 

Stefan-Boltzmann  constant 

G 

5.6704  x  10~8  J  s'1  rrT2  KT4 

Radiation  density  constant 

a 

7.5657  x  10“16  J  m“3  K“4 

Wien  wavelength  displacement  law  constant 

b 

2.8978  x  10~3  m  K 

Atomic  and  nuclear 

Electron  mass 

me 

9.1094  x  10-31  kg 

Elementary  charge 

e 

1.6022  x  10-19  C 

Classical  electron  radius 

re 

2.8179  x  10~15  m 

Thomson  scattering  cross  section 

G  t 

6.65246  x  10“29  m2 

Atomic  mass  unit 

mu  =  u 

1.660539  x  10“27  kg 

Proton  mass 

mp 

1.6726  x  10-27  kg 

Neutron  mass 

mn 

1.6749  x  10"27  kg 

Alpha  particle  mass 

m* 

6.644656  x  10'27  kg 

Bohr  radius 

a0 

5.2918  x  10'11  m 

Rydberg  constant 

Roo 

10,973,731.5685  m“‘ 

Sun 

Mass  of  the  Sun 

M© 

1.989  x  1030  kg 

Luminosity  of  Sun 

Le 

3.828  x  1026  J  s-1 

Radius  of  Sun 

«© 

6.955  x  108  m 

Expanding  universe 

Hubble  constant 

H0 

75  km  s'1  Mpc-1 

Age  of  expanding  Universe 

to 

13.7  x  109  years 

(continued) 
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Appendix  I  Constants 


(continued) 


Constant 

Symbol 

Value 

Cosmic  microwave  background  radiation 

Temperature 

Tcmb 

2.725  K 

Photon  density 

N CMB 

4.10  x  108  m~3 

Anisotropy 

AT /Tcmb 

1.1  x  10“5 

The  physical  constants  are  accurate  to  the  fourth  decimal  place.  For  greater  accuracy  with  the 
latest  values  consult  http://physics.nist.gov/cuu/constants/index.html 


Appendix  II 
Units 


Unit 

Symbol 

Value 

Distance  and  length 

o 

Angstrom 

o 

A 

1CT10  m 

Meter 

m 

1  km  =  103  m  =  0.621371  mile 

Mean  Earth-Sun 

AU 

1  mile  =  1.60934  km 

1.49598  x  10“  m 

distance 

Light-year 

iy 

9.460528  x  1015  m  =  63,239.67  AU 

Parsec 

pc 

3.08568  x  101S  m  =  3.26164  light- 

Megaparsec 

Mpc 

years  =  206,265  AU 

10s  pc 

Angle 

Degree 

o 

60'  =  3600" 

(The  symbol  '  denotes  minutes  of  arc, 

1  =  0.0174532925  radians 

1"  =  4.8481368  x  10~6  radians 
the  symbol  "  designates  seconds  of  arc) 

Radian 

rad 

2.06265  x  105  " 

Pi 

71 

57.2957795°  =  360°/(27t) 

3.141592654 

Time 

Solar  day 

day 

24  h  =  86,400  s 

Sidereal  day 

sidereal  day 

23  h  56  m  04.09  s  =  23.9344696  h 

Year 

tropical 

365.25  solar  days  =  3.15576  x  107  s 

Energy,  power,  force, 
Joule 

year 

pressure 

J 

107  erg 

Electron-volt 

ev 

1.6018  x  10-19  J 

Power 

Watt 

J  s-1 

Force 

N 

kg  m  s-2 

Pressure 

Pa 

N  m-2 

(continued) 
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Appendix  II  Units 


(continued) 


Unit 

Symbol 

Value 

Mass 

Kilogram 

kg 

1,000  g,  1  metric  ton  =  103  kg 

Solar  units 

Mass  of  the  Sun 

M0 

1.989  x  1030  kg 

Luminosity  of  Sun 

L0 

3.828  x  1026  J  s_1 

Radius  of  Sun 

R@ 

6.955  x  108  m 

Appendix  III 
Fundamental  Equations 


Angular  resolution,  0n  of  a  telescope  of  diameter,  DT ,  at  a  wavelength,  2: 


6r  =  —  radians  (A-l) 

Dj 

where  1  radian  =  2.06265  x  105  seconds  of  arc  =  2.06265  x  105  " . 

Angular  source  extent,  0size,  of  a  celestial  source  of  radius,  R ,  located  at  a 
distance,  D : 


2  R 
~D 


radians, 


(A-2) 


where  1  radian  =  2.06265  x  105  seconds  of  arc  =  2.06265  x  105  " 

Wavelength,  2,  frequency,  v,  and  speed  of  light,  c : 


2  x  v  =  c,  (A-3) 

o  _ 1 

where  the  speed  of  light  c  =  2.9979  x  10°  m  s  . 

Photon  energy,  E ,  of  radiation  at  frequency  v: 

E  =  hv  (A-4) 

_ nA 

where  the  Planck  constant  h  =  6.6261  x  10  J  s. 

Stefan-Boltzmann  law  for  luminosity,  L,  of  thermal  radiator  with  effective 
temperature  Tef  and  radius  R: 

L  =  4naR2T^ff  (A-5) 

_ O  1 

where  n  =  3.14159  and  the  Stefan-Boltzmann  constant  (7  =  5. 6704  x  10  Js 
m-2  K“4. 


Apparent  magnitude,  absolute  magnitude,  and  luminosity  for  a  star.  Any 
apparent  magnitude,  m,  can  be  converted  to  absolute  magnitude,  M,  through  the 
simple  formula: 
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absolute  magnitude  =  M  =  m  +  5  —  51og  D,  (A-6) 

where  D  is  the  distance  in  parsecs,  and  1  parsec  =  3.26164  light- 

1  6 

years  =  3.08568  x  10  m.  The  absolute  magnitude  can  be  converted  into  a 
luminosity,  L,  using: 

log(T)  =°.4(M0-M),  (A-7) 

or 

L  =  1O°'4(M0_M)L0,  (A-8) 

where  the  absolute  magnitude  of  the  Sun  in  the  visual  range  of  wavelengths,  where 
it  is  most  intense,  is  MQ  =  +4.83  and  the  absolute  luminosity  of  the  Sun  is 
L0  =  3.828  x  1026  J  s'1.  Notice  that  the  symbol  M0  is  used  to  denote  both  the 
absolute  magnitude  of  the  Sun,  which  is  used  here,  and  the  mass  of  the  Sun,  used 
in  other  equations.  The  Sun  has  an  apparent  magnitude  of  m0  =  —26.74. 

Wien  displacement  law  for  wavelength  2max  of  maximum  intensity  for  a 
thermal  radiator  at  temperature  T: 

„  0.002898 

/max  = - meters. 


(A-9) 


Radiant  flux,  /,  or  apparent  brightness,  of  an  object  of  luminosity  L  at 
distance  D: 


/ 


L 

4n  D2' 


(A- 10) 


Gravitational  force,  /  <;.  between  two  masses,  M\  and  M2  separated  by  a 
distance  D  between  their  centers: 


GM[Mi 

D2  ’ 


(A-ll) 


where  the  universal  gravitational  constant  G  =  6.674  x  10  11  nr5  kg 


-l 


Kepler’s  third  law  for  the  orbital  period  P  of  a  binary  system  of  mass  Mx  and 
M2  separated  by  distance  a: 


4n2 

G(Ml+M2)a 


(A-12) 


Appendix  III  Fundamental  Equations 


613 


where  the  universal  gravitational  constant  G  =  6.674  x  10  11  nr5  kg 


-l 


Jeans  mass,  My,  for  a  spherical  gas  cloud  of  radius,  R ,  and  temperature,  T: 


3kT 

Gm 


R . 


(A-13) 


_ o  o  _ 1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  ~  J  K  ,  the  universal 

_  11  Q  _  1  _ 

gravitational  constant  G  =  6.674  x  10  m  kg  s  ,  and  m  is  the  gas 
particle  mass.  Gravitational  collapse  occurs  if  the  cloud  mass,  M,  is  greater  than 
the  Jeans  mass,  Mj. 

Escape  velocity,  Vesc,  at  a  distance  R  from  a  mass  M: 


2  GM\  1/2 


(A- 14) 


_  11  Q  _  1  _ r\ 

where  the  universal  gravitational  constant  G  =  6.674  x  10"11  m  kg  s  . 

Doppler  effect  for  a  change  Akin  the  wavelength  k  due  to  a  radial  velocity  Vr 
of  a  source  moving  away  from  observer: 

A/l  k observed  ^ emitted  Vr  ,  x 

t - = - t - =  —  for  Vr  <  c,  (A-15) 

Z' emitted  ^ emitted  C 

O  _  1 

where  the  speed  of  light  c  =  2.9979  x  10°  m  s  . 

Parallax  the  annual  parallax  nA ,  of  a  star  at  distance,  D ,  is: 

7 z a  =  AU/D  radians,  (A- 16) 

where  1  AU  =  1.49598  x  1011  m  and  1  radian  =  2.06265  x  105  s  of  arc.  When 
the  parallax  is  given  in  units  of  seconds  of  arc,  then  the  distance,  D,  is  given  by: 

D  =  —  parsecs,  (A- 17) 

71a 


where  1  parsec  =  3.26164  light  years  =  206,265.8  AU. 

Gravitational  potential  energy  of  a  mass,  M,  with  radius,  R: 


Gravitational  potential  energy 


GMZ 

~R~ 


(A-18) 


_  11  Q  _  1  _ r\ 

where  the  universal  gravitational  constant  G  =  6.674  x  10_ii  nU  kg"1  s  . 
Kinetic  energy  of  mass,  m,  moving  at  velocity,  V : 


.  .  1  9 

Kinetic  energy  =  mV  . 

2 


(A-19) 


Thermal  energy  at  temperature,  T\ 


Thermal  energy  =  -kT . 

2' 


(A- 20) 
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_ O  Q  _  1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  . 


Thermal  velocity,  Vthermah  of  a  particle  of  mass,  m,  at  temperature,  T: 


V thermal 


3kT 

"3kr 

/  m 

m 

1  1/2 


(A-21) 


_ O  O  _  1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  . 

Gas  pressure,  PG ,  of  particles  of  number  density,  N,  and  temperature  T: 

PG  =  NkT ,  (A-22) 

_ oo  _ 1 

where  the  Boltzmann  constant  k  =  1.38065  x  10  J  K  . 


Radiation  pressure,  Pn  for  a  temperature  T. 

aT4 
P,  = - 


where  the  radiation  constant  a  =  1 .5657  x  1(T16  J  m~3  K-4. 
Magnetic  pressure,  PB ,  of  a  magnetic  field  of  strength,  B : 

B2 
Pb 


2po ' 


(A-23) 


(A-24) 


_ z:  _ o 

where  the  magnetic  constant  =  1.2566  x  10~°  N  A-  . 

Energy  radiated,  dE,  by  a  mass  loss,  dm,  during  nuclear  reactions: 

AE  =  Am  c2,  (A-25) 

o  _ 1 

where  the  speed  of  light  c  =  2.9979  x  10°  m  s  . 

Schwarzschild  radius,  Rsch ,  of  a  mass,  M,  of  radius  R : 

tfsch  =  ^  =  2.95  x  103  (JQ  m,  (A-26) 

_  11  O  _  1  _ O 

where  the  universal  gravitational  constant  G  =  6.614  x  10  m  kg  s  and 
the  Sun’s  mass  M0  =  1.989  x  1030  kg. 

Hubble  law  for  the  recession  velocity  Vr  of  a  galaxy  at  a  distance  D : 


Vr=H0x  D,  (A-21) 

where  the  Hubble  constant  H0  =  75  km  s-1  Mpc-1  and  1  Mpc  = 
3.08568  x  1022  m. 
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galaxy  formation,  542 
Milky  Way,  485,  486 
spiral  galaxies,  488,  491 
star  formation,  543 
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sidereal,  17 
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Decoupling  time,  539 
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Degenerate,  425 

Degenerate  electron  gas,  423-427 
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Degenerate  neutron  pressure,  465 
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Density  waves,  483 
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Milky  Way,  479-482 
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cosmic  rays,  202 
dark  energy,  554-555 
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galaxies,  487-492 
gamma-ray  bursts,  552 
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pion,  208 
positron,  206 
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white  dwarf  stars,  418,  419 
X-ray  pulsars,  460,  461 
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galaxies,  496 
Proxima  Centauri,  53 
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Distance  modulus,  302 
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Doppler  line  broadening,  188 
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Earth 

age,  198,  216 

atmospheric  pressure,  141,  142,  145 
equatorial  radius,  12 
escape  speed,  100,  102 
flattening  factor,  12,  79 
gravitational  redshift,  181,  182 
gravity,  79 
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local  acceleration  of  gravity,  79 
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magnetosphere,  283-285 

mass,  75,  80 

mass  density,  75 

mean  radius,  1 1 
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Eddington  limit,  462,  463 
Eddington  luminosity,  312,  313 
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planets,  56,  57 
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Emission  lines,  159,  160 
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planetary  nebulae,  413,  414 
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radio  galaxies,  546 
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thermal,  132 
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Equation  of  energy  conservation,  333 
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Equation  of  radiative  energy  transfer,  333 
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Sun,  100,  102 
Escape  velocity,  77 
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Milky  Way,  492 
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neutron  star,  451 
planet,  139 
star  cluster,  111 
Sun,  100,  101 
Eskimo  Nebula,  416 
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Ever-expanding  universe,  521 
Exclusion  principle,  424 
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atmospheres,  410 
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flat  multi-planet  systems,  408 
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hot-Jupiters,  408 
mass,  405,  406 
orbital  distance,  406 
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recombination  time,  539,  540 
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Expansion  velocity,  185 
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gravitational,  74 
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red  giant  star,  341 
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radiation,  35,  36 
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distance,  476-478 
super-massive  black  hole,  484,  485 
Galaxies,  487-510 
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clusters,  500-504 
collision,  508 
cosmic  web,  511 
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discovery,  487-491 
distance,  494,  496 
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first,  540-544 
formation,  543-544 
giant  elliptical,  550,  551 
gravitational  lens,  504,  506 
look-back  time,  512 
luminosity,  492-498 
luminosity  function,  497 
motion,  492,  508,  510 
number  density,  492,  498 
peculiar  motions,  508 
physical  properties,  492 
redshift,  494 
starburst,  543 
streams,  508 

super-massive  black  holes,  484,  485, 
547-55 1 
voids,  510 
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Galaxy,  471-488 
Gamma-ray  bursts,  552 
afterglow,  552 
discovery,  552 
luminosity,  553 
supernovae,  554 
Gamma  rays,  39,  38 
Gamow  energy,  229 
Gas  pressure,  141,  142,  264,  330 
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Geosynchronous  orbit,  103 
Giant  elliptical  galaxies,  super-massive  black 
holes,  542,  546 

Giant  molecular  clouds,  385-391 
free  fall  time,  397 
gravitational  collapse,  391 
physical  properties,  388 
Giant  planets,  383 
Giant  stars,  306,  321 
formation,  336 
pulsation,  327 
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Globular  star  clusters,  111,  114,  343-345, 
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Cepheid  variables,  474 
distribution,  477 

Hertzsprung-Russell  diagrams  ,  347 
spheroid,  478 
stellar  motions,  116 
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Granulation,  249,  250 
Gravitation,  74 

Gravitational  binding  energy,  78 
Gravitational  collapse,  389-395 
Kelvin-Helmholtz  time,  216 
triggering,  392 

Gravitational  constant,  30,  75 
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Gravitational  force,  74 
Gravitational  lens,  505,  506 
clusters  of  galaxies,  503 
galaxies,  504,  505 
Gravitational  potential  energy,  78 
Gravitational  radiation,  97 
Gravitational  radius,  468 
Gravitational  redshift,  181,  182 
Sun,  181 
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Gravitational  waves,  97 
Gravity,  73 
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cause,  93 
Earth,  80 
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Great  Wall,  510 
Greenhouse  effect,  58,  59 
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Ground  state,  173 
Gyration  radius,  371 
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H  I  regions,  375,  377 
equilibrium,  392,  393 
physical  properties,  378 
H  II  regions,  357,  361 
equilibrium,  392 
physical  properties,  361 
Habitable  zone,  57,  409 
Hale  Telescope,  21 
Heat,  130 
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Helioseismology,  123,  251 
Heliosheath,  276 
Heliosphere,  275 
Helium  atom,  129 
Helium  burning,  340-343 
temperature,  342 
Helium,  129,  165,  534 

big-bang  nucleosynthesis,  534 
discovery  in  Sun,  164 
physical  properties,  129 
Herschel  Telescope ,  25 
Hertzsprung-Russell  diagram,  318-320 
protostars,  396 
star  clusters,  343-347 
Higgs  boson,  212 
H inode,  25 
HIPPARCOS,  8,  25 

Hertzsprung-Russell  diagram,  319,  320 
Oort’s  constants,  480 
star  density,  478 
star  distances,  295 
stellar  proper  motions,  111 
Historical  supernovae,  434 
Home  stake  neutrino  detector,  240 
Hooker  Telescope,  21 

Horizontal  branch,  Hertzsprung-Russell  dia¬ 
gram,  347 
Hot-Jupiters,  408 

H-R  diagram.  See  Hertzsprung-Russell 
diagram 

Hubble  constant,  493-497 

Hubble  expansion  parameter,  519,  558 

Hubble  law,  493-499 

Hubble  Space  Telescope  (HST),  25,  402,  476, 
496,  497,  501,  505,  509,  516 
Hubble  time,  512 
Hydrogen,  126 
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B aimer  lines,  166 
big-bang  nucleosynthesis,  534 
interstellar,  375-377 
most  abundant  element  in  most  stars,  179 
physical  properties,  129 
Hydrogen  alpha  line,  167 
Hydrogen  burning,  231-239,  335-340 
Hydrostatic  equilibrium,  330 

I 

Ideal  gas  law,  142 
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Inflation,  523 

Inf raRed  Astronomical  Satellite  (IRAS),  401 
Infrared  radiation,  39 
discovery,  60 

Initial  mass  function,  stars,  317 

Instability  strip,  Hertzsprung-Russell  diagram, 
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Interferometer,  23,  24,  306 
Interferometry,  23 
stars,  307 

International  Atomic  Time,  15 
International  celestial  reference  system,  6 
Interplanetary  magnetic  spiral,  289 
Interstellar  Boundary  EXplorer  (IBEX)  ,  276 
Interstellar  dust,  367,  368 
reddens  starlight,  304 
Interstellar  hydrogen  atoms,  375-378 
Interstellar  molecular  hydrogen,  380 
Interstellar  molecules,  378,  379 
radio  radiation,  379 
Interstellar  water,  379 
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Inverse  square  law,  74,  299 
gravity,  74 
light,  299 
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abundant  atoms,  178 
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temperature,  151 
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James  Webb  Space  Telescope  (JWST),  26 

Jeans  density,  391 

Jeans  length,  389 

Jeans  mass,  389,  390 

Jeans  radius,  391 

Joule,  54 

Julian  century,  15 

Julian  Date,  13 

Julian  year,  15 

K 

Kamiokande  neutrino  detector,  240 
Kapteyn  Universe,  477 
Kapteyn’s  star,  velocity,  110,  111 
Keck  Telescopes,  21 
Kelvin-Helmholtz  time,  216 
Kepler  mission,  409 

Kepler’s  first  law  of  planetary  motion,  69,  70 
Kepler’s  harmonic  relationship,  71,  73 
Kepler’s  second  law  of  planetary 
motion,  69,  70 

Kepler’s  third  law,  Newtonian  expression,  103 

Kepler’s  third  law  of  planetary  motion,  71,  72 

Kiloparsec  (kpc),  29 

Kinetic  energy,  100,  130,  132 

Kinetic  temperature,  133 
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Large  Binocular  Telescope,  21 

Large  Hadron  Collider,  212 

Large  stars,  306 

Latitude,  23 

Leptons,  242 

Lifetime 

main  sequence,  323-326 
nuclear  fusion  in  Sun,  219 
nuclear  reaction,  219 
protostars,  395-397 
radioactive  atom,  196-199 
synchrotron  radiation,  374 
Light  bending,  95,  96 
Light  travel  time,  294 

astronomical  unit  (AU),  28,  52 
Light-year,  29,  294 
Line  broadening 
expansion,  184 
rotation,  184 
thermal  motion,  183 
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Zeeman  effect,  183,  187-190 
Line  splitting,  Zeeman  effect,  183,  187-190 
Local  hour  angle,  19 
Local  sidereal  time,  17 
Longitude,  2,  3 
Look-back  time,  galaxies,  512 
Lorentz  contraction,  43 
Lorentz  factor,  41 
Lorentz  transformation,  41 
Luminiferous  ether,  41 
Luminosity,  298 

accretion,  463,  464 
Eddington,  463 
galaxies,  491,  498 
gamma-ray  bursts,  552 
Milky  Way,  471-486 
radio  pulsar,  456,  457 
stars,  299 

Sun,  27,  54,  298-300 
synchrotron  radiation,  374 
Luminosity  class,  stars,  321,  323 
Luminosity  function,  galaxies,  497 
Lunik  2,  269 
Lunisolar  precession,  13 
Lyman  transition,  171 
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Magnetic  brakes,  Sun  rotation,  386,  387 
Magnetic  cloud,  from  Sun,  291 
Magnetic  constant,  31 
Magnetic  energy,  393 
Magnetic  fields 
cosmic,  189 
Earth,  189 
energy  density,  157 
interstellar,  188 
neutron  stars,  456 
radio  pulsars,  457,  458 
solar  wind,  190 
Sun,  258,  289 
sunspots,  188 
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Magnetic  pressure,  141,  264 
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entry  to  magnetosphere,  288 
solar  flares,  278,  279 
Magnetic  waves,  156,  158,  267 
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Magnetosphere,  283-285 
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dark  matter  Milky  Way,  487 
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H  II  region,  365 
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Milky  Way,  479,  482 

spectroscopic  binary  stars,  316-318 

stars,  306 

Sun,  26,  80 

super-massive  black  hole  galactic  center, 
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super-massive  black  holes,  484 
upper  limit  for  stars,  313 
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Mass  distribution,  stars,  544 
Mass-energy  equivalence,  220 
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solar  wind,  273 
supergiant  stars,  309,  348 
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Maxwell  speed  distribution,  135,  136 
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Mean  free  path,  134,  135 
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M  87,  super-massive  black  hole,  550 
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precession  of  perihelion,  93,  94 
rotation,  116,  117 
Mesosphere,  146 
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star  clusters,  111 
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Neutrinos,  200,  237 
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supernova,  441 
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Crab  Nebula,  446-449 
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Nova  Aquilae  1918,  429 
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origin,  429-432 
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Nova  Herculis  1934,  429 
Nuclear  bombardment,  209 
Nuclear  fission,  212 
Nuclear  force,  127 
Nuclear  reactions 
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supergiant  stars,  348-349 
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atomic,  127 

binding  energy,  226,  227 
mass  defect,  226 
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Number,  galaxies,  497 
Number  density,  galaxies,  502 
Nutation,  13,  14 
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Optically  thick,  66 
Optically  thin,  66 
Optically  visible  radiation,  38 
Orbital  motion,  101 
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thermal  velocity,  132,  133 
Pion,  discovery,  208 
Planck  constant,  30,  45 
Planck  mission,  532 
Plane  waves,  34 
Planetary  nebulae,  412-417 
age,  415 

bright  named,  417 
central  stars,  414 
emission  lines,  413,  414 
formation,  415 
physical  properties,  414 
Planetary  rings,  origin,  90 
Planet-forming  disks,  400-402 
Planets 

albedo,  58 

atmospheric  molecules,  138 
composition,  382,  383 
effective  temperature,  55,  56 
fastest  rotation,  119 
habitable  zone,  57 
magnetospheres,  285,  286 
motion,  69 
oblate  shape,  118 
orbital  speed,  102 
rotation,  116-120 
speed,  69,  70 
temperatures,  56-58 
Plasma,  149,  150 
Plasma  frequency,  152 
Plasma  oscillations,  152 
Plerions,  448 
Plurality  of  worlds,  400 
Polaris,  6,  10 

Polarization,  starlight,  368,  369 
Pole  Star,  11,  12 
Polonium,  193 
Population  I  stars,  354 
Cepheids,  476 
Milky  Way,  476 
variable,  475 
Population  II  stars,  354 
globular  star  clusters,  477 
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variable,  475 
Population  III  stars,  354 
Positive  beta  decay,  201,  207 
Positron,  201 
discovery,  206 

Power,  nuclear  reactions,  223 
Poynting-Robertson  effect,  141 
Precession,  11-13 
Pressure,  141-145 
cosmic  range,  142 
Earth’s  atmosphere,  142,  145 
gas,  140,  142 
magnetic,  141 
radiation,  141 
wind,  141 

Prime  Meridian,  2,  16 
Principia,  74 
Principle  of  Relativity,  41 
Procyon  B,  327 
Proper  motion,  105,  107 
Proper  time  interval,  469 
Protons,  127 

electrical  repulsion,  221 
physical  properties,  129 
Proton-proton  chain,  232-234,  335,  338 
Proto-planetary  disks,  400 
Protostars,  395,  396 

Hertzsprung-Russell  diagram,  395 
lifetime,  388,  389 
Proxima  Centauri,  29,  319 
distance,  53,  295,  301 
luminosity,  301 
radius,  301,  307 
temperature,  301 
velocity,  119 

Pulsar,  Crab  Nebula,  449,  450 
Pulsar.^?*?  Radio  pulsar  or  X-ray  pulsar 
Pulsars,  452-454,  456,  458,  460,  461 
discovery,  452,  453 


Q 

Quanta,  44 

Quantum  mechanics,  168,  197,  424 
atom,  165,  166,  168 
degenerate  gas,  423-427,  452 
radioactive  decay,  197 
Quantum  tunneling  effect,  in  Sun,  222 
Quarks,  127 
Quasars,  549 
discovery,  548 
emission  lines,  549 
Quasistellar  radio  sources,  549 


R 

Radar,  astronomical  unit,  53 
Radial  velocity,  105-107,  180 
Radian,  9,  10 
Radiation,  33 
absorption,  65 
blackbody,  44 
emission  coefficient,  65 
frequency,  35,  36 
scattering,  62,  63 
thermal,  44 
wavelength,  35 
Radiation  constant,  49 
Radiation  era,  expanding  universe,  536 
Radiation  pressure,  141,  330 
stars,  311,  313 

Radiation  temperature,  big  bang,  526 
Radiation  transfer,  65 
Radio  galaxies,  546,  547 
emission  lines,  546 
synchrotron  radiation,  547 
R136al,  massive  star,  312 
Radio  pulsars,  452,  453,  455,  456,  458,  460 
age,  457 

Crab  Nebula,  449,  450,  458 
luminosity,  457 
magnetic  held  strength,  457 
periods,  454 
physical  properties,  454 
rotational  energy,  457 
Radio  radiation,  64 
bending  by  Sun,  95 
discovery,  59 
Milky  Way,  369,  375 
scintillation,  453 
supernova  remnants,  444 
Radio  telescope,  23 
Radioactive  alpha  decay,  194 
Radioactivity,  193 
constant,  196 
dating,  198 
nuclear  half-life,  197 
Radium,  193 
Radius 

Arcturus,  308 
atom,  128 

atomic  nucleus,  128 
Betelgeuse,  307 
Earth,  11 

emission  nebula,  360,  365 
gyration,  187 
H  II  region,  360,  365 
Milky  Way,  476,  478 
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neutron  stars,  451 
observable  universe,  516 
Proxima  Centauri,  315 
stars,  305 
Sun,  10,  28,  53 
universe,  558 

Radius-mass  relation,  white  dwarf  stars,  428 
Radon,  194 
Random  motion,  131 
Rate,  star  formation,  542 
Rayleigh  scattering,  62 
Rayleigh  scattering  cross  section,  62 
Rayleigh-Jeans  law,  46,  47 
Reaction  rate,  nuclear  fusion,  230 
Recombination  lines,  170,  363 
Recombination  radiation,  361,  362 
Recombination  time,  expanding  universe,  539 
Reddening,  starlight,  368 
Red  giant  stars,  306,  320,  326 
formation,  341 
mass  loss,  328 
winds,  328 
Redshift,  109,  180 
galaxies,  494 

Redshift-magnitude  diagram,  Type  la  super¬ 
novae,  555 

Redshift-magnitude  relation,  495 
Reflector,  19,  20 
Refractor,  19,  20 

Reionization  time,  expanding  universe,  540 

Resolving  power,  21 

Resonance  reactions,  343 

Rest  mass,  43 

Rest-mass  energy,  43 

Right  ascension,  5 

Rings,  origin,  90 

Robertson-Walker  metric,  518 

Roche  limit,  90-92 

Roche  lobe,  432 

Root  mean  square  speed,  137 

Rosette  Nebula,  359 

Rotation 

cosmic,  116 
differential,  120 
Earth,  2 

Mercury,  116,  117 
Milky  Way,  478-481 
neutron  star,  455 
radio  pulsar,  456,  458 
stars,  120,  398,  399 
Sun,  121,  385,  386 
velocity,  184 


Venus,  118 
white  dwarf  stars,  456 
Rotational  line  broadening,  184 
Runaway  stars,  115-118 
Runaway  thermonuclear  explosion,  white 
dwarf,  432 

Rydberg  constant,  167 
atomic,  172,  173 
Rydberg  energy  unit,  169 

S 

Sagittarius  A*,  484,  485 
Saha  ionization  equation,  177 
Satellites,  threat  from  Sun,  291 
Scattering 

Compton,  64 
radiation,  64 
Rayleigh,  61 
Thomson,  63 

Schwarzschild  metric,  468 
Schwarzschild  radius,  467,  468 
Scintillations,  453,  454 
Second  generation  stars,  354 
Second  law  of  thermodynamics,  266 
Second  of  arc,  9,  10 
Seeing,  22 

Seyfert  galaxies,  545 
emission  lines,  545 
Sgr  A*,  484,  485 
SI  units,  30 

conversion  to  c.g.s  units,  30 
Sidereal  day,  17 
Sidereal  rotation  period,  121 
Sidereal  time,  17,  18 
Singularity 

big  bang,  523 
black  hole,  469,  470 
Sirius  A,  luminosity,  302 
Sirius  B,  327,  419 

gravitational  redshift,  421 
luminosity,  302 

Sloan  Digital  Sky  Survey,  7,  510,  511 
Sloan  Great  Wall,  510,  511 
SN  1987A,  439 

neutrino  burst,  441 
progenitor  star,  441 
visible  explosion,  441 
Solar  active  region,  261 
SOlar  and  Heliospheric  Observatory  (SOHO)  , 
251 

Solar  constant,  55,  300 
Solar  day,  15,  18 
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Solar  flares,  276-280 

accelerated  particles,  277 
chromospheric  evaporation,  280 
energy,  276 

magnetic  reconnection,  278,  279 
nuclear  reactions,  279 
origin,  277 
positrons,  280 

protons  -  threat  to  humans,  290 
radio  bursts,  277,  278 
shock  waves,  277 
temperature,  276 
white  light,  276 
X-rays,  278,  279 
Solar  granulation,  249,  250 
Solar  magnetic  activity  cycle,  259 
Solar  nebula,  381,  383,  394 
Solar  neutrino  problem,  240-244 
Solar  neutrinos,  detectors,  239 
Solar  neutrinos,  flux,  238,  241 
Solar  neutrino  unit,  240 
Solar  parallax,  51,  52 
Solar  system,  formation,  381-387 
Solar  time,  15,  17 
Solar  wind 

comet  tails,  268 
discovery,  268 
mass  loss  from  Sun,  273 
physical  properties,  272 
speed  of  sound,  156,  272 
steady,  uniform  high  speed,  274 
supersonic,  271 
termination  shock,  275,  276 
variable,  gusty  slow  speed,  274 
Solstice,  8 
Sound,  154 
Sound  speed,  155 
Sound  waves,  154 

heat  chromosphere,  267 
in  Sun,  251,  252 
Space  curvature,  94 
Space  curvature  index,  518,  520 
Space  Telescopes,  25 
Space  velocity,  105,  106 
Space  weather,  283-292 
forecasting  ,291,  292 

Special  Theory  of  Relativity ,  42,  43,  220,  468 
Spectral  classification,  stars,  306 
Spectral  sequence,  305,  306 
Spectroheliograph,  159,  162 
Spectroscopic  binary  stars,  315,  317 
Spectroscopic  distance,  stars,  322 
Spectroscopic  parallax,  stars,  322 
Spectrum 


blackbody,  44 
continuum,  44 
electromagnetic,  37,  38 
nonthermal  radiation,  370 
thermal  radiation,  46 
Speed,  71 

escape,  77,  99-101,  113,  139,  451 

light,  14,  28,  30,  35,  41 

mean,  137 

most  probable,  135 

orbital,  102 

planets,  70,  71 

root  mean  square,  137 

sound, 155 

Speed  distribution  function,  136 
Speed  of  light,  14,  28,  30,  35,  41 
Speed  of  sound,  155 

Sphere,  gravitational  potential  energy,  76 
Spheres  of  ionization,  360 
Spiral  arms,  Milky  Way,  482,  483 
Spiral  galaxy,  491 

dark  matter,  490,  491 
Spiral  nebulae,  491 

Cepheid  variable  stars,  488,  489 
distance,  489 
recession  velocities,  492 
Spitzer  Space  Telescope  (SST)  ,  25,  401,  496, 
509,  516 

Spontaneous  transition  coefficient,  175 

Spring  ocean  tides,  83 

Sputnik,  153 

Standard  epoch,  13 

Stars 

age  and  rotation,  123,  124 
angular  size,  306 
brightest,  296,  297 
central  temperature,  219,  329 
Cepheid  variable,  474-476 
chemical  composition,  334 
classification,  305 
color,  303 

convection,  339,  340 

convective  zones,  340 

effective  temperature,  303 

fate,  310 

first,  540-544 

first  generation  ,  354 

formation,  387-400,  543-544 

formation  rate,  543 

gravitational  redshift,  181,  182,  421 

habitable  zone,  409 

initial  mass  distribution,  544 

initial  mass  function,  317 

interferometry,  307 
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Stars  ( cont .) 
largest,  306 
luminosity,  299 
luminosity  class,  321 
mass,  310 

mass  distribution,  544 
mass-luminosity  relation,  310,  311 
most  abundant  element  hydrogen,  179 
most  massive,  312 
motion,  105 

Population  I,  II  and  III,  354 
pulsation,  327,  475 
radiation  pressure,  312 
radius,  306 
rapid  rotation,  124 
red  giant,  306,  320,  327,  328 
rotation,  120-124,  399 
second  generation,  354 
spectral  classification,  305 
spectroscopic  distance,  322 
spectroscopic  parallax,  322 
supergiant,  306,  320,  327,  328 
support  by  gas  pressure,  330 
support  by  radiation  pressure,  330 
temperatures,  303 
upper  mass  limit,  313 
variable,  475 
white  dwarf,  320,  327 
Star  catalogues,  6 
Star  clusters 

escape  velocity,  113 
Hertzsprung-Russell  diagrams,  347 
Star  time,  17,  18 
Starburst  galaxies,  543 
Starlight,  367,  368 

interstellar  absorption,  367 
polarization,  368,  369 
reddening,  368 

Stefan-Boltzmann  constant,  49 

Stefan-Boltzmann  law,  49,  309,  318 

Stellar  aberration,  14 

Stellar  associations,  115 

Stellar  black  holes,  465-469 

Stellar  nucleosynthesis,  350,  352 

Stellar  rotation,  124 

Stratosphere,  145 

Streams,  galaxies,  508 

Stromgren  radius,  360,  364 

Sudbury  Neutrino  Observatory,  244,  245 

Sun 

absolute  visual  magnitude,  300 
absorption  lines,  159 


abundance  of  elements,  351 
active  region,  261 
age,  198,  217 
angular  extent,  8 
central  gas  pressure,  218 
central  mass  density,  218 
central  temperature,  217 
composition,  160,  161 
convection,  245,  248 
convective  zone,  246,  247 
core,  244,  245 
deflection  of  starlight,  94 
differential  rotation,  120 
distance,  51,  52 
effective  temperature,  56 
escape  velocity,  100,  101,  270 
explosions,  276 
fate,  253,  254 

Fraunhofer  absorption  lines,  161-163 
free  fall  time,  397 
gravitational  redshift,  181 
internal  mass  density,  148,  150 
internal  plasma,  153 
internal  pressure,  148,  150 
internal  rotation,  123,  124 
internal  structure,  245 
internal  temperature,  148,  150 
light  bending,  95,  96 
luminosity,  28,  55,  298,  300 
magnetic  field,  258,  259 
mass,  28,  80 

mass  loss  by  nuclear  fusion,  236 

mass  loss  by  solar  wind,  273 

most  abundant  element  hydrogen,  179 

most  abundant  elements,  164,  165 

neutrinos,  238-243 

nuclear  fusion  lifetime,  236 

ocean  tides,  81 

physical  properties,  219 

radiation  diffusion  time,  247 

radiative  zone,  246,  247 

radius,  9,  10,  28,  54 

red  hydrogen  alpha  transition,  176 

rotation,  120,  386 

rotation  velocity  about  galactic  center,  479, 
481 

secondary  nuclear  fusion  reactions,  235 
spectrum,  161 
temperature,  55,  56 
Thomson  scattering,  247 
Sunspots,  258 
cycle,  258-260 
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Sunspots  ( cont .) 
origin,  263 
pairs,  261 
Zeeman  effect,  188 
Sun  time,  15,  17 
Superfluids,  neutron  stars,  452 
Supergiant  stars,  306,  320,  327-329 
mass  loss,  309,  328 
nuclear  reactions,  347-349 
winds,  309,  310,  328 
Supergranulation,  250 
Super-Kamiokande  neutrino  detector,  243 
Super-massive  black  holes,  550,  551 
galactic  center,  484,  485 
mass,  550 
M  87,  550 

Supernovae  ,  433-444 

acceleration  of  cosmic  rays,  203 
binary  star  system,  437 
energy,  442 
gamma-ray  bursts,  552 
gravity-powered  iron-catastrophe,  436, 
438,  439 
historical,  434 
neutrinos,  441 
nuclear  powered,  436,  437 
origin,  436-439 
origin  of  elements,  341,  344 
remnants,  443-450 
triggering  gravitational  collapse,  394 
types,  435 

Supernova  remnants,  443-450 
radio  radiation,  444 
types,  444 
X-rays,  443,  445 
Synchronous  rotation,  85 
Synchrotron  radiation,  372-375 
Crab  Nebula,  447-449 
lifetime,  374 
luminosity,  374 
Milky  Way,  374,  375 
radio  galaxies,  546,  547 
Synodic  rotation  period,  122 

T 

Tangential  velocity,  106-108 
Tau  lepton,  discovery,  211 
Tau  neutrinos,  242,  243 
Telescope,  20 

resolving  power,  21 
Temperature 

accreting  material,  463 


background  radiation,  526,  531 

big  bang,  526 

center  of  star,  219 

center  of  Sun,  217 

corona,  256 

cosmic,  131 

emission  nebula,  365 

H  II  region,  365 

helium  burning,  342 

kelvin  scale,  28 

kinetic,  133 

nuclear  reactions,  221,  222 
planets,  56,  57 
stars,  303 
star  centers,  329 
Sun  visible  disk,  55 
Terrestrial  planets,  383 
Terrestrial  time,  16 
Thermal  bremsstrahlung,  364 
Thermal  energy,  132 
Thermal  equilibrium,  44 
Thermal  evaporation,  140 
Thermal  motion,  Doppler  line  broadening,  183 
Thermal  radiation,  44 
maximum  intensity,  46 
Thermal  spectrum,  46 
Thermal  velocity,  132,  139 
Thomson  scattering,  63 
cross  section,  63 
in  Sun,  246 

Three-degree  cosmic  microwave  background 
radiation.  See  Background  radiation 
Tidal  friction,  85 
Tidal  locking,  85 
Tides,  80 
Time 

civil,  15 

dilation,  42,  469 
free  fall,  397 
international  atomic,  15 
local  sidereal,  17 
sidereal,  17,  18 
solar,  15,  17 
star,  17,  18 
terrestrial,  16 
universal,  16 
Time  dilation,  42 
gravitational,  469 

Transition  Region  and  Coronal  Explorer 
(TRACE)  ,  262 
Troposphere,  145 
Tunneling  cross  section,  230 
Two  Degree  Field  Galaxy  Survey,  510,  511 
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Tycho  supernova  remnant,  444 
Type  la  supernovae,  436,  437 

redshift-magnitude  diagram,  555 
Type  II  supernovae,  435,  438,  439 

U 

Ultraviolet  radiation,  38,  39 
Uncertainty  principle,  424 
Universal  gravitation,  73 
Universal  time,  15 

Upper  mass  limit,  white  dwarf  star,  429 
Uranium,  194,  195 
Urea  process,  442 

V 

Van  Allen  radiation  belts,  287 
Velocity,  71 
Alfven,  157 
expansion,  184 
radial,  105-107,  180 
rotation,  184 
space,  105-107 
tangential,  105-107 
thermal,  132 

Velocity-distance  relation,  495 
Venus 

parallax,  52 
rotation,  118 
Vernal  Equinox,  4 

Very  Large  Telescope  Interferometer,  308 

Very  Large  Telescopes,  21 

Virial  theorem,  111 

Visual  binary  stars,  314,  315 

Vogt-Russell  theorem,  334 

Voids,  galaxies,  510 

Voyager  1,  215 

Voyager  2,  275 

Vulcan,  93 

VY  Canis  Majoris,  mass  loss,  309 
VY  Cephei  A,  mass  loss,  309 

W 

Walls,  galaxies,  510 
Water,  interstellar,  379 
Wavelength 
colors,  38 
radiation,  35 
Waves 

Alfven,  157 
electromagnetic,  33 


Weight,  74 

White  dwarf  stars,  320,  327,  418-428 
carbon  and  oxygen  nuclei,  419 
discovery,  418 
formation,  415 
former  red  giant  star,  419 
gravitational  redshift,  181,  421 
in  short-period  binary  systems,  431 
magnetic  held  strength,  422 
mass  density,  420 
physical  properties,  423 
radius,  420 

radius-mass  relation,  427 
rotation  period,  455 
supernova,  435,  437 
upper  mass  limit,  429 

Wien  displacement  law,  46 

Wien  tail,  46 

Wilkinson  Microwave  Anisotropy  Probe 
(WMAP),  530-532 

Wind  pressure,  141 

Winds 

Betelgeuse,  308 
Sun,  268-276 

supergiant  stars,  307,  309,  328 

X 

X-rays,  38 

black  holes,  466,  467 
clusters  of  galaxies,  504 
Crab  Nebula,  447-449 
discovery,  192 
pulsars,  460 
solar  hares,  278,  279 
supernova  remnants,  444 

X-ray  pulsars,  460-466 
discovery,  460,  462 
physical  properties,  463 

Y 

Year  (yr),  15,  29 

Ylem,  353 

Young  stars,  rapid  rotation,  124 

Z 

Zeeman  effect,  183,  187,  188 
interstellar,  188 
Sun,  258,  261 
sunspots,  188 

Zero  age,  main  sequence,  332 


